OK so I put this under accessible technology for reasons that you will learn.
I have recently been playing around with the new ChatGPT ai agent and my fellow blind folks, this is something else entirely! I am having it use the back end of Wix to design and format an entire website that Iāve been wanting to build. Itās created the text, itās created the images, itās place things in the proper location, it is uploading Audio files to the website for me, itās changing all of the fonts to make it look appealing to the eyes, itās changing layouts to what Iāve prompted it to do, this is nuts. This pretty much makes every single web design platform or website accessible if it can take the actions on your behalf.
Comments
is it free?
Hi, Is it free? or not. and can it work with, say, emailing a friend?
iād love a podcast on this
iāve tried creating websites before with minimal success, so I would really love to know how you get ChatGPT to communicate with wix.
@ JC
No you either get it with the plus or pro plan of ChatGPT.
@ xenacat3
I wish I had time to do podcasts because this one wouldāve been a great one!
So what happens is chat GPT opens up its own internal computer, and you tell it what you wanted to do. You do have to provide login information so ChatGPT can go login for you and do what youāve asked it to do. For example, Iāve been giving it quite extensive prompts in regards to color schemes for the website, creating search engine optimization, writing and inserting a terms of service, adding files to the website and my goodness it has been 6 hrs of omg lol. It will also tell you what it is doing and how it is doing it as it navigates webpages, clicks on buttons, and makes selections based off of your instructions. Thereās way too much to put in hereā¦
@ @ xenacat3 Bookmark
It will also go through and see if thereās any sort of design flaws, it will give you feedback and then you can tell it whether or not you want it to improve what it had suggested or if youāve changed your mind and completely wanted to do something else.
@ xenacat3
The coolest thing is watching the system problem solve. It does move slower, but this only came out about a week ago so I donāt expect it to be as fast as the latest ChatGPT model but this thing has reasoning, problem-solving, navigation abilities etc etc.
My initial thoughts
I had access to the ChatGPT Agent before my plan expired, and honestly, at first it was difficult to even log into the portal. I remember trying to use it literally the second day it launched, and I needed extra help to get into the interface. Iām hearing a lot of mixed reviewsāsome users say itās accessible, others say it isnāt. When I started using it, I went in with the expectation that Iād be able to book flights, pay the bills, schedule meetings, do a bit of shopping, maybe even order food. But in reality, it wasnāt able to handle most of those things. To be fair, I havenāt tested it extensively, especially since my plan has expired now. I keep meaning to renew it and mess around with it more once I have the free time. From what I experienced, it mostly felt like an upgraded version of the memory feature. That seems to be the main use case right nowāhandling files like documents, scheduling reminders, creating calendar events. But Iām wondering: can it do anything with media, like videos?
One important thing to remember is that when it comes to ChatGPT Agents, a lot depends on which websites are integrated or onboard. For example, I thought Iād be able to order groceries from Amazon through the agent, but that didnāt happen. Amazon, being the giant that it is, doesnāt really want third-party systems getting involved in their operations. So because of that limitation, those kinds of shopping features arenāt availableāat least not with the bigger names like Amazon, Walmart, or Target. That said, I think thereās still potential here. If smaller independent retailers and grocery stores are smart, they could partner or integrate with systems like this and carve out a space. That might be the route forward. Iāve seen people doing some impressive tasks with the Agentāwriting code, designing flyers, formatting documents, creating spreadsheets, prepping PowerPoint presentations. I have no idea how theyāre doing all that, but it looks like thereās a lot of potential for power users. Still, for me personally, I found it a little underwhelming compared to what I was hoping for. Itās also a bit tricky to get the hang of.
I would love to see a full walkthrough or demo of what this thing can actually do. Iām surprised they havenāt done one yet, but maybe itās because itās still earlyāonly been out for a couple of weeks, if I remember right. Ultimately, I think it needs more time to mature and really live up to the hype. But if someone wants to try it now just to get a sense of where itās starting and compare that to where itās heading later on, $20 isnāt a bad priceāif you can afford it.
@ Winter Roses
honestly, it just uses its own internal computer and accesses the website like you would if you were accessing it yourself. I mean, it works with the files, audio, video, but the coolest thing is it navigating the web and taking actions on your behalf. Those actions itās taking obviously is based on your instructions. Using it as fully accessible. I use NVDA. To be fair though I have not tried it with jaws. Itās as accessible as a regular ChatGPT window. Iāve posted it here because Iāve just been using it to completely design an entire website from scratch. You do have to ensure your prompting properly and your prompts are as detailed as possible.
@ Winter Roses
If for some reason, it canāt access the website, it does problem-solving and it works around that and it finds a solution to access the site.
Iām recording a little demo for you guys now.
Iām going to do my best at recording a little demo. Iām not the most entertaining so bear with me. lol.
What ChatGPT AI agent can do for you
ChatGPT AI agent can assist with a wide range of tasks. It can generate text, answer questions, write and rewrite copy, translate languages, summarize articles, brainstorm ideas, produce outlines or scripts, and even help with coding tasks. It can interact with websites and applications on your behalf, reading content aloud or filling in forms, making technology more accessible. Because it's conversational, you can refine the results by asking followāup questions until you get what you need. This makes ChatGPT an extremely versatile tool for productivity and accessibility.
donāt mind that last comment
I actually was doing a demonstration for you guys on audio on how the ChatGPTAI agent can work. I just essentially had it navigate to the Applevis website under my login information and it posted it under my account. Here is the audio file.
https://www.dropbox.com/scl/fi/y0ah9bogluanro9yh8vkm/audio1798085360.m4a?rlkey=j0es6fliaa6oq3u8bgnis3c99&st=55htxlsd&dl=0
Winter Roses
I've been playing arount with it for a little while, and I've made it add stuff to my amazon shopping cart, book me movie tickets on bookmyshow, so... Haven't tried booking a flight because I haven't had to yet.
and @JC certainly you can make it email your friend so long as you are ready to give it your email cridencials.
Generally speaking, it's truly a game-changer as far as web accessibility is concerned... Should try it with, say, wordpress or youtube...
Happy path
One of the problems with current large language models is that they get relatively good at tackling the so-called happy path in coding problems, which is when all the interactions are easily predictable, but fail to tackle even trivial edge cases sometimes, potentially resulting in security problems. Furthermore they are also prone to hallucinate, and this problem has actually been getting worse lately, with consequences like misspelling dependencies that don't really exist, opening a window of opportunity for bad actors to register them and perform supply chain attacks similar to typo squatting for humans. Beyond this there are also code quality problems, in which the AI tends to generate extremely verbose solutions to problems that experienced programmers can solve a lot more efficiently, which makes the generated code unnecessarily much harder to reason about.
All the above combined results in a huge pile of bloated code with lots of technical debt, skyrocketing costs from token usage, and since the time and memory complexity of context windows increases quadratically with their size, it's not even that hard for a medium-sized codebase to hit resource limits so the whole thing is extremely unsustainable. While I think it's perfectly possible to build hybrid models that take as much advantage of existing algorithmic solutions as possible to significantly improve their efficiency, and I have my own theories about them that I will start experimenting with soon, I think that doing so will require a huge paradigm shift that may not happen before the current AI hype bubble pops.
One potential time-bomb issue for this technology is a phenomenon in which training new AI models on the output of other AI is known to lead to model collapse due to a yet not understood increased tendency to hallucinate, which is becoming a problem given the proliferation of AI-generated content on the Internet, and might already be adversely affecting the latest frontier models significantly. This content is often called AI slop mostly because it's easy to generate without providing much in terms of actual value.
loggin sessions?
If using this requires me to share my login credentials with ChatGPT, then thatās a definite no from me. Even if thereās a way to do it manually, Iād want to know if the process is accessible.
Until those concerns are addressed, I think Iāll be steering clear of it.
Also, once you're logged in to a service through ChatGPT, how do you end that session? Is it as simple as deleting the conversation?
Using chat gpt agent on iphone
Hey guys has anyone used chat gpt agent on the iphone and if so what has been your experiences?
Is it easier to use it on the computer or can you use it on iphone too.
Other things that people have tried with Chat GPT agent.
Hi guys.
So with chat GPT agent what else have people tried to do with it.
Using ChatGPT agent on the iPhone
Hi guys, so I wanted to let you know that I finally activated a ChatGPT plan to try out the Agent. What I attempted to do was to log into this website to post a commentāsimilar to the example shown above. Unfortunately, when using the iPhone, that doesnāt seem to be fully possible. I turned on Screen Recognition and was able to confirm that the username and password fields were on the screen, but they werenāt accessible with VoiceOver. If this is a bug or an accessibility oversight, it needs to be reported to OpenAI so it can be addressed as soon as possible.
On the bright side, ChatGPT did successfully manage to navigate to the website and locate the login page, which worked well. I was also able to type my username and password directly into the chat, and the agent was able to enter those details and log me in. That said, from what Iām seeing so far, you have to be extremely specific with your instructions. In some cases, you need to know exactly what youāre looking for in order to get the results you want. For example, I wanted to post a comment on this specific post, but I couldnāt remember the exact name or title. That ended up confusing the model a bit, so maybe websites with a clearer structure or layout might work better. I realized that ChatGPT doesnāt automatically recognize that a post is about itself, which makes sense, but it means youāll need to be extra clear when giving instructions on sites with dynamic content.
Let me see if I can explain this a little clearer. So imagine youāre on a virtual supermarket website. You decide that for breakfast today, you want a box of Cocoa Puffs, a bottle of Pepsi, and a loaf of bread. Now, on these virtual supermarket shelves, ChatGPT is scanning through categories like āCereals,ā āBeverages,ā and āBakery.ā If the Pepsi is sitting in the āRefrigerated Drinksā section or the bread is in āBakery,ā then ChatGPT will likely find those items pretty quickly because it knows where to look and what those categories typically mean. But letās say thereās another person who owns a completely different websiteālike Mary, who runs a baking site. She sells chocolate chip cookies. Now you say, āChatGPT, order me a box of chocolate chip cookies and a sugar-free glazed blackberry doughnut.ā If the doughnut section is clearly labeled or easy to access, the model might find it right away. But if Mary filed her cookies under something more abstract like āMary's Confectionariesā or āSweet Bites,ā ChatGPT might still be able to get thereāitāll just take a bit more time and work. Thatās the part Iām trying to highlight. For the model to be most effective, you need to be specific. The reason I couldnāt post my comment on the site was literally because I didnāt remember the title of the post, and I couldnāt recall which section it was under. If you donāt have a good mental layout of the website, it can be much harder for the model to perform the task, even if it gets you in the right general area.
It was able to locate the username and password fields easily because those are common across websites and clearly labeled. ChatGPT understands those elements wellāit knows, āThis is the login box, and this is where I need to input credentials.ā But if something is tucked away under an unusual label or section that isnāt visible on the screen directly, I donāt know how many places the model actually searches before it gives up or times out. Unfortunately, I didnāt get to explore that part much because, like a lot of people are discovering, thereās a time limit. Once you hit it, youāre no longer able to interact with the agent for the rest of the day, and I had already used up my window.
Right now, many of the more advanced features are limited. It looks like you only get 15 minutes per dayāor maybe per sessionāwith the browser, though Iām not entirely sure yet. I assumed Iād be able to talk to the agent hands-free in voice mode and have it carry out the tasks for me, but that doesnāt seem to be possible. I noticed that when the task is completed, my phone vibrates and I get a notificationāwhich is a nice touch. Itās definitely a bit slow, but thatās expected given that weāre still in the early stages. If someone were going to do a full review of the product, I imagine theyād need to edit the pause time or task to fit while the model processes everything in the background. Anyway, I couldnāt get it to post the comment, but this is only my first time using it. Iām assuming things will improve in the future as they continue building it out.
not too impressed with this
Hi,
So when using this on Windows, both through my browser and the desktop app,the virtual browser, the browser you can use to enter your username and password, plus takeover from the agent in general if you need to click something the agent won't do like a Captcha is totally inaccessible. I've tried with JAWS and NVDA, NVDA object navigation OCR, and the JAWS cursors and OCR but nothing works. And it won't go to amazon.co.uk or amazon.com at all, even if I tell it to go to this page without completing any task. There's a checkbox on audiogames.net that it won't click because it's a Captcha, and if I take over from the agent, I can't access the checkbox no matter what JAWS or NVDA commands I try. I mean I could give it my username and password for something, but I'd have to keep changing the password just in case it stores it and my security is compromised. I'd only give it my credentials to log into something if something was really inaccessible, but I'd be changing my password after logging out that's for sure.
@ Winter Roses
You only have it for 15 minutes? Thatās strange because I was using it for 6 hours yesterday editing my website and still have time left and Iām use the $20 a monthly but I might go to pro now. Iām loving this thing because it can work on my business while I work my regular job.
Pro and Accessibility
I was wondering too about how accessible interactions are, as it is using a VM. SO it seems Stephen is doing tasks that do not require him to interact with the virtual browser?...
As for usage, according to a chatGPT.com page, the Pro plan allows you 400 messages a month. So I guess try to pack those messages?
It is an interesting project for sure and I will keep monitoring it but need some more advances before it can help me with my job.
By the way, Claude has a similar agent but its not been in the news lately.
Answers and clarifications
When I was using the ChatGPT agent this morning, it disconnected, and I couldnāt get it to reconnect again. Iām pretty sure I saw a time and date saying when it would be working againāthough I could be totally wrong about that. But the second I saw the message, I instantly assumed the product was limited in some way, kind of like how the advanced voice feature is restricted. A lot of the more advanced features with ChatGPT seem to come with limitations, which makes sense. I mean, with the agent especially, itās pretty obvious whyāmany members are trying to use it, and the system needs to keep up and handle all those tasks efficiently. I donāt even think anyone using the free version is going to get access to the agent. If they do , itās gonna be extremely limited. So if I want to explore more of what it can do, Iām gonna need to play around with it some more when I have the time.
Now, regarding Amazon and shoppingābased on what Iāve been reading online, Amazon is not one of the supported shopping websites you can use through the ChatGPT agent. And again, this isnāt that surprising. Amazon has worked hard to become one of the biggest names in online shopping, and the last thing they want is some third-party AI stepping in and acting as a middleman. Theyāre not going to give that kind of access freely. My thinking is this: smaller businesses, if theyāre smart, will absolutely jump on this opportunity. If they can integrate with the agent, lower their prices, maybe offer free delivery or other perks to shoppersāthen I could see customers choosing to shop with them instead. This could be a major advantage for smaller vendors looking to grow. As for whether thereās an official list of supported shopping partners, Iām not sure we have this feature as yet, but it certainly seems like the next logical step in the chain of evolution based on current trends.
I havenāt played around with the agent enough to speak definitively on everything. But I do think it depends on what you already know. ChatGPT can browse the internet and get relevant info, sureābut the more you understand about the site youāre trying to use, how it works, and what to look for, the more effective it seems to be. Some tasks are always gonna be easier because theyāre direct and straightforward. Others, though, are going to be more obscure or ambiguousāand thatās probably where a lot of the confusion and inconsistency comes in.
I didnāt know that Claude had an agent-style product of its own. I might have to subscribe and check it out. Iāve never subscribed to any of Claudeās plans, and thatās mostly because Iā know the context windowālike how many messages you can send in a chatāis limited. Even on the paid plan, Iāve heard it fills up quickly. And instead of starting a new thread when you hit the limit, you only find out when your message doesnāt go through. Another thing I donāt like about Claude is that if Iām typing a message and I accidentally close the app or something interrupts me, the entire message disappears. Itās not like ChatGPT, which keeps the text in the box, so when you reopen it, your content is still there. Thatās one of those little actions that makes a big difference.
Donāt get me wrongāClaude gives grounded, logical responses. It's more human than ChatGPT in certain ways. But because of those limitations, Iāve been hesitant to give it a serious try. Iām going to take a closer look and do some research myself. My biggest issue with Claude has always been the censorship and restrictionsāitās more limited than ChatGPT in that sense. They're trying to be that āethical, moralā AI, but in doing that, they might be missing the mark a bit. Not trying to knock them too hardāthey do have a solid product. It just needs a bit of refinement⦠or loosening up.
A couple of things.
First off, I haven't noticed any time limits per session as such. The limitation however is that for plus users, there are 40 chats using agent per month. that's like 40 tasks. Also, the virtual browser, as some of you mentioned, is inaccessible. I guess for it to work, the screenreader providers will have to work with open ai to implement a sollution. That's why as of now, we will have to provide the login cridencials to the agent. What Claud has is Claud compute, which is arkitecturally different from gpt agent. agents generally creates a VM in the cloud, whereas what claud compute does is take over your computer which means it can also access your files etc in the computer.
I request that the thread title be changed.
This has explicit references to Christianity but I consider the expression to be not only a baseless and flawed assumption but also an intolerable accusation as a Muslim and ask that it be changed to something else regardless of my own stance but due to the fact that one has to either let others use such phrases like "Allahu akbar! I can't believe this or that happened!" or avoid using such phrases himself/herself.
Claude Compute
Hi,
If people want to try Claude Compute, the Guide AI Assistant for Windows uses it as their model.
https://www.guideinteraction.com/
It's about $8 a month at the moment, which I imagine is cheaper than Claude.
Replying To @Enes Deniz
Hi Enes,
We appreciate your feedback.
We want AppleVis to be a place where everyone, no matter their religious beliefs (including having no beliefs at all), is welcome. This includes allowing discussion of apps and technologies related to one's faith and ensuring that those discussions are free from harassment.
You gave the example of including a Muslim-specific reference in a post. Our position is that the use of phrases like "Allahu akbar! I can't believe this or that happened!" that you gave as an example, would be perfectly allowable. Were we to disallow all types of casual religious references in posts, this would set us on a very slippery slope.
Thanks,
Michael
Agree with your stance Michael
Hi,
I have no problem with people putting things like "Allahu akbar!" in their subject lines or posts as long as it relates to tech. If we can put things like 'holy mother of Mary as part of a subject line, then we should be able to put "Allahu akbar!
What would Batman do?
The subject line is my official stance on official subject lines.
Thank you, that is all. š
Okay...
Let me be honest and clear: I was targeted and offended just because of posting apps addressing muslims in particular. While I don't really assert or think everyone on Applevis defends the reactions to my posts or views them acceptable, I just want to know where exactly the line is drawn here. I'm not willing to interfere with others' posts so long as I can make sure that mine are tolerated, which I hope is the case despite some unfortunate exceptions that the majority appears to disapprove.
I always wonder...
Honestly I am wondering if the output, in this case the website was checked by sighted people as well. Sure, AI will tell you that it did what you asked it to do. But even Apple admits that there is at best a 72 % chance that the info AI gives is correct. Or in other words: Do not trust AI to do stuff for you which you can not a) do yourself and b) you are able to verify that the expected outcome has been reached.
I get why the hype regarding AI seems so amazing. But honestly: In most cases it is bloated machine learning which wastes so much water and energy. Creating a Website should not waste galones of water. Thanks to the fairly easy HTML you could write one yourself which would load lightning fast. And as a bonus: You learn how stuff works. If AI put in something that you did not want but everything else was perfect, you would have a hard time just getting this part out. Instead AI will attempt to rewrite the entire thing thanks to your new promt. That might change the complete page.
I am not against AI. But I am against the hype with more and more promises even though the last couple of releases did not work as advertised. Maybe one should take a step back and evaluate how much AI really does do correctly all of the time. Cause computers generaly are pretty good at doing the exact task over and over again without suddonly injecting other stuff no one was asking about. The only benefit I can see is that we have a better way to get pictures described to have an idea what they show. And even that will fail when you ask for detials. When you take a step back and reflect on this you might encounter how the output is currated under miserable working conditions for not a lot of money. In the end human labour is trying to correct for the flaws that are inherent in AI. That should in my opinion not be acceptable.
@ Enes Deniz
Ok so the subject line isnāt a statement of religion. Itās a form of reaction like when somebody says oh my God when something dramatic happens. Nobodyās expressing any form of religion here.
@ Dennis Westphal
According to my spouse, it did the website perfectly. Not only that, but it put the comment on Apple this perfectly too. When it comes to things like website design you really need to be specific in your prompting. While you could learn HTML, thatās not gonna help you when it comes to format and look of the website. While it may be functional to us blind users, it may not look visually appealing to the sighted population. If youāre running a business, you do kind of want to appeal to the majority.
Disagree with @Stephen and agree with @Dennis Westphal
What if someone is an atheist and denies the existence of the Creator entirely? Or I am a Muslim and oppose referring to the Creator as "God" as this implies a human-like/anthropomorphic, mythological character that is both the "father" and the "son" of himself and needs a father to be begotten and dies. And then there's the spirit. Sorry, I don't know much about what's called "Trinity", nor have I ever quite got how it's explained reasonably. But the main idea is that you would've referred to female gods as "goddesses" had they ever existed, and male gods as "gods", as the name suggests, had they ever existed. So one may believe in or practice whatever one desires, but I am equally free to deny it or state that certain phrases imply or refer to certain beliefs or practices. Anyway, back to topic... So AI will probably become a real "game changer" if you can fit an LLM inside a quantum computer and do everything locally without token limits or connection issues. You will just prompt the LLM to develop exactly the game or app you have in mind and it will be able to write the entire code within one response without getting interrupted. Currently, however, using AI even for simple tasks is overkill, let alone not even having AI involved in reality while claiming the opposite for manipulative purposes, or "marketing" purposes if expressed in a more "politically-correct" manner.
Get into the interface
How can I Even get into the Interface? I asked it to log in to a website and it told me to Type in the credentials Modell in a browser Window. Then there is a windows called Virtual browser. But how exactly can I See the website then? In the Virtual browser window it just Telfs me that I have Control over the Virtual browser and a button called Stop.
@ Enes Deniz
Huh? Although I may be confused on everything that I just read, this is not the place to be debating religion? Is that whatās going on? Iām soooooooooooo confused š¤. How about we talk about the topic at hand on this thread which is Chat GPT AI agent.
@ Jokyboy129
When I get it to log into websites, I just put the username in the password included in the prompt field along with the specific task, you would like it to do and then it should do it for you. The only thing it wonāt be able to do is captia if needed.
The virtual browser on ChatGPT isn't accessible with voiceover
The virtual browser on the website isnāt accessible. Yes, ChatGPT can manually type in my username and password for the site, but it canāt complete the CAPTCHAāwhich is understandable. That part is fine. The real issue is this: if I navigate to a website like American Eagle, Netflix, eBay, or Instacart, and I need to take over manuallyāwhether itās to read my messages, browse the site, or move around until Iām ready for ChatGPT to assistāIām stuck. Once I take over on the virtual browser, the screen is taken over by that interface. Using the screen recognition feature, I can tell Iām on the correct website, and I can even tell that the information is somewhere on the screenābut I canāt actually interact with it. Thatās a huge, huge accessibility issue. Until this is fixed, the ChatGPT agent is not as useful as it could be to blind users. Yes, Iāve already sent an email to ChatGPT, but I donāt know if or when this will be resolved. Hopefully they fix it, but you never know. ChatGPT can log into Instacart for me, sureābut it canāt let me take over and browse. I canāt explore the interface, go to the produce section, pick out the milk I want, then ask ChatGPT to help finish the checkout. That only works if I can interact with the siteāand right now, I canāt, unless I spell out every step, which completely defeats the purpose if I don't know the exact elements that are on the website.
@ Winter Roses
What? Itās most certainly not useless to blind users. You just really need to be specific as to what you want in your prompting. It has read messages to me and has done almost everything you mentioned above. The only barrier I have found with it is the captias. Otherwise itās helping me do quite a lot that normally I would need cited assistance for. This is one of those scenarios in regards to input versus output. The better your input, the better the output. Prompting with the agent is very sophisticated. And itās only gonna get more sophisticated.
My thoughts
Well then, with all due respect, I guess I must be sophisticatedly stupidābecause no matter how hard I try, it never seems to work in my favor. Iām getting 40 messages with the agent. And if I have to waste those messages trying to prompt the agent to do a task that I could easily do on my own through the browser, then thatās a complete waste in my book. I went on Project Gutenberg and had it read the first two chapters of Alice in Wonderland for me, and that worked pretty well. I wish the ChatGPT agent supported voice commands properly. If it worked with the voice mode, I could use the microphone to give instructions directlyābut Iām not sure if thatās a feature yet. From what I understand, the advanced voice feature is limited too, so Iām not sure how far Iād even get with that.
Yes, if I get stuck at a CAPTCHA, Iām done. If I need to manually interact with a website for any reason, I have to go back and forth with itāand that takes a lot of time. Maybe itās different on a computer, I donāt know. But on a phone, itās pretty slow, and tasks that should take seconds end up dragging for minutes. Sometimes it takes 2ā3 minutes to complete a task that shouldnāt take more than 15 seconds. Of course, I know that the product is new, so this is to be expected. For now, at least. I'm not holding this against the developers. This is one aspect I can confidently say will most likely be improved in the future.
Now Iām not saying the agent is completely useless, but the fact remainsāitās not as accessible as it could be. You said itās all about sophisticated prompting, but Iāve been using ChatGPT since it launched. Iāve seen most versions. Iāve paid for different features when I needed them. I know what Iām doing. So Iām not new to this spaceānot to ChatGPT, not to Gemini, not to any of these tools. Iāve been as specific as I possibly can. So no, I donāt believe the issue is on my end. The virtual browser isnāt accessible for certain tasks. If I try to take over the browser and interact with the elements myself, nothing happens. Thatās a serious accessibility issue. But heyādifferent strokes for different folks. Plus, Iām not sure if this is only happening to me, but whenever the agent disconnects for whatever reason, I canāt seem to find the option to reconnect within the app itselfāI always have to go to the website to reconnect, and it tends to disconnect quite a lot during sessions, so Iām not sure if itās different on the pro plan or what, but there are a few issues that need to be ironed out.
Think you have it backwards
You're kinda trying to sell the idea that a non-deterministic service can do a better job than a deterministic language that gives you full control over all visual aspects and is also produced by the aforementioned non-deterministic service, which makes absolutely zero sense. Just because the sight might look good doesn't mean it's not possible to accomplish the same or even much better writing the code yourself, and the fact that the AI itself has to express your intent in that code is irrefutable proof of that.
I'm totally blind and do both user interface design and computer graphics, taking advantage of having lived most of my life with sight as well as the fact that in the end it all boils down to math. You can definitely do at least just as well as the AI, it requires getting creative with your solutions, like investing on a graphics embosser like I'm on the verge of doing, but it's all within the reasonable realm of possibility. Just a couple of weeks ago I designed the logo for the international brand that I am in the process of registering in vector graphics, because I knew exactly what I wanted visually and I knew how to express that mathematically. However since I don't fully trust other people's opinions as they have failed to tell me about important visual details in the past, I really need to feel the visual stuff that I make, hence my plan to invest on a graphics embosser in the near future.