A ChatGPT plugin that drives a Puppeteer browser. The aim of the project is to provide a simple way to automate browser actions using ChatGPT.
It is experimental and still a work in progress. Follow me on Twitter for updates.
It performs actions step by step, and each step returns a map of the document's interactive elements, such as inputs and buttons. The idea is to let ChatGPT read the document without exceeding its input size limit, so there is no need to parse whole documents. The map also preserves the context of the page, so ChatGPT knows which button to click or which input to type into.
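To make the idea of an element map concrete, here is a minimal sketch. It is not the plugin's actual code: the descriptor shape (`tag`, `id`, `name`, `type`, `label`) and the function names `selectorFor` and `buildElementMap` are assumptions made for illustration. It only shows how a page could be reduced to a small, selector-keyed summary of its interactive elements.

```javascript
// Hypothetical sketch: condense element descriptors into a compact map.
// The descriptor shape ({ tag, id, name, type, label }) is an assumption,
// not the plugin's actual format.

// Pick the most specific selector available for an element.
function selectorFor(el) {
  if (el.id) return `#${el.id}`;
  if (el.name) return `${el.tag}[name="${el.name}"]`;
  return el.tag;
}

function buildElementMap(elements) {
  const interactive = new Set(["input", "button", "textarea", "select", "a"]);
  const map = {};
  for (const el of elements) {
    if (!interactive.has(el.tag)) continue; // skip non-interactive nodes
    // Note: elements that resolve to the same selector overwrite each other.
    map[selectorFor(el)] = {
      tag: el.tag,
      type: el.type || null,
      label: (el.label || "").trim().slice(0, 80), // cap label length
    };
  }
  return map;
}

const sample = [
  { tag: "div", id: "header", label: "Welcome" },
  { tag: "input", name: "q", type: "text", label: "Search" },
  { tag: "button", id: "submit", type: "submit", label: "Go" },
];
console.log(JSON.stringify(buildElementMap(sample), null, 2));
```

In a real browser session the descriptors would have to be collected from the live page first; the sketch starts from plain objects so it runs without Puppeteer.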
Eventually, users will be able to do basic web automations by writing directives like:
1 - go to duckduckgo.com
2 - type "hello world" into main text input
3 - click on the submit button
4 - exit
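In the plugin, ChatGPT itself interprets directives like these. The sketch below is only an illustration of the structured actions they correspond to; the `parseDirective` function, the `kind` field, and the assumption that bare hostnames get an `https://` prefix are all hypothetical and not part of the project.

```javascript
// Hypothetical sketch: map a numbered directive line to a structured action.
// The action shape ({ kind, ... }) is an assumption for illustration.
function parseDirective(line) {
  // Drop a leading "N - " step number if present.
  const text = line.replace(/^\s*\d+\s*-\s*/, "").trim();
  let m;
  if ((m = text.match(/^go to\s+(\S+)$/i))) {
    // Assumption: bare hostnames are treated as https URLs.
    const url = /^https?:\/\//i.test(m[1]) ? m[1] : `https://${m[1]}`;
    return { kind: "navigate", url };
  }
  if ((m = text.match(/^type\s+"([^"]*)"\s+into\s+(.+)$/i))) {
    return { kind: "type", text: m[1], target: m[2] };
  }
  if ((m = text.match(/^click on\s+(.+)$/i))) {
    return { kind: "click", target: m[1] };
  }
  if (/^exit$/i.test(text)) return { kind: "exit" };
  return { kind: "unknown", raw: text };
}

const directives = [
  "1 - go to duckduckgo.com",
  '2 - type "hello world" into main text input',
  "3 - click on the submit button",
  "4 - exit",
];
for (const d of directives) console.log(parseDirective(d));
```

The `target` values stay as free text ("main text input", "the submit button"), since resolving them to a concrete selector is the part left to ChatGPT and the element map.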
Clone the repository, then install the dependencies and start the server:
npm install
npm run start
Then load localhost:8008 in ChatGPT plugins.
It is currently able to:
- Navigate to a URL
- Click on a selector
- Type text into a selector
- Exit the browser
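The four capabilities above map naturally onto Puppeteer's `page.goto`, `page.click`, `page.type`, and `browser.close`. The following is a minimal sketch of such a dispatcher, not the project's actual implementation: the `runAction` function and the action shape (`kind`, `url`, `selector`, `text`) are assumptions, and stand-in objects replace a real browser so the example runs without Puppeteer installed.

```javascript
// Hypothetical sketch: dispatch one structured action to Puppeteer-style objects.
// The action shape ({ kind, url, selector, text }) is an assumption.
async function runAction(browser, page, action) {
  switch (action.kind) {
    case "navigate":
      await page.goto(action.url);
      return "navigated";
    case "click":
      await page.click(action.selector);
      return "clicked";
    case "type":
      await page.type(action.selector, action.text);
      return "typed";
    case "exit":
      await browser.close();
      return "closed";
    default:
      throw new Error(`Unsupported action: ${action.kind}`);
  }
}

// Stand-in objects that record calls instead of driving a real browser.
const calls = [];
const fakePage = {
  goto: async (url) => { calls.push(["goto", url]); },
  click: async (selector) => { calls.push(["click", selector]); },
  type: async (selector, text) => { calls.push(["type", selector, text]); },
};
const fakeBrowser = { close: async () => { calls.push(["close"]); } };

async function demo() {
  await runAction(fakeBrowser, fakePage, { kind: "navigate", url: "https://duckduckgo.com" });
  await runAction(fakeBrowser, fakePage, { kind: "type", selector: "input[name=q]", text: "hello world" });
  await runAction(fakeBrowser, fakePage, { kind: "click", selector: "button[type=submit]" });
  await runAction(fakeBrowser, fakePage, { kind: "exit" });
  return calls;
}
```

With Puppeteer installed, the stand-ins would be replaced by the objects returned from `puppeteer.launch()` and `browser.newPage()`. The selectors shown are placeholders rather than values taken from a real page.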
I'm open to any contribution. Please feel free to open an issue or send a pull request.
MIT