ChatGPT Deep Research
Use ChatGPT's Deep Research feature via browser automation to kick off research tasks, monitor progress, and extract structured results.
by skynet, v1.0.0
Tags: chatgpt, research, browser-automation, ai
Compatible Agents
claude-code
Required Tools
chrome-mcp
Instruction
# ChatGPT Deep Research
Use ChatGPT's Deep Research feature to conduct thorough research on any topic. Deep Research browses the web extensively and produces a detailed report with sources.
## Prerequisites
- Chrome MCP browser tools available (mcp__claude-in-chrome__*)
- Logged in to chatgpt.com on a Pro plan (James's account is pre-authenticated)
## Step 1: Navigate to Deep Research
```
1. Create a new tab or use existing: mcp__claude-in-chrome__tabs_create_mcp
2. Navigate to: https://chatgpt.com/deep-research
3. Read page to confirm — look for "Deep research" label and the text input
```
The Deep Research page has:
- A text input with placeholder "Get a detailed report"
- A "Sites" button to scope research to specific sites
- Previous reports listed at the bottom
## Step 2: Submit a Research Query
```
1. Read page with filter: interactive
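2. Find the textbox labeled "Chat with ChatGPT" (there are two; use the one with placeholder "Get a detailed report")
3. Type your research query with form_input; if the text does not appear (the input is a contentEditable div), click the input and use the computer "type" action instead, as described under Text Input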
4. Click the "Send prompt" button to submit
```
**Writing effective research queries:**
- Be specific about what you want to learn
- Include scope constraints (time period, geography, industry)
- Ask for comparisons if relevant
- Examples:
  - "Research the top 10 AI agent frameworks in 2025, compare their architectures, community size, and production readiness"
  - "Find all competitors to Lovable.dev for AI-powered app building, their pricing, features, and user reviews"
  - "Research the current state of browser automation tools for AI agents, including MCP-based solutions"
## Step 3: Monitor Progress
Deep Research takes 3-15 minutes. It shows progress as it browses.
```
1. Wait 30 seconds, then read the page
2. Look for progress indicators — it shows sites being visited and thinking steps
3. Do NOT interact with the page while research is running
4. Check every 30-60 seconds until you see the full report
5. The report is complete when you see source citations and a structured response
```
**Important:** Do not refresh or navigate away while research is running, even if it takes the full 15 minutes.
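When the wait is driven from a script (for example on the Mac Mini route) rather than by agent tool calls, the polling schedule above can be sketched as a generic retry loop. This is only a sketch: `poll_until` and the `report_ready` check mentioned below are hypothetical names, and how completion is detected is left to you.

```bash
# Hypothetical helper: run a check command repeatedly until it succeeds
# or the accumulated sleep time reaches the timeout.
# Usage: poll_until <interval_seconds> <timeout_seconds> <command> [args...]
poll_until() {
  local interval="$1" timeout="$2"
  shift 2
  local waited=0
  # The check runs first, so a report that is already done returns at once.
  while ! "$@"; do
    if [ "$waited" -ge "$timeout" ]; then
      return 1   # gave up: timeout reached without a successful check
    fi
    sleep "$interval"
    waited=$((waited + interval))
  done
  return 0
}
```

For example, `poll_until 45 900 report_ready` would check every 45 seconds for up to 15 minutes, where `report_ready` is a command of your own that exits 0 once the report is visible. Only sleep time counts toward the timeout, so slow checks stretch the real wait.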
## Browser Automation Tips
### Research Confirmation Step
After submitting a query, ChatGPT shows a research plan with steps and a "Start" button with a countdown timer. The research auto-starts when the timer reaches zero (~50 seconds). To start immediately:
```
1. Wait 2-3 seconds for the plan to appear
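2. The "Start" control may not respond to a standard click; if so, click it from JavaScript, or just wait for auto-start (the selector below is a guess; adjust it to the live DOM):
   Array.from(document.querySelectorAll('button, [role="button"]')).find(el => el.textContent.trim().startsWith('Start'))?.click()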
3. The countdown is visible next to the Start button (e.g., "Start 50", "Start 24")
```
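If you choose to wait for auto-start, the remaining time can be read from the button label. The helper below is a small sketch that assumes the label text has the form shown above ("Start 50", "Start 24"); `countdown_seconds` is a hypothetical name, and the label format may change.

```bash
# Hypothetical helper: extract the countdown from a label such as "Start 50".
# Prints the number of seconds, or returns 1 if the label has no countdown.
countdown_seconds() {
  local label="$1" rest
  case $label in
    Start*) rest=${label#Start} ;;   # keep what follows the word "Start"
    *) return 1 ;;
  esac
  rest=$(printf '%s' "$rest" | tr -d '[:space:]')
  case $rest in
    ''|*[!0-9]*) return 1 ;;         # empty or contains a non-digit
  esac
  printf '%s\n' "$rest"
}
```

A caller can sleep for the printed number of seconds plus a small margin before reading the page again.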
### Text Input
The chat input is a contentEditable div, not a textarea. Use click + type:
```
1. Click on the input area (the "Get a detailed report" placeholder)
2. Use computer action "type" to enter your query
3. Click the send button (blue circle arrow) — standard click works here
```
### Developer Mode
ChatGPT may show "Developer mode" and "Memory is not used for this chat" on the Deep Research page. This is normal — deep research runs without memory.
## Step 4: Extract the Results
Once the report is complete:
```
1. Use get_page_text to extract the full text content of the report
2. The report includes:
   - Executive summary
   - Detailed findings organized by topic
   - Source citations with URLs
   - Sometimes tables or comparisons
3. Save the report text for further processing
```
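Once the report text is saved, the cited URLs can be pulled out for independent checking. This is a rough sketch, assuming the output of get_page_text was written to a file and that citations appear as plain http(s) URLs in that text, which may not hold if the page shows only link labels. `extract_source_urls` is a hypothetical name.

```bash
# Hypothetical helper: read report text on stdin and print unique URLs.
# URLs are cut at whitespace, brackets, parentheses, angle brackets, and
# double quotes, so URLs that legitimately contain parentheses are truncated.
extract_source_urls() {
  grep -oE 'https?://[^][[:space:]()<>"]+' \
    | sed -E 's/[.,;:]+$//' \
    | sort -u
}
```

Usage, with `report.txt` standing in for wherever you saved the text: `extract_source_urls < report.txt > sources.txt`.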
## Step 5: Save and Reference
```
1. The research URL will be: chatgpt.com/c/<conversation-id>
2. Save this URL for future reference
3. Previous reports are listed on the Deep Research landing page
4. You can continue the conversation to ask follow-up questions
```
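To keep a compact record of finished reports, the conversation id can be split out of the URL format shown above. This is a minimal sketch; `conversation_id` is a hypothetical helper and it assumes nothing about the id beyond its position after `/c/`.

```bash
# Hypothetical helper: print the <conversation-id> part of a
# chatgpt.com/c/<conversation-id> URL, or return 1 for any other URL.
conversation_id() {
  local id
  case $1 in
    *chatgpt.com/c/*)
      id=${1##*chatgpt.com/c/}   # drop everything up to and including /c/
      id=${id%%[?#/]*}           # drop a query string, fragment, or extra path
      if [ -n "$id" ]; then
        printf '%s\n' "$id"
      else
        return 1
      fi
      ;;
    *) return 1 ;;
  esac
}
```

For instance, `conversation_id "$url" >> reports.log` appends the id to a log file of your choosing.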
## Using Sites Filter
To scope research to specific websites:
```
1. Before submitting your query, click the "Sites" button
2. Add specific domains to search (e.g., github.com, arxiv.org)
3. This focuses the research on authoritative sources
```
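The examples above are bare domains, so if your list of sites starts out as full URLs it may help to reduce them first. This sketch assumes the Sites dialog expects bare host names, which the source does not state explicitly; `to_domain` is a hypothetical name.

```bash
# Hypothetical helper: reduce a URL or host string to a lowercase host name.
to_domain() {
  local d="$1"
  case $d in
    http://*|https://*) d=${d#*://} ;;   # drop the scheme if present
  esac
  d=${d%%[/?#]*}                         # drop path, query, and fragment
  printf '%s\n' "$d" | tr '[:upper:]' '[:lower:]'
}
```

For example, `to_domain "https://GitHub.com/some/repo"` prints `github.com`.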
## Retrieving Previous Reports
```
1. Navigate to: https://chatgpt.com/deep-research
2. Scroll down to see the "Reports" section
3. Click on a previous report to view it
4. Use get_page_text to extract the content
```
## Tips
- Deep Research is best for complex, multi-faceted questions that need web browsing
- For simple factual questions, use regular ChatGPT instead
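- The feature is rate-limited by a monthly quota that depends on the plan (noted here as ~10 queries per month for Pro); OpenAI has adjusted these quotas over time, so confirm the current limit before queuing several queries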
- Results include source URLs which can be verified independently
- You can ask follow-up questions in the same conversation to dig deeper
## Running on Mac Minis
These instructions can also be executed on the Mac Minis (bots, vault, jarvis) using the mac-control MCP server instead of claude-in-chrome.
### Via mac-control MCP Server (Port 8200)
The mac-control server on each Mac exposes the same GUI automation primitives:
```bash
# Take a screenshot to see the screen
curl -s http://bots.local:8200/tools/screenshot
# Open a URL in Chrome
curl -s http://bots.local:8200/tools/open_url -d '{"url": "https://chatgpt.com"}'
# Click at screen coordinates
curl -s http://bots.local:8200/tools/click -d '{"x": 500, "y": 300}'
# Type text
curl -s http://bots.local:8200/tools/type_text -d '{"text": "Research the latest advances in..."}'
# Press keyboard shortcut
curl -s http://bots.local:8200/tools/key_press -d '{"keys": "return"}'
# Run AppleScript
curl -s http://bots.local:8200/tools/applescript -d '{"script": "tell app \"Google Chrome\" to activate"}'
# Find text on screen (OCR)
curl -s http://bots.local:8200/tools/find_on_screen -d '{"text": "Submit"}'
```
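Hand-written JSON bodies break as soon as the research query contains a double quote or a backslash. The sketch below builds the `type_text` payload with those characters escaped. It assumes single-line text and the `{"text": ...}` body shape shown above; `json_escape` and `type_text_payload` are hypothetical names.

```bash
# Hypothetical helper: escape backslashes and double quotes for use inside
# a JSON string. Newlines and control characters are not handled.
json_escape() {
  printf '%s' "$1" | sed -e 's/\\/\\\\/g' -e 's/"/\\"/g'
}

# Hypothetical helper: build the JSON body for the type_text tool.
type_text_payload() {
  printf '{"text": "%s"}\n' "$(json_escape "$1")"
}
```

It can then be passed to the same endpoint: `curl -s http://bots.local:8200/tools/type_text -d "$(type_text_payload "$query")"`.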
### Via Chrome DevTools Protocol (Port 9222)
For direct browser control without GUI coordinates:
```bash
# List open tabs
curl -s http://bots.local:9222/json
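# Open a URL in a new tab (recent Chrome versions require PUT for this endpoint; quote the URL so the shell does not treat ? as a glob)
curl -s -X PUT "http://bots.local:9222/json/new?https://chatgpt.com"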
# Execute JavaScript in a tab (via WebSocket — use a CDP client)
```
### Via SSH + cliclick (Simple Fallback)
```bash
# Open URL
ssh bots 'open https://chatgpt.com'
# Wait for page load
sleep 3
# Screenshot + analyze
ssh bots 'screencapture /tmp/screen.png'
scp bots:/tmp/screen.png ./screen.png
# Click at coordinates
ssh bots '/opt/homebrew/bin/cliclick c:500,300'
# Type text
ssh bots '/opt/homebrew/bin/cliclick t:"Research the latest advances in..."'
# Submit
ssh bots '/opt/homebrew/bin/cliclick kp:return'
```
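The `cliclick t:"..."` form above breaks if the query itself contains quotes, because the text passes through both the local and the remote shell. One way around this is to single-quote the text for the remote side. This is a sketch only; `shell_quote` is a hypothetical helper.

```bash
# Hypothetical helper: wrap text in single quotes for a POSIX shell,
# rewriting each embedded single quote as '\'' so the remote shell
# receives the text unchanged.
shell_quote() {
  printf "'%s'" "$(printf '%s' "$1" | sed "s/'/'\\\\''/g")"
}
```

With it, the typing step becomes `ssh bots "/opt/homebrew/bin/cliclick t:$(shell_quote "$query")"`, where `$query` holds the research prompt.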
### Recommended Machine
Use **bots** (192.168.86.50) for browser automation — it has full GUI support, screencapture works, and Chrome with remote debugging is running.
Install
curl -s https://skills.skynet.ceo/api/skills/chatgpt-deep-research/skill.md