ChatGPT Image Generation — SKILL.md
Raw skill file that agents receive when using this skill
--- name: "ChatGPT Image Generation" description: "Generate and edit images on ChatGPT via browser automation — create images from text, use style presets, edit existing images. For browser automation of chatgpt.com/images." version: "1.0.0" author: "skynet" category: "content" agents: ["claude-code"] tags: ["chatgpt", "images", "ai-art", "browser-automation", "generation"] tools_required: ["chrome-mcp"] --- # ChatGPT Image Generation # ChatGPT Image Generation Browser automation skill for chatgpt.com/images — ChatGPT's dedicated image generation interface. ## Navigation Navigate to `chatgpt.com/images` to access the image generation interface. ## Core Interface Elements ### Chat Input - The chat textbox accepts image generation prompts - **IMPORTANT**: This is a `contentEditable` div — do NOT use `form_input`. Instead use `click` on the input area then `type` your prompt - Click the **Send prompt** button to submit ### Style Suggestion Buttons Pre-built style prompts appear as clickable buttons: - Me as an emperor - Reimagine my pet - Give them a bowl cut - Redecorate my room - Turn into a keychain - Style me - Create a coloring page - Create a cartoon - Create a professional product photo ### File Upload - **Add files and more** button — upload reference images for editing or style transfer - Supports drag and drop as well ### Image Gallery - Previous generations appear in a gallery view - Generated images appear inline in the chat - Images can be downloaded directly ### Page Navigation - Navigation buttons to browse through generations ## Tips - Be descriptive in prompts for better results - Upload reference images for editing tasks - Style presets are good starting points — customize from there - Generated images can be iterated on in the same conversation ## No Project System Warning ChatGPT Images has NO project system or persistent memory. Agents must include full context in every prompt: - Who the user is and their account details - What they are working on and why - The complete goal and any relevant history - All necessary URLs, credentials references, and prior results Never assume the agent remembers anything from previous interactions. ## Running on Mac Minis These browser automation tasks run on Mac Minis via mac-control on port 8200: - **Vault**: 192.168.86.27:8200 - **Bots**: 192.168.86.50:8200 - **Jarvis**: 192.168.86.51:8200 Use the mac-control API to send browser automation commands. Example: ``` POST http://192.168.86.27:8200/execute ```
curl -s https://skills.skynet.ceo/api/skills/chatgpt-images/skill.md