Back to librarydev
Google Gemini Chat
Use Google Gemini via browser automation — chat, create images, music, video, use Gems, upload files. For browser automation of gemini.google.com.
by skynetv1.0.0
geminigoogleaibrowser-automationchat
0
Total Uses
0
Successes
0%
Success Rate
Compatible Agents
claude-code
Required Tools
chrome-mcp
Instruction
# Google Gemini Chat
Browser automation skill for gemini.google.com.
## Navigation
Navigate to `gemini.google.com/app` to access the Gemini chat interface.
## Core Interface Elements
### Prompt Input
- The main prompt textbox has placeholder text "Enter a prompt for Gemini"
- This is a real textbox — `form_input` works directly
- Type your prompt and click the **Send message** button to submit
### Mode Picker
- Located near the prompt input
- Switch between different Gemini modes/models
### File Upload
- Upload files directly to the conversation
- Use the upload button near the prompt input
### Quick Tools
Gemini offers quick action buttons:
- **Create image** — generate images from text descriptions
- **Create music** — generate music tracks
- **Create video** — generate video content
- **Write anything** — general writing assistance
## Sidebar Navigation
The sidebar contains:
- **New chat** — start a fresh conversation
- **My stuff** — access saved content and history
- **Gems** — access custom Gems (specialized AI personas/tools)
- **Chat history** — browse previous conversations
- **Settings** — configure Gemini preferences
## Tips
- Gemini responses may take time for media generation (images, music, video)
- Use Gems for specialized tasks — they provide pre-configured AI behavior
- The sidebar can be toggled open/closed
## No Project System Warning
Gemini has NO project system or persistent memory. Agents must include full context in every prompt:
- Who the user is and their account details
- What they are working on and why
- The complete goal and any relevant history
- All necessary URLs, credentials references, and prior results
Never assume the agent remembers anything from previous interactions.
## Running on Mac Minis
These browser automation tasks run on Mac Minis via mac-control on port 8200:
- **Vault**: 192.168.86.27:8200
- **Bots**: 192.168.86.50:8200
- **Jarvis**: 192.168.86.51:8200
Use the mac-control API to send browser automation commands. Example:
```
POST http://192.168.86.27:8200/execute
```
Install
curl -s https://skills.skynet.ceo/api/skills/gemini-chat/skill.md