18 releases
new 0.3.1 | Jan 6, 2025 |
---|---|
0.3.0 | Jan 6, 2025 |
0.2.14 | Jan 4, 2025 |
0.2.13 | Dec 26, 2024 |
0.1.0 | Dec 23, 2024 |
#116 in Text processing
1,485 downloads per month
615KB, 1.5K SLoC
Colossus
Voice-Controlled Multi-Agent AI Development System
Colossus is an innovative development platform that combines real-time voice interaction with a coordinated team of AI agents to streamline the software development process. It orchestrates multiple specialized AI agents working together through different phases of development while maintaining voice communication with you.
How It Works
Colossus operates in three distinct phases:
1. Planning Phase
During this phase, multiple agents work simultaneously:
- Product Manager: Processes voice transcripts to maintain project requirements
- Architect: Designs and updates the technical architecture
- Project Manager: Breaks down work into specific tasks
- Test Strategist: Develops comprehensive test plans
2. Development Phase
Once planning is complete, the system switches to development mode where:
- Developer Agent: Implements tasks in order, following test-driven development
- Automated build and test processes run after each implementation
- Tasks are automatically marked complete when tests pass
3. Human Intervention Mode
If critical issues arise that the AI agents cannot resolve:
- System automatically halts development
- Signals need for human intervention
- Provides detailed error context
- Returns to development mode once issues are resolved
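The build-and-test gate described in the development phase can be pictured with a small shell sketch. This is only an illustration, not Colossus's actual implementation: the function name `task_status` and the status strings are hypothetical, and it assumes the project's Makefile exposes `build` and `test` targets.

```shell
# Hypothetical sketch, not Colossus's real code.
# Usage: task_status PROJECT_DIR
# Prints "complete" when both the build and the tests succeed, otherwise
# "failed" (the point where an agent would retry or, for critical issues,
# hand over to a human).
task_status() {
  project_dir="$1"
  if make -C "$project_dir" build >/dev/null 2>&1 &&
     make -C "$project_dir" test >/dev/null 2>&1; then
    echo "complete"
  else
    echo "failed"
  fi
}
```

Calling `task_status ./my-project` after each change would mirror the "mark complete when tests pass" rule in miniature.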
Key Features
- Real-time Voice Interface: Natural conversation with the AI system
- Multi-Agent Coordination: Specialized AI agents working in concert
- Automated Development Cycle: Continuous implementation, testing, and validation
- Context-Aware Development: Uses multiple context files for specialized tasks
- Web Search Integration: Perplexity-powered web search for additional information
- Error Management: Smart detection and handling of critical issues
Important Notes
This project uses OpenAI's Realtime API, which is billed at Realtime API rates. Please be aware:
- Requires appropriate API credits and budget
- Tier 1 credits have daily request limits
- Recommended minimum credit balance: $50 for unrestricted usage
Installation
- Install Rust and Cargo using rustup:
  curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh
- Restart your terminal or reload your shell configuration:
  source "$HOME/.cargo/env"
- Install Colossus:
  cargo install colossus
Required API Keys
To use all features of Colossus, you'll need the following API keys:
- OpenAI API Key: Required for real-time voice interactions and code analysis (based on the aider leaderboard)
  - Get it from: https://platform.openai.com/api-keys
  - Set as: OPENAI_API_KEY
- Perplexity API Key: Required for web search integration
  - Get it from: https://www.perplexity.ai/settings/api
  - Set as: PERPLEXITY_API_KEY
You can set these keys in your .env file or as environment variables.
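As a minimal sketch of the environment-variable route, the snippet below exports both keys for the current shell session and then checks that neither is empty. The key values are placeholders, and `missing_keys` is a hypothetical helper written for this example, not something Colossus provides.

```shell
# Placeholder values for illustration only; substitute your real keys.
export OPENAI_API_KEY="your-openai-key"
export PERPLEXITY_API_KEY="your-perplexity-key"

# Hypothetical helper, not part of Colossus: print the name of each
# expected key that is unset or empty in the current environment.
missing_keys() {
  for name in OPENAI_API_KEY PERPLEXITY_API_KEY; do
    eval "value=\${$name:-}"
    [ -n "$value" ] || echo "$name"
  done
}

missing="$(missing_keys)"
if [ -n "$missing" ]; then
  echo "Missing API keys: $missing" >&2
else
  echo "All API keys are set"
fi
```

In a .env file, the same `NAME=value` pairs would normally be written without the `export` keyword.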
Quick Start
- Create a new project:
  colossus init ./my-project
  This will:
  - Create the project directory if it doesn't exist
  - Initialize a git repository
  - Create a template .env file
  - Set up language-specific configuration files
  - Create a Makefile with build and test targets
- Add your API keys to the .env file:
  - OPENAI_API_KEY (required)
  - PERPLEXITY_API_KEY (optional, for web search)
- Start the Colossus server:
  colossus serve
- Open the web interface shown in the terminal
- Click "Start Session" to begin
- Use the microphone button to talk with Colossus
Advanced Usage
Colossus supports several command line options for customization:
USAGE:
    colossus [OPTIONS]

OPTIONS:
    -d, --project-dir <DIR>          Directory to serve project files from [default: "./"]
    -p, --port <PORT>                Port number to run the server on [default: 49999]
    -m, --model <MODEL>              OpenAI model name to use [default: gpt-4o-realtime-preview-2024-12-17]
    -l, --preferred-language <LANG>  Preferred language for communication [default: english]
    -i, --instructions <TEXT>        Custom instructions for the AI assistant
    -v, --voice <VOICE>              Voice to use for speech [default: ash]
                                     (supported: alloy, ash, coral, echo, fable, onyx, nova, sage, shimmer)
    -c, --code-model <MODEL>         OpenAI model to use for code analysis
    -h, --help                       Print help information
    -V, --version                    Print version information
Example with custom settings:
colossus -c deepseek/deepseek-chat -d /path/to/project
How to prepare a project for colossus
- Add a Makefile that has a build and a test target
- Add a CONTEXT.md that loads all appropriate files
# example
/add TASKS.md
/read-only ARCHITECTURE.md
/read-only PROJECT.md
/read-only TEST_STRATEGY.md
/read-only Makefile
/add **/*.js
/add **/*.css
/add **/*.html
/add package.json
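Before pointing Colossus at an existing project, a quick preflight check along the following lines can confirm that both files are in place. This is a sketch under stated assumptions: `check_project` is a hypothetical helper (not part of Colossus), it only recognizes targets declared as `build:` or `test:` at the start of a line, and the demo Makefile uses placeholder commands.

```shell
# Hypothetical preflight check, not part of Colossus.
# Usage: check_project [PROJECT_DIR]
# Prints one line per problem; returns 0 only if nothing is missing.
check_project() {
  dir="${1:-.}"
  problems=0
  if [ -f "$dir/Makefile" ]; then
    for target in build test; do
      # Simplification: only matches targets declared as "name:" at line start.
      if ! grep -Eq "^${target}:" "$dir/Makefile"; then
        echo "Makefile has no '$target' target"
        problems=1
      fi
    done
  else
    echo "missing Makefile"
    problems=1
  fi
  if [ ! -f "$dir/CONTEXT.md" ]; then
    echo "missing CONTEXT.md"
    problems=1
  fi
  return "$problems"
}

# Demo on a scratch project with placeholder build and test commands.
demo_dir="$(mktemp -d)"
printf 'build:\n\t@echo "building"\n\ntest:\n\t@echo "testing"\n' > "$demo_dir/Makefile"
printf '/read-only Makefile\n' > "$demo_dir/CONTEXT.md"
check_project "$demo_dir" && echo "project is ready"
rm -rf "$demo_dir"
```

In a real project, the `build` and `test` recipes would call whatever the project's toolchain uses in place of the `echo` placeholders.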
Contributing
This is all incredibly new, but feel free to drop suggestions!
License
This project is licensed under the MIT License - see the LICENSE file for details.
Dependencies: ~17–31MB, ~462K SLoC