BioAgents AgentKit

An advanced AI agent framework for biological and scientific research. BioAgents provides powerful conversational AI capabilities with specialized knowledge in biology, life sciences, and scientific research methodologies.

Setup

Check out SETUP.md

Agent Backend

Routes

The system operates through two main routes:

/api/chat - Agent-based chat for general research questions with automatic literature search
/api/deep-research - Deep research mode with iterative hypothesis-driven investigation

Chat diagram

Deep research diagram

Both routes use the same agent architecture but differ in their orchestration and iteration patterns.

Agents

Agents are the core concept in this repository. Each agent is a self-contained, independent function that performs a specific task in the research workflow. Agents are designed to be modular and reusable across different routes and contexts.

Available Agents

File Upload Agent - Handles file parsing, storage, and automatic description generation
- Supports PDF, Excel, CSV, MD, JSON, TXT files
- Generates AI-powered descriptions for each dataset
- Stores files in cloud storage with metadata
Planning Agent - Creates research plans based on user questions
- Analyzes available datasets and research context
- Generates task sequences (LITERATURE or ANALYSIS)
- Updates current research objectives
Literature Agent - Searches and synthesizes scientific literature
- OPENSCHOLAR: General scientific literature search with citations
- EDISON: Edison AI literature search (deep research mode only)
- KNOWLEDGE: Searches your custom knowledge base with semantic search and reranking
- Returns synthesized findings with inline citations in format: (claim)[DOI or URL]
Analysis Agent - Performs data analysis on uploaded datasets
- EDISON: Deep analysis via Edison AI agent with file upload to Edison storage
- BIO: Basic analysis via BioAgents Data Analysis Agent
- Uploads datasets to analysis service and retrieves results
Hypothesis Agent - Generates research hypotheses
- Synthesizes findings from literature and analysis
- Creates testable hypotheses with inline citations
- Considers current research context and objectives
Reflection Agent - Reflects on research progress
- Extracts key insights and discoveries
- Updates research methodology
- Maintains conversation-level understanding
Reply Agent - Generates user-facing responses
- Deep Research Mode: Includes current objective, next steps, and asks for feedback
- Chat Mode: Concise answers without next steps
- Preserves inline citations throughout

Adding New Agents

To add a new agent:

Create a folder in src/agents/
Implement the main agent function in index.ts
Add supporting logic in separate files within the folder
Export the agent function for use in routes
Shared utilities go in src/utils

State Management

State is separated into two types:

Message State (State):

Ephemeral, tied to a single message
Contains processing details for that message only
Automatically cleared after processing
Used for temporary data like raw file buffers

Conversation State (ConversationState):

Persistent across the entire conversation
Contains cumulative research data:
- Uploaded datasets with descriptions
- Current plan and completed tasks
- Key insights and discoveries
- Current hypothesis and methodology
- Research objectives
Stored in database and maintained across requests
This is the primary state that drives the research workflow

LLM Library

The LLM library provides a unified interface for multiple LLM providers. It allows you to use any Anthropic/OpenAI/Google or OpenRouter LLM via the same interface. Examples of calling the LLM library can be found in all agents.

Key Features:

Unified API across providers (Anthropic, OpenAI, Google, OpenRouter)
Extended thinking support for Anthropic models
System instruction support
Streaming and non-streaming responses
Examples in every agent implementation

Literature Agent & Knowledge Base

The Literature Agent includes multiple search backends:

OPENSCHOLAR (Optional):

General scientific literature search with high-quality citations
Requires custom deployment and configuration
Set OPENSCHOLAR_API_URL and OPENSCHOLAR_API_KEY to enable
Paper: https://arxiv.org/abs/2411.14199
Deployment: https://github.com/bio-xyz/bio-openscholar

EDISON (Optional):

Edison AI literature search (deep research mode only)
Requires custom deployment and configuration
Set EDISON_API_URL and EDISON_API_KEY to enable
Deployment: https://github.com/bio-xyz/bio-edison-api

KNOWLEDGE (Customizable):

Vector database with semantic search (embeddings)
Cohere reranker for improved results (requires COHERE_API_KEY)
Document processing from docs directory
Documents are processed once per filename and stored in vector DB

To add custom knowledge:

Place documents in the docs/ directory
- Supported formats: PDF, Markdown (.md), DOCX, TXT
Documents are automatically processed on startup
Vector embeddings are generated and stored
Available to Literature Agent via KNOWLEDGE tasks

Docker Deployment Note: When deploying with Docker, agent-specific documentation in docs/ and branding images in client/public/images/ are persisted using Docker volumes. These directories are excluded from git (see .gitignore) but automatically mounted in your Docker containers via volume mounts defined in docker-compose.yml. This allows you to customize your agent with private documentation without committing it to the repository.

Analysis Agent Configuration

The Analysis Agent supports two backends for data analysis:

EDISON (Default):

Deep analysis via Edison AI agent
Automatic file upload to Edison storage service
Requires EDISON_API_URL and EDISON_API_KEY
https://github.com/bio-xyz/bio-edison-api

BIO (Alternative):

Basic analysis via BioAgents Data Analysis Agent
Set PRIMARY_ANALYSIS_AGENT=bio in .env
Requires DATA_ANALYSIS_API_URL and DATA_ANALYSIS_API_KEY
https://github.com/bio-xyz/bio-data-analysis

Both backends receive datasets and analysis objectives, execute analysis code, and return results.

Character File

The character file defines your agent's identity and system instructions. It's now simplified to focus on core behavior:

name: Your agent's name
system: System prompt that guides agent behavior across all interactions

The character's system instruction is automatically included in LLM calls for planning, hypothesis generation, and replies, ensuring consistent behavior throughout the research workflow. You can enable the system prompt in any LLM call by setting the 'systemInstruction' parameter.

UI

Component system:

Custom hooks in client/src/hooks/
UI components in client/src/components/ui/
Lucide icons via client/src/components/icons/

Styling:

Main styles: client/src/styles/global.css
Button styles: client/src/styles/buttons.css
Mobile-first responsive design

Payment Integration:

The UI includes integrated support for x402 micropayments using Coinbase embedded wallets:

Embedded wallet authentication via client/src/components/EmbeddedWalletAuth.tsx
x402 payment hooks in client/src/hooks/useX402Payment.ts
Seamless USDC payment flow for paid API requests
Toast notifications for payment status

Authentication

BioAgents supports two independent auth systems:

Setting	Options	Purpose
`AUTH_MODE`	`none` / `jwt`	JWT authentication for external frontends
`X402_ENABLED`	`true` / `false`	x402 USDC micropayments

JWT Authentication (Production)

For external frontends connecting to the API:

# .env
AUTH_MODE=jwt
BIOAGENTS_SECRET=your-secure-secret  # Generate with: openssl rand -hex 32

Your backend signs JWTs with the shared secret:

// Your backend generates JWT for authenticated users
const jwt = await new jose.SignJWT({ sub: userId })  // sub must be valid UUID
  .setProtectedHeader({ alg: 'HS256' })
  .setExpirationTime('1h')
  .sign(new TextEncoder().encode(process.env.BIOAGENTS_SECRET));

// Call BioAgents API
fetch('https://your-bioagents-api/api/chat', {
  headers: { 'Authorization': `Bearer ${jwt}` },
  body: JSON.stringify({ message: 'What is rapamycin?' })
});

📖 See AUTH.md for complete JWT integration guide

x402 Payment Protocol (Optional)

For pay-per-request access using USDC micropayments:

# .env
X402_ENABLED=true
X402_ENVIRONMENT=testnet  # or mainnet
X402_PAYMENT_ADDRESS=0xYourWalletAddress

📖 See AUTH.md for x402 configuration details

Job Queue (Production)

BioAgents supports BullMQ for reliable background job processing with:

Horizontal scaling: Run multiple worker instances
Job persistence: Jobs survive server restarts
Automatic retries: Failed jobs retry with exponential backoff
Real-time updates: WebSocket notifications for job progress
Admin dashboard: Bull Board UI at /admin/queues

# Enable job queue
USE_JOB_QUEUE=true
REDIS_URL=redis://localhost:6379

# Start API server and worker separately
bun run dev      # API server
bun run worker   # Worker process

📖 See JOB_QUEUE.md for complete setup and configuration guide

Project Structure

├── src/                      # Backend source
│   ├── index.ts             # API server entry point
│   ├── worker.ts            # BullMQ worker entry point
│   ├── routes/              # HTTP route handlers
│   │   ├── chat.ts          # Agent-based chat endpoint
│   │   ├── deep-research/   # Deep research endpoints
│   │   ├── x402/            # x402 payment-gated routes
│   │   ├── b402/            # b402 payment-gated routes
│   │   └── admin/           # Bull Board dashboard
│   ├── agents/              # Independent agent modules
│   │   ├── fileUpload/      # File parsing & storage
│   │   ├── planning/        # Research planning
│   │   ├── literature/      # Literature search (OPENSCHOLAR, EDISON, KNOWLEDGE)
│   │   ├── analysis/        # Data analysis (EDISON, BIO)
│   │   ├── hypothesis/      # Hypothesis generation
│   │   ├── reflection/      # Research reflection
│   │   └── reply/           # User-facing responses
│   ├── services/            # Business logic layer
│   │   ├── chat/            # Chat-related services
│   │   ├── queue/           # BullMQ job queue system
│   │   │   ├── connection.ts    # Redis connection management
│   │   │   ├── queues.ts        # Queue definitions & config
│   │   │   ├── workers/         # Job processors
│   │   │   └── notify.ts        # Pub/Sub notifications
│   │   ├── websocket/       # Real-time notifications
│   │   │   ├── handler.ts       # WebSocket endpoint
│   │   │   └── subscribe.ts     # Redis Pub/Sub subscriber
│   │   └── jwt.ts           # JWT verification service
│   ├── middleware/          # Request/response middleware
│   │   ├── authResolver.ts  # Multi-method authentication
│   │   ├── rateLimiter.ts   # Rate limiting
│   │   ├── x402/            # x402 payment protocol (Base/USDC)
│   │   └── b402/            # b402 payment protocol (BNB/USDT)
│   ├── llm/                 # LLM providers & interfaces
│   ├── embeddings/          # Vector database & document processing
│   ├── db/                  # Database operations
│   ├── storage/             # File storage (S3-compatible)
│   ├── utils/               # Shared utilities
│   ├── types/               # TypeScript types
│   └── character.ts         # Agent identity & system prompt
├── client/                  # Frontend UI (Preact)
│   ├── src/
│   │   ├── components/     # UI components
│   │   ├── hooks/          # Custom hooks (chat, payments, etc.)
│   │   └── styles/         # CSS files
│   └── public/             # Static assets
├── documentation/           # Project documentation
│   └── docs/               # Detailed guides (AUTH.md, SETUP.md, JOB_QUEUE.md)
├── docs/                    # Custom knowledge base documents (scientific papers)
└── package.json

Built with Bun - A fast all-in-one JavaScript runtime.

BioAgents
BioAgents copied to clipboard

Metadata

BioAgents AgentKit

Setup

Agent Backend

Routes

Agents

Available Agents

Adding New Agents

State Management

LLM Library

Literature Agent & Knowledge Base

Analysis Agent Configuration

Character File

UI

Authentication

JWT Authentication (Production)

x402 Payment Protocol (Optional)

Job Queue (Production)

Project Structure

← Metadata

Owner

Metadata

BioAgents BioAgents copied to clipboard

Metadata

BioAgents AgentKit

Setup

Agent Backend

Routes

Agents

Available Agents

Adding New Agents

State Management

LLM Library

Literature Agent & Knowledge Base

Analysis Agent Configuration

Character File

UI

Authentication

JWT Authentication (Production)

x402 Payment Protocol (Optional)

Job Queue (Production)

Project Structure

← Metadata

Owner

Metadata

BioAgents
BioAgents copied to clipboard