# BYOK Chat Platform (Next.js)
BYOK (Bring Your Own Key) Chat is a Next.js platform that connects to multiple AI providers using your own API keys. It supports both standard OpenAI-compatible APIs and Cloudflare Workers AI.
GitHub: https://github.com/0xarchit/ByokChat
Live Demo: https://byok.0xarchit.is-a.dev
> [!TIP]
> Quick start: clone the repo, run `npm install`, then `npm run dev` to launch the development server at http://localhost:3000.
## Features
- Beautiful Landing Page: Modern, animated landing page with Aurora background effects and smooth scroll animations using ReactBits components
- Next.js 16 App Router: Built with the latest Next.js for optimal performance and SEO
- Provider Management: Add, edit, and manage multiple AI providers
- Model Selection: Fetch available models or manually add custom models
- API Key Validation: Test API keys with selected models
- Cloudflare Workers Support: Rotate between multiple accounts using worker URLs
- Customizable UI: Tailor the interface to your needs with Shadcn UI components
- Client-Side Storage: Uses IndexedDB for local data persistence
- Theme Support: Built-in light, dark, and cyber-aurora themes
- TypeScript: Fully typed for better development experience
- Animated Components: Leverages framer-motion and GSAP for smooth animations
## Routes

- `/` - Landing page showcasing features, advantages, and use cases
- `/chat` - Main chat interface for AI conversations
## Supported AI Providers

> [!NOTE]
> BYOK Chat supports all OpenAI-compatible endpoints with several fallbacks for maximum reliability.
### Popular Free AI Providers
- Groq
  - Base URL: https://api.groq.com/openai/
- Mistral
  - Base URL: https://api.mistral.ai/v1
- OpenRouter
  - Base URL: https://openrouter.ai/api/v1
- Cerebras
  - Base URL: https://api.cerebras.ai/v1
- Google Generative Language
- Cloudflare Worker AI
  - Deploy your own worker version to integrate seamlessly with the platform.
  - Refer to the Cloudflare Workers documentation for setup instructions.
## Tech Stack

- Next.js 16 (App Router)
- TypeScript
- Shadcn UI components
- Zustand for state management
- IndexedDB for client-side persistence
- framer-motion and GSAP for animations
## Getting Started
### Prerequisites
- Node.js (v20 or higher)
- npm, pnpm, or yarn
### Installation
1. Clone the repository:

   ```bash
   git clone https://github.com/0xarchit/ByokChat.git
   cd ByokChat/byok-chat
   ```

2. Install dependencies:

   ```bash
   npm install
   # or
   pnpm install
   # or
   yarn install
   ```

3. Start the development server:

   ```bash
   npm run dev
   # or
   pnpm dev
   # or
   yarn dev
   ```

4. Open your browser and navigate to http://localhost:3000.
### Building for Production

```bash
npm run build
npm run start
```
This will create an optimized production build and start the production server.
## Usage
### Adding a Provider
- Click on "Add Provider".
- Select the provider type (Standard or Cloudflare).
- Fill in the required fields:
- Standard: API URL, API keys, and models.
- Cloudflare: Worker URL, max index, and models.
- Test the API key and save the provider.
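The two provider shapes above could be modeled roughly as a discriminated union. This is an illustrative sketch only; type and field names are assumptions, not the app's actual definitions (those live under `types/`):

```typescript
// Hypothetical provider shapes; field names mirror the form fields above.
type StandardProvider = {
  kind: "standard";
  apiUrl: string;     // e.g. https://api.mistral.ai/v1
  apiKeys: string[];  // several keys may be stored for rotation
  models: string[];
};

type CloudflareProvider = {
  kind: "cloudflare";
  workerUrl: string;
  maxIndex: number;   // highest worker index to rotate through
  models: string[];   // added manually, since workers expose no model list
};

type Provider = StandardProvider | CloudflareProvider;
```

The `kind` discriminant lets the chat layer pick the correct request format per provider.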
### Editing a Provider
- Navigate to the "Providers" section.
- Click on the edit icon next to the provider.
- Update the fields and save changes.
### Fetching Models
- For standard providers, click "Fetch Available Models" to retrieve models from the API.
- For Cloudflare providers, manually add models using the input field.
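For standard providers, fetching models amounts to a GET against the provider's `/models` endpoint, which OpenAI-compatible APIs expose. A minimal sketch under that assumption (helper names are illustrative, not the app's actual code):

```typescript
// Hypothetical helper: build the request for an OpenAI-compatible /models endpoint.
function buildModelsRequest(baseUrl: string, apiKey: string) {
  const url = `${baseUrl.replace(/\/+$/, "")}/models`; // normalize trailing slashes
  return { url, headers: { Authorization: `Bearer ${apiKey}` } };
}

// Fetch the model list and return the model ids.
async function fetchModels(baseUrl: string, apiKey: string): Promise<string[]> {
  const { url, headers } = buildModelsRequest(baseUrl, apiKey);
  const res = await fetch(url, { headers });
  if (!res.ok) throw new Error(`Model list request failed: ${res.status}`);
  const body = (await res.json()) as { data: { id: string }[] };
  return body.data.map((m) => m.id);
}
```

Cloudflare workers do not expose such an endpoint, which is why their models must be entered manually.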
## Project Structure
```
byok-chat/
├── app/                   # Next.js App Router
│   ├── layout.tsx         # Root layout with metadata
│   ├── page.tsx           # Home page
│   ├── not-found.tsx      # 404 page
│   └── globals.css        # Global styles and theme variables
├── components/            # React components
│   ├── ui/                # Shadcn UI components
│   ├── settings/          # Settings-related components
│   └── *.tsx              # Feature components
├── lib/                   # Utility functions
│   ├── db.ts              # IndexedDB configuration
│   ├── api.ts             # API utilities
│   └── utils.ts           # Helper functions
├── store/                 # Zustand state management
│   ├── chatStore.ts       # Chat state
│   ├── providerStore.ts   # Provider state
│   └── settingsStore.ts   # Settings state
├── types/                 # TypeScript type definitions
├── hooks/                 # Custom React hooks
└── public/                # Static assets
```
## Context Management & Summarization
This project includes built-in context management features designed to keep conversations coherent while staying within model context windows and token limits. The system balances retaining recent, relevant messages with periodic summarization of older content so the model keeps essential context without sending the entire chat history.
> [!IMPORTANT]
> Context management ensures your conversations remain coherent even with long chat histories while respecting token limits.
### Key Behaviors
- Automatic summarization: After a configurable number of recent messages (default: 20) the app will create a concise summary of the previous segment of the conversation and store it with the chat. Future requests include that summary as system context so the assistant retains the important facts and decisions from earlier messages.
- Context windowing: The app keeps the most recent N messages (configurable per chat and globally; default: 10) and forwards them along with the latest user prompt and the latest summary. This keeps the input size smaller while preserving immediate conversational context.
- Background summarization: Summaries are generated asynchronously in the background so they don't block the active chat. If summarization fails, the system logs the error and continues; it will retry during subsequent summarization cycles.
- Provider rotation & resiliency: When using Cloudflare worker providers or multiple API keys, the system rotates indices and keys to avoid throttling. Summaries and requests are sent through the same provider route and will respect provider-specific requirements (for example, sending an `index` field when using Cloudflare Workers).
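The rotation described above can be sketched as a simple round-robin over the configured index range. Function names and the payload shape are assumptions for illustration, not the app's actual code:

```typescript
// Cycle 0..maxIndex so successive requests hit different worker accounts.
function rotateIndex(current: number, maxIndex: number): number {
  return (current + 1) % (maxIndex + 1);
}

// Hypothetical payload builder: Cloudflare worker providers receive an
// `index` field alongside the usual chat-completion fields (per the note above).
function buildWorkerPayload(
  model: string,
  messages: { role: string; content: string }[],
  index: number
) {
  return { model, messages, index };
}
```

With `maxIndex` set to 2, successive requests would use indices 0, 1, 2, 0, 1, ...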
### Configurable Settings
- `summarizeAfter`: Number of messages per summarization batch (default 20). When this many non-system messages accumulate, the app will produce a summary for that segment.
- `retainMessages`: Number of recent messages to keep after summarization (default 10). These messages are sent directly with the prompt to preserve local conversational continuity.
- `temperature`: Optional override for the model temperature used in chat requests or summarization (leave empty to use provider/API defaults).
- `maxTokens`: Optional override for max tokens for completions and summarization (leave empty to use provider/API defaults).
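As a sketch, the settings above map to a small options object. The interface name is illustrative; the default values match those stated above:

```typescript
// Hypothetical settings shape; optional fields fall back to provider/API defaults.
interface ContextSettings {
  summarizeAfter: number; // messages per summarization batch
  retainMessages: number; // recent messages kept verbatim after summarization
  temperature?: number;   // undefined = use provider default
  maxTokens?: number;     // undefined = use provider default
}

const defaults: ContextSettings = {
  summarizeAfter: 20,
  retainMessages: 10,
};
```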
### Best Practices
- Tune `summarizeAfter` and `retainMessages` based on the model you use (larger models can handle larger contexts). Lower `retainMessages` when you need to conserve tokens.
- Keep summaries concise but informative: the summarization prompt in the app aims for clear, structured summaries focusing on topics, decisions, and facts necessary to continue the conversation.
- Monitor token usage and set `maxTokens` conservatively if you are on a rate-limited or cost-sensitive provider.
### Implementation Notes
- The app builds a `messages` array that includes:
  - the system prompt (including the stored summary when present),
  - the most recent retained user/assistant messages,
  - the latest user message triggering the request.
- When the configured `summarizeAfter` threshold is reached, older segments are summarized using the same chat completion endpoint in the background. The produced summary is stored in the chat object and used as system context for later requests.
- Summarization requests use a low temperature (for determinism) and a higher `max_tokens` to allow longer, structured summaries.
If you want to change defaults or experiment with different summarization prompts, check the implementation in `lib/api.ts`, `store/chatStore.ts`, and the settings UI in `components/settings/ContextControls.tsx`.
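The request assembly described in the notes above can be sketched as follows. Function and field names are hypothetical; this is not the app's actual code, just an illustration of the windowing-plus-summary idea:

```typescript
type ChatMessage = { role: "system" | "user" | "assistant"; content: string };

// Build the outgoing messages array: system prompt (with summary if present),
// then the most recent `retainMessages` history entries.
function buildRequestMessages(
  history: ChatMessage[], // full non-system history, oldest first
  summary: string | null, // stored summary of older segments, if any
  retainMessages: number  // e.g. 10
): ChatMessage[] {
  const system: ChatMessage = {
    role: "system",
    content: summary
      ? `You are a helpful assistant.\n\nConversation summary so far:\n${summary}`
      : "You are a helpful assistant.",
  };
  // Older messages are dropped from the request; the summary covers them.
  const recent = history.slice(-retainMessages);
  return [system, ...recent];
}
```

Note that the latest user message is simply the last entry of the retained window, so it always reaches the model verbatim.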
## Deployment
### Vercel (Recommended)
The easiest way to deploy your Next.js app is to use Vercel:
- Push your code to GitHub
- Import your repository to Vercel
- Vercel will automatically detect Next.js and configure the build settings
- Deploy!
### Other Platforms
You can also deploy to:
- Netlify: Supports Next.js with automatic configuration
- AWS Amplify: Full support for Next.js
- Docker: Use the provided Dockerfile for containerized deployments
- Self-hosted: Build the app and run with `npm start`
## Contributing
Contributions are welcome! Please follow these steps:
- Fork the repository
- Create a feature branch (`git checkout -b feature/amazing-feature`)
- Commit your changes (`git commit -m 'Add some amazing feature'`)
- Push to the branch (`git push origin feature/amazing-feature`)
- Open a Pull Request
## License
This project is licensed under the MIT License. See the LICENSE file for details.
## Support
For issues, questions, or contributions, please visit the GitHub repository.
Happy chatting! 🚀