Server¶

Neurosurfer’s Server layer brings together a FastAPI backend and a React frontend (NeurosurferUI) that communicate over a clean, OpenAI‑style API. You can keep the defaults and ship fast, or swap/extend chat handlers, tools, and routes for a fully custom stack.

What you get: OpenAI‑compatible endpoints, RAG‑ready flows, streaming/tool‑calling support, and a React UI designed for iterative agent development.

Neurosurfer API (Backend)

OpenAI‑style /v1/chat/completions, lifecycle hooks, chat handlers, custom endpoints, auth/users.

Read the Backend guide
NeurosurferUI (Frontend)

Chat UX, threads/sessions, file uploads for RAG, and a typed API client. Easy to extend with your own components.

Explore the NeurosurferUI
Example Application

A minimal but complete reference app: startup/shutdown hooks, a custom chat handler, drop‑in RAG, and a sample tool/endpoint.

Walk through the Example

Usage Overview¶

Start the server via the CLI and use Docker/Compose or a reverse proxy in staging/production. The backend is a FastAPI app (exported as NeurosurferApp), and the frontend is a Vite dev server during development (bundled React app in production).

Start everything (dev)¶

# Auto-detects UI root (or pass --ui-root). Will run npm install on first run.
neurosurfer serve

Backend binds to NEUROSURF_BACKEND_HOST / NEUROSURF_BACKEND_PORT (from config).
UI root auto‑detected; override with --ui-root /path/to/neurowebui.
UI talks to the backend using VITE_BACKEND_URL (injected automatically when binding to 0.0.0.0 via NEUROSURF_PUBLIC_HOST).

Common variants

# Backend only (no UI)
neurosurfer serve --only-backend --backend-host 0.0.0.0 --backend-port 8000

# UI only (point to your UI root)
neurosurfer serve --only-ui --ui-root ./neurosurferui

# Serve your own app file (must expose a NeurosurferApp instance)
neurosurfer serve --backend-app ./app.py --backend-reload

# Serve a module with an instance or factory
neurosurfer serve --backend-app mypkg.myapp:ns
neurosurfer serve --backend-app mypkg.myapp:create_app()

Backend app resolution

--backend-app accepts a module path with :attr or :factory(), or a Python file containing a NeurosurferApp instance. Don’t call app.run() at import time; the CLI orchestrates the process.

Deployment¶

You can containerize the backend and UI, or keep the UI as a static build behind a reverse proxy. Below are pragmatic patterns that work well in practice.

Docker & Compose (backend focus)¶

Dockerfile (backend)

FROM python:3.11-slim

WORKDIR /app
COPY . /app

# Install core + LLM extras if you’re serving models locally
RUN pip install -U pip \
 && pip install -e . \
 && pip install 'neurosurfer[torch]'

ENV NEUROSURF_SILENCE=1 \
    NEUROSURF_BACKEND_HOST=0.0.0.0 \
    NEUROSURF_BACKEND_PORT=8000

EXPOSE 8000

# Run your app module or file; replace with your backend entry
CMD ["neurosurfer", "serve", "--only-backend", "--backend-app", "neurosurfer.examples.quickstart_app:ns"]

docker-compose.yml (backend + proxy skeleton)

version: "3.9"
services:
  api:
    build: .
    image: neurosurfer-api:latest
    environment:
      NEUROSURF_SILENCE: "1"
      NEUROSURF_BACKEND_HOST: "0.0.0.0"
      NEUROSURF_BACKEND_PORT: "8000"
    ports:
      - "8000:8000"
    restart: unless-stopped

  # Optional: Caddy / Nginx as reverse proxy serving UI and proxying /v1 to api
  # proxy:
  #   image: caddy:2
  #   volumes:
  #     - ./Caddyfile:/etc/caddy/Caddyfile:ro
  #     - ./ui-dist:/srv  # built React app
  #   ports:
  #     - "80:80"
  #     - "443:443"
  #   restart: unless-stopped

UI build for production

In production, serve the built UI (static files) from your proxy or a CDN and reverse‑proxy API calls (e.g., /v1/*) to the backend. During development, the CLI runs the Vite dev server for you.

Reverse Proxy (Nginx/Caddy)¶

In a reverse proxy, route / to your built UI and /v1/* to the backend (and /docs if you expose FastAPI docs). Ensure CORS settings in the backend match your deployed origins.

Nginx: location /v1/ { proxy_pass http://api:8000/v1/; }
Caddy: reverse_proxy /v1/* api:8000

TLS and WebSockets

If you use streaming or WebSockets, confirm your proxy is configured to forward upgrade headers. Both Nginx and Caddy can do this with their standard reverse_proxy settings.

Configuration & Environment¶

Configuration lives with the backend and is also influenced by CLI/env. For a deeper dive, see Backend → Configuration.

Common environment variables

NEUROSURF_PUBLIC_HOST — used to craft VITE_BACKEND_URL when backend binds 0.0.0.0/::
NEUROSURF_UI_ROOT — path to the UI project for dev mode
NEUROSURF_SILENCE=1 — suppress banner/optional‑deps warnings on import
NEUROSURF_BACKEND_HOST / NEUROSURF_BACKEND_PORT — default bind for the API
NEUROSURF_BACKEND_LOG, NEUROSURF_BACKEND_WORKERS, NEUROSURF_BACKEND_WORKER_TIMEOUT — backend behavior

Model/runtime dependencies

The server core is light. Install the full LLM stack when you need local inference/finetuning:

pip install -U 'neurosurfer[torch]'

For CUDA wheels (Linux x86_64):

pip install -U torch --index-url https://download.pytorch.org/whl/cu124

or CPU‑only:

pip install -U torch --index-url https://download.pytorch.org/whl/cpu

Worked Example¶

The Example Application shows: startup hooks (model load, RAG wiring), a custom chat handler, file uploads driving RAG, and a typed server that streams completions.

Read it here → Example Application
Looking for custom handlers? → Chat Handlers
Want to wire tools or extra routes? → Custom Endpoints