FlashCap

Open-source camera / webcam capture library. Feeds webcam frames to vision-capable LLMs.

FlashCap is an open-source camera capture library. The Voxta module uses it to grab webcam frames and feed them to your active vision-capable LLM, so your character can see what your camera sees.

Marked experimental in the registry. Behavior and config may change.

Setup

Add the service

Manage Services → + Add Services → FlashCap → Add.

Pick a camera

In the config, choose your webcam from the device list. Multiple cameras can be defined and switched between.

Enable the Vision augmentation

Manage Services → + Add Services → Voxta Utilities: Vision → Add. This is what actually wires the captured frame into the chat prompt for a vision-capable LLM to interpret.

The character only sees frames when the Vision augmentation triggers — either on a user message that mentions the camera, or continuously if configured. They never see a live feed; just snapshots.

Setup

Add the service

Pick a camera

Enable the Vision augmentation

What the character sees

On this page