Scene Scout search results preview

Search through your local video collection using natural language queries or image similarity, powered by Google's SigLIP 2 model.

Download Latest Release

Features

Various functions to find the scenes you are looking for...

Natural Language Search

Find video scenes using text descriptions. Simply type what you are looking for and get results.

Image-to-Scene Search

Use a reference image to find similar scenes in your video collection with visual similarity matching.

Database & Queue management system

Search across multiple databases at once, and queue up files and folders for processing.

Video Playback and Export

Watch scenes play directly in the GUI, and export a selected scene to a separate video file.

Multiplatform Support

Works on Windows, Linux, and Mac with automated install scripts for easy setup.

GPU Acceleration

Supports CUDA, TensorRT, DirectML, Intel Arc/Xe, AMD ROCm, and Apple MPS for fast processing.

Local SQLite Database

Efficient local storage for embeddings, scene data, and thumbnails. Indexing is incremental, so only new files are added to the database.
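To illustrate the idea, here is a minimal sketch of embedding storage with incremental indexing in SQLite. The table layout, the function names, and the use of the file modification time as the change check are all assumptions made for this example; Scene Scout's actual schema is not described on this page.

```python
import sqlite3
import struct


def open_db(path=":memory:"):
    """Open (or create) a database with a hypothetical two-table layout."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS files ("
        " file_path TEXT PRIMARY KEY,"
        " mtime REAL NOT NULL)"
    )
    conn.execute(
        "CREATE TABLE IF NOT EXISTS scenes ("
        " file_path TEXT NOT NULL,"
        " start_sec REAL NOT NULL,"
        " end_sec REAL NOT NULL,"
        " embedding BLOB NOT NULL)"
    )
    return conn


def pack_embedding(vec):
    """Serialize a list of floats as little-endian float32 bytes."""
    return struct.pack(f"<{len(vec)}f", *vec)


def unpack_embedding(blob):
    """Inverse of pack_embedding."""
    return list(struct.unpack(f"<{len(blob) // 4}f", blob))


def needs_indexing(conn, file_path, mtime):
    """True if the file is unknown or its stored mtime differs."""
    row = conn.execute(
        "SELECT mtime FROM files WHERE file_path = ?", (file_path,)
    ).fetchone()
    return row is None or row[0] != mtime


def index_file(conn, file_path, mtime, scenes):
    """Store scenes as (start_sec, end_sec, embedding); skip unchanged files."""
    if not needs_indexing(conn, file_path, mtime):
        return False
    with conn:  # commit everything for this file in one transaction
        conn.execute("DELETE FROM scenes WHERE file_path = ?", (file_path,))
        conn.executemany(
            "INSERT INTO scenes VALUES (?, ?, ?, ?)",
            [(file_path, s, e, pack_embedding(v)) for s, e, v in scenes],
        )
        conn.execute(
            "INSERT OR REPLACE INTO files VALUES (?, ?)", (file_path, mtime)
        )
    return True
```

In this sketch, calling index_file a second time with the same path and modification time returns False without touching the database, which is the behaviour incremental indexing relies on.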

CLI & Interactive Mode

Interact with Scene Scout through the terminal. Run searches, index media, and export results as JSON. Includes an interactive REPL with tab completion, command history, and scripting support for automation.

Run Offline

After the initial download of the model and dependencies, Scene Scout runs completely offline on your own system. No internet connection required for searching or indexing.

Screenshots

A few example screenshots of the tool in action

Computer Monitor Search
Text query: computer monitor with a glowing screen
Frog Suit Search
Text query: wearing a frog costume
Glowing Eye Search
Text query: yellow glowing eye
Interactive CLI interface showing JSON output from database
CLI Text query: person wearing frog costume
Exporting a scene to a video file
Exporting a chosen scene to a video file

How to Use

After opening the GUI, follow these steps to start searching for scenes

1

Launch

After installing with the install script, start Scene Scout using the GUI or CLI script. On first launch the vision model downloads automatically.

2

Database

Create a new database via Database → Create New or drag and drop a database file onto the GUI.

3

Add to queue

You can add multiple files and/or folders to the queue for processing. To get an overview of the current queue, click the Inspect Queue button.

4

Index Media

Click Process Media Folder to extract scenes and build embeddings for all the files in the queue. This may take a while depending on the size of your collection, your hardware, and the acceleration option selected during installation. Only new or modified files are processed on subsequent runs.

5

Search

Enter a text description in the search field, load a reference image, or combine both for a multimodal query. Press Enter or click Search Scene. This can be performed for one or multiple loaded databases at once.
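How the text and image parts of a multimodal query are combined is not stated here. One common approach, shown below purely as an assumption, is a weighted average of the two normalized embeddings. The function names and the text_weight parameter are hypothetical and not part of Scene Scout.

```python
import math


def normalize(vec):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def combine_queries(text_vec, image_vec, text_weight=0.5):
    """Blend a text embedding and an image embedding into one query vector.

    Assumption: both embeddings come from the same model, so they have the
    same length and live in the same space.
    """
    if len(text_vec) != len(image_vec):
        raise ValueError("embeddings must have the same length")
    if not 0.0 <= text_weight <= 1.0:
        raise ValueError("text_weight must be between 0 and 1")
    t = normalize(text_vec)
    i = normalize(image_vec)
    mixed = [text_weight * a + (1.0 - text_weight) * b for a, b in zip(t, i)]
    return normalize(mixed)
```

Normalizing both inputs first keeps one embedding from dominating just because it has a larger magnitude.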

6

Browse & Play

Scroll through ranked results with similarity scores. Click on any result to play the scene directly in the built-in video viewer.
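As a rough picture of what a ranked result list with similarity scores means, the sketch below scores stored scene embeddings against a query embedding using cosine similarity and returns the best matches. Cosine similarity is an assumption here, as are the function names and the (label, embedding) input format.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_scenes(query_vec, scenes, top_k=5):
    """Return up to top_k (score, label) pairs, best match first.

    scenes is an iterable of (label, embedding) pairs. It can be the
    concatenation of scenes loaded from several databases.
    """
    scored = [(cosine_similarity(query_vec, vec), label) for label, vec in scenes]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]
```

Because each scene is scored independently of the others, searching several databases at once amounts to scoring all of their scenes together and sorting the combined list.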

7

Export scene

Once you have found just the right scene, you can export it to a separate file by clicking the Export Scene button.

Installation

Get up and running in three simple steps

1

Download

Get the latest release from GitHub or clone the repository

git clone https://github.com/Mark-Shun/scene-scout

2

Install

Run the install script to set up UV, Python, VLC, and dependencies

Windows: install.bat
Mac/Linux: ./install.sh

3

Launch

Start Scene Scout GUI and begin searching your video collection

Windows: scene-scout.bat
Mac/Linux: ./scene-scout.sh

GPU Acceleration

During installation you can choose the acceleration method that matches your hardware. Select the option below that applies to your system.

CUDA
NVIDIA GPU

Recommended for all NVIDIA graphics cards. Best overall performance and compatibility.

TensorRT
NVIDIA GPU

Maximum inference speed with additional NVIDIA-only optimization.

DirectML
AMD / Intel (Windows)

GPU acceleration for AMD Radeon and Intel integrated graphics on Windows systems.

Intel XPU
Intel Arc / Xe

Dedicated acceleration option for Intel Arc and Xe graphics cards.

AMD ROCm
AMD GPU (Linux)

GPU acceleration for AMD graphics cards on Linux systems. Not available on Windows.

Apple MPS
Apple Silicon

Accelerated processing on M1/M2/M3 Macs. Select the CPU option during install for MPS support.

CPU
All Systems

Universal fallback that works on any hardware. Slower than GPU options but requires no special setup.
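The options above can be summarized as a small lookup. The sketch below only restates the guidance on this page. It is not part of the install script, and the os_name and gpu_vendor labels it accepts ("nvidia", "amd", "intel", "intel-arc", "apple") are made up for this example.

```python
def suggest_acceleration(os_name, gpu_vendor):
    """Map an OS and a GPU label to one of the install options listed above.

    Hypothetical helper: the input labels are illustrative, and the mapping
    simply mirrors the descriptions on this page.
    """
    os_name = os_name.lower()
    gpu = gpu_vendor.lower()

    if gpu == "nvidia":
        # TensorRT is an additional NVIDIA-only option; CUDA is the
        # recommended default for all NVIDIA cards.
        return "CUDA"
    if gpu == "apple":
        # The page says to select the CPU option to get MPS support.
        return "CPU"
    if gpu == "intel-arc":
        return "Intel XPU"
    if os_name == "windows" and gpu in ("amd", "intel"):
        return "DirectML"
    if os_name == "linux" and gpu == "amd":
        return "AMD ROCm"
    # Universal fallback for anything not covered above.
    return "CPU"
```

For example, suggest_acceleration("linux", "amd") returns "AMD ROCm", while the same card on Windows maps to "DirectML", because ROCm is listed as unavailable on Windows.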

Need Help?

Running into issues or have a question?

If you run into problems, encounter bugs, or want to request a feature (pull requests welcome), please create an issue on the GitHub repository. Check existing issues first to avoid duplicates.

Open an Issue on GitHub

About

Created by Mark-Shun / Sonicfreak
Forked from Gabrjiele's SigLip 2 NaFlex Project

Scene Scout is a natural language scene search tool powered by Google's SigLIP 2 model with NaFlex architecture. The project was forked from Gabrjiele's open source SigLip 2 NaFlex project, with the focus shifted specifically to searching for scenes through natural language queries.

Icon and logo made by Miwo

Thanks to Zarxrax for helping out with UV.