Argus - The All-Seeing File Search Tool

Named after Argus Panoptes, the all-seeing giant from Greek mythology, Argus is a powerful CLI tool that searches for text across any file format.

Features

Universal File Search: Search through PDFs, Word documents (.docx), images (with OCR), text files, and code files
Interactive TUI: Run argus with no arguments to open a full-screen dashboard — compose a query with chips for file types, toggles for every option, a live progress bar, and a results pane with preview. Loop from result back to a fresh search without leaving the app.
Fast Parallel Processing: Leverages multi-core CPUs with Rayon for blazing-fast searches
Index Caching: Save extracted text to an index file for instant subsequent searches
Beautiful CLI: Colorful output with file type icons, confidence bars, and match highlighting
Interactive Selection: Navigate results with arrow keys and open files instantly
Regex Support: Full regex pattern matching when you need precise searches
OCR Capability: Extract and search text from images using Tesseract (fast, native C++ via leptess) or ocrs (pure-Rust ONNX engine, higher accuracy on modern docs) — both selectable from the CLI and the TUI
Cross-Platform: Works on Linux and Windows

Installation

From Source

# Clone the repository
git clone https://github.com/Aswikinz/Argus.git
cd argus

# Build without OCR (faster build, smaller binary)
cargo build --release

# Build with Tesseract OCR support (requires system Tesseract)
cargo build --release --features ocr

# Build with the pure-Rust ocrs ONNX backend (no Tesseract needed,
# downloads ~25 MB of models on first use)
cargo build --release --features ocrs

# Install to your PATH
cargo install --path .

Prerequisites

Rust 1.82+: Install from rustup.rs
Tesseract (optional, for OCR):
- Ubuntu/Debian: sudo apt install tesseract-ocr libtesseract-dev libleptonica-dev
- Fedora: sudo dnf install tesseract tesseract-devel leptonica-devel
- Windows: Download from UB-Mannheim/tesseract
- macOS: brew install tesseract

Usage

Interactive TUI (recommended for first-time users)

Run argus with no arguments and you land in a full-screen dashboard:

argus

What you get:

Section	What it does
search for	the text/regex you're looking for
in folder	where to search (defaults to the directory you launched from, `~/` and `$HOME` are expanded)
file types	a row of toggleable chips for 22 common extensions — pick as many as you want, leave all unticked to scan everything
options	case-sensitive, regex, OCR, include hidden, preview matches, use saved index, save/update index
OCR engine	switch between Tesseract and `ocrs` live (only meaningful when OCR is enabled)
max depth	cycle through `unlimited / 1 / 2 / 3 / 5 / 10 / 20` levels
limit	slider for the top-N cap on displayed matches
▶ run	runs the search

Key bindings:

Tab / Shift+Tab — move between sections
Space — toggle the highlighted chip / option (or flip the OCR engine)
← / → — adjust sliders and pickers (extensions, OCR engine, max depth, limit)
Enter — run the search from any field
Esc — quit from Setup, go back from Results
↑ / ↓ + Enter — navigate the results list and open a file
n — new search (keeps your filters, clears the query)
b — back to Setup with everything preserved so you can tweak and re-run
Ctrl+C — quit from anywhere

The Searching phase shows a live current / total files progress bar, so you can see exactly how the scan is going.

Command-line usage

# Basic search in current directory
argus "search term"

# Search in a specific directory
argus -d /path/to/project "function"

# Case-sensitive search
argus -s "TODO"

# Use regex pattern
argus -r "\bfn\s+\w+"

# Search only specific file types
argus -e pdf,docx,txt "report"

# Enable OCR for images (requires --features ocr or --features ocrs)
argus -o "text in screenshot"

# Use the ocrs ONNX engine instead of Tesseract
argus -o --ocr-engine ocrs "text in screenshot"

# Show content preview
argus -p "error"

# Limit results
argus -l 50 "warning"

# Include hidden files
argus -H ".env"

# Set maximum directory depth
argus --max-depth 3 "config"

# Non-interactive mode (just print results)
argus -n "TODO"

# Save index for faster future searches
argus -i "pattern"

# Use existing index (skip re-extraction for unchanged files)
argus -I "pattern"

# Save and use index together (recommended for repeated searches)
argus -iI "pattern"

# Use a custom index file location
argus -i --index-file ~/my_index.json "pattern"

Command Line Options

Flag	Long	Description	Default
`[PATTERN]`		Search pattern — omit to launch the interactive TUI	-
`-d`	`--directory`	Directory to search	Current dir
`-l`	`--limit`	Maximum results	20
`-s`	`--case-sensitive`	Case-sensitive search	Off
`-o`	`--ocr`	Enable OCR for images	Off
	`--ocr-engine`	Which OCR backend (`tesseract` \| `ocrs`)	Depends on enabled feature
`-r`	`--regex`	Use regex matching	Off
`-p`	`--preview`	Show match previews	Off
`-e`	`--extensions`	Filter by extensions (comma-separated)	All
	`--max-depth`	Max directory depth	Unlimited
`-H`	`--hidden`	Include hidden files	Off
`-n`	`--non-interactive`	Non-interactive mode	Off
`-i`	`--save-index`	Save index after scanning	Off
`-I`	`--use-index`	Use existing index	Off
	`--index-file`	Custom index file path	`.argus_index.json`

Every option above is also surfaced in the TUI (with the exception of --index-file, which is an advanced override kept on the CLI only; the TUI uses whatever path is in effect when you launched argus, defaulting to .argus_index.json).

Output Example

╔══════════════════════════════════════════════════════════════════╗
║  ARGUS - The All-Seeing Search Tool                              ║
╚══════════════════════════════════════════════════════════════════╝

  Stats: 1,234 files scanned, 42 matches in 8 files • 1.23s
  Types: PDF: 3 • Code: 4 • Text: 1

  Found 8 files with matches:

  #1  README.md • 12 matches [████████████ 100%]
      .../project/README.md
      "TODO: implement feature..."

  #2  src/main.rs • 8 matches [██████████░░ 83%]
      .../project/src/main.rs

  #3  docs/guide.pdf • 5 matches [████████░░░░ 67%]
      .../project/docs/guide.pdf

Supported File Types

Category	Extensions
Text	txt, md, markdown, rst, log, csv, json, yaml, yml, toml, xml, html
Code	rs, py, js, ts, jsx, tsx, java, c, cpp, go, rb, php, swift, and 40+ more
Documents	pdf, docx
Images (OCR)	png, jpg, jpeg, gif, bmp, tiff, webp

Build Scripts

Linux/macOS

./build.sh

Windows

build.bat

Architecture

src/
├── main.rs            # CLI entry point; routes to the TUI or the classic CLI flow
├── types.rs           # Core data structures (SearchResult, Match, FileType, configs)
├── search.rs          # Parallel search engine with a shared progress handle
├── extractors.rs      # Text extraction for each file format
├── index.rs           # Index caching for extracted text
├── ocrs_backend.rs    # Pure-Rust ocrs ONNX OCR backend (feature = "ocrs")
├── ui.rs              # Classic terminal output and interactive file selector
└── tui.rs             # Full-screen ratatui dashboard (Setup → Searching → Results)

Indexing

Argus can cache extracted text to an index file, making subsequent searches nearly instant. This is especially useful for:

Large codebases or document collections
Directories with PDFs, DOCX files, or images (expensive to extract)
Repeated searches with different patterns

How it works

First run with -i: Argus scans files, extracts text, and saves to .argus_index.json
Subsequent runs with -I: Argus loads the index and skips extraction for unchanged files
Smart invalidation: Modified files (different timestamp/size) are automatically re-extracted
New files: Automatically detected and added to the index

Index file format

The index is stored as human-readable JSON:

{
  "version": 1,
  "directory": "/path/to/searched/dir",
  "created_at": 1234567890,
  "updated_at": 1234567890,
  "entries": {
    "/path/to/file.txt": {
      "path": "/path/to/file.txt",
      "file_type": "Text",
      "extracted_text": "file contents...",
      "modified_timestamp": 1234567890,
      "file_size": 1234
    }
  }
}

Performance Tips

Use indexing (-iI) for directories you search frequently
Use extension filters (-e) when you know the file types
Set max depth (--max-depth) for large directory trees
Use literal search instead of regex when possible
OCR Performance: When OCR is enabled, Argus uses thread-local Tesseract instances to avoid re-initialization overhead, enabling efficient parallel image processing across multiple CPU cores
Faster OCR models: Install tesseract-langpack-eng-fast (Fedora) or equivalent for ~2-3x faster OCR with slightly lower accuracy

Troubleshooting

OCR not working

Ensure Tesseract is installed and in your PATH
Rebuild with: cargo build --release --features ocr
Check Tesseract works: tesseract --version

Permission denied errors

Some files may be unreadable due to permissions. Argus will skip these and continue searching.

Large files

Files over 50MB are automatically skipped to prevent memory issues.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

MIT License - see LICENSE for details.

Acknowledgments

Named after Argus Panoptes, the hundred-eyed giant from Greek mythology
Built with amazing Rust crates: clap, rayon, walkdir, colored, dialoguer, indicatif, ratatui, crossterm, and more

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
.github		.github
.idea		.idea
src		src
tests		tests
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
README.md		README.md
build.bat		build.bat
build.sh		build.sh
codecov.yml		codecov.yml
rustfmt.toml		rustfmt.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Argus - The All-Seeing File Search Tool

Features

Installation

From Source

Prerequisites

Usage

Interactive TUI (recommended for first-time users)

Command-line usage

Command Line Options

Output Example

Supported File Types

Build Scripts

Linux/macOS

Windows

Architecture

Indexing

How it works

Index file format

Performance Tips

Troubleshooting

OCR not working

Permission denied errors

Large files

Contributing

License

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Argus - The All-Seeing File Search Tool

Features

Installation

From Source

Prerequisites

Usage

Interactive TUI (recommended for first-time users)

Command-line usage

Command Line Options

Output Example

Supported File Types

Build Scripts

Linux/macOS

Windows

Architecture

Indexing

How it works

Index file format

Performance Tips

Troubleshooting

OCR not working

Permission denied errors

Large files

Contributing

License

Acknowledgments

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages