ripgrep-all File Content Search

ripgrep-all File Content Search

github.com

2

About this website

ripgrep-all (rga) is a free and open-source command-line tool that extends ripgrep to search through not only plain text files but also a wide variety of file formats including PDFs, e-books, Office documents, compressed archives, images (via OCR), SQLite databases, and more. Created by Philipp K. (phiresky on GitHub) in 2019, rga wraps the popular ripgrep tool and adds adapter-based content extraction, making it possible to grep across heterogeneous file collections. Key features: file format support via adapters: PDF (via pdftotext), Office documents (docx, xlsx, pptx via pandoc), e-books (epub via pandoc), SQLite databases (via sqlite3), compressed archives (zip, tar, gz, bz2, xz), images (jpg, png via Tesseract OCR), subtitles (srt, vtt), web archives, and more. Each adapter extracts text content from the file and passes it to ripgrep for searching. Caching: rga caches extracted text content in a local cache directory (.cache/rga or configurable) to speed up repeated searches. On subsequent searches, cached content is reused without re-extraction. The cache is populated incrementally. Adapter system: adapters are pluggable and can be enabled/disabled via command-line flags. Custom adapters can be written as scripts. The rga --rga-list-adapters command lists available adapters. Compression: rga can search inside compressed archives (zip, tar.gz, tar.bz2, tar.xz, gzip) by transparently decompressing and searching the contents. SQLite: searches through table data in SQLite databases by dumping tables as CSV and searching. Performance: uses ripgrep's fast search engine, multithreading, and the caching mechanism to provide near-instant results even for large file collections. Integration: rga can be used with fzf for interactive file content search. Cross-platform (Rust). AGPL-3.0.

Statistics

2
Views
0
Clicks
0
Like
0
Dislike

Comments

Log In to post a comment

No comments yet. Be the first!