Video & Image OCR Tool

A Streamlit-based web application to extract text from uploaded images and videos using EasyOCR.

Features

Image OCR: Upload an image (PNG, JPG, JPEG) and extract text.
Video OCR: Upload a video (MP4, MOV, AVI) and extract text from frames at a specified sampling rate.
Video ASR: Extract text from the audio channel of videos using automatic speech recognition.
- Supports local inference using Hugging Face models (e.g., OpenAI Whisper, Wav2Vec2).
- Uses GPU acceleration if available.
Multi-language Support: Support for English, Chinese (Simplified), French, German, Spanish, Japanese, Korean, and Dutch.
Downloadable Results: Download the extracted text as a .txt file.

Installation

Clone the repository (if applicable) or ensure you have the project files.
Install the required dependencies:

pip install -r requirements.txt

Note: For video audio extraction, ffmpeg is required. moviepy usually handles this, but if you encounter issues, ensure ffmpeg is installed on your system.

Running the App

Run the Streamlit application using the following command:

streamlit run app.py

Usage

Select Languages: Use the sidebar to select one or more languages for OCR. The default is English.
Upload File: Upload an image or video file.
Extract Text:
- For Images: The extracted text will be displayed immediately after clicking "Extract Text".
- For Videos: You can adjust the sampling rate (seconds per frame) in the sidebar. Click "Extract Text from Video" to process.
Download: Click the "Download Text" button to save the results.

Notes

First run might be slow as EasyOCR downloads the necessary models.
Video processing can be time-consuming depending on the video length and sampling rate.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.gitignore		.gitignore
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Video & Image OCR Tool

Features

Installation

Running the App

Usage

Notes

About

Uh oh!

Releases

Packages

Languages

lucwens/VideoTranscriber

Folders and files

Latest commit

History

Repository files navigation

Video & Image OCR Tool

Features

Installation

Running the App

Usage

Notes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages