---
title: Text Extractor
emoji: 👀
colorFrom: gray
colorTo: indigo
sdk: gradio
sdk_version: 6.0.2
app_file: app.py
pinned: false
license: mit
---

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

___________

# 🧠 OCR Text Extractor + Summarizer

An AI-powered tool that extracts text from images using Tesseract OCR and then summarizes it using a transformer model.

Upload any image (screenshots, photos, scanned documents, notes) → Get clean extracted text + an AI summary.

## 🚀 Features

- 📤 Upload an image with text
- 🔎 Extracts text using Tesseract OCR
- ✨ Summarizes extracted text using HuggingFace transformers
- ⚡ Fast, simple Gradio UI
- 🛠️ Works on CPU, no GPU required

## 🧩 How it Works

1. Image is processed with Tesseract OCR
2. Extracted text is cleaned
3. Text is fed into a pretrained summarization model
4. Output summary is displayed instantly

## 🗂️ Project Structure

    ├── app.py
    ├── requirements.txt
    ├── packages.txt
    └── README.md

## 📦 Dependencies

Python packages (requirements.txt)

- gradio
- pillow
- pytesseract
- transformers
- torch
- tesseract

System packages (packages.txt)

- tesseract-ocr
- tesseract-ocr-eng

These ensure Tesseract OCR runs correctly on HuggingFace Spaces.

## ▶️ Running Locally

    pip install -r requirements.txt
    python app.py

## 📸 Demo

Just upload an image → click Submit → done!

## 🙌 Acknowledgements

- Tesseract OCR
- HuggingFace Transformers
- Gradio for UI

## 🔗 Try the live Space

👉 https://huggingface.co/spaces/prans-cs55/text_extractor
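
## 🧪 Pipeline Sketch

The four steps listed under "How it Works" could be wired together roughly as shown below. This is a minimal sketch, not the actual contents of `app.py`: the function names (`clean_text`, `extract_text`, `summarize`, `process`), the cleaning rules, and the `max_length` / `min_length` values are all illustrative assumptions. The `summarizer` argument is assumed to be any callable that behaves like a transformers summarization pipeline, returning a list of dicts with a `summary_text` key.

```python
import re


def clean_text(raw: str) -> str:
    """Normalize raw OCR output (hypothetical cleaning rules)."""
    # Rejoin words that were hyphenated across a line break: "extrac-\ntion" -> "extraction"
    text = re.sub(r"-\s*\n\s*", "", raw)
    # Collapse newlines, tabs, and runs of spaces into single spaces
    text = re.sub(r"\s+", " ", text)
    return text.strip()


def extract_text(image) -> str:
    """Run Tesseract on a PIL image; requires the tesseract-ocr system package."""
    import pytesseract  # imported lazily so clean_text stays usable without it

    return clean_text(pytesseract.image_to_string(image, lang="eng"))


def summarize(text: str, summarizer) -> str:
    """Summarize text with a pipeline-like callable (assumed interface)."""
    if not text:
        return ""  # nothing was recognized, so there is nothing to summarize
    result = summarizer(text, max_length=130, min_length=30, do_sample=False)
    return result[0]["summary_text"]


def process(image, summarizer):
    """Full flow: OCR -> clean -> summarize. Returns (text, summary)."""
    text = extract_text(image)
    return text, summarize(text, summarizer)
```

In a Gradio app, a function like `process` (with the summarizer bound in advance) would typically be passed as the `fn` of a `gr.Interface` that takes an image input and returns two text outputs. Keeping the summarizer as a parameter means the model is loaded once at startup and can be swapped for a stub when testing.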