Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
|
@@ -11,3 +11,74 @@ license: mit
|
|
| 11 |
---
|
| 12 |
|
| 13 |
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 11 |
---
|
| 12 |
|
| 13 |
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
|
| 14 |
+
___________
|
| 15 |
+
.
|
| 16 |
+
|
| 17 |
+
π§ OCR Text Extractor + Summarizer
|
| 18 |
+
|
| 19 |
+
An AI-powered tool that extracts text from images using Tesseract OCR and then summarizes it using a transformer model.
|
| 20 |
+
Upload any image (screenshots, photos, scanned documents, notes) β Get clean extracted text + an AI summary.
|
| 21 |
+
|
| 22 |
+
π Features
|
| 23 |
+
|
| 24 |
+
π€ Upload an image with text
|
| 25 |
+
|
| 26 |
+
π Extracts text using Tesseract OCR
|
| 27 |
+
|
| 28 |
+
β¨ Summarizes extracted text using HuggingFace transformers
|
| 29 |
+
|
| 30 |
+
β‘ Fast, simple Gradio UI
|
| 31 |
+
|
| 32 |
+
π οΈ Works on CPU β no GPU required
|
| 33 |
+
|
| 34 |
+
π§© How it Works
|
| 35 |
+
|
| 36 |
+
Image is processed with Tesseract OCR
|
| 37 |
+
|
| 38 |
+
Extracted text is cleaned
|
| 39 |
+
|
| 40 |
+
Text is fed into a pretrained summarization model
|
| 41 |
+
|
| 42 |
+
Output summary is displayed instantly
|
| 43 |
+
|
| 44 |
+
ποΈ Project Structure
|
| 45 |
+
βββ app.py
|
| 46 |
+
βββ requirements.txt
|
| 47 |
+
βββ packages.txt
|
| 48 |
+
βββ README.md
|
| 49 |
+
|
| 50 |
+
π¦ Dependencies
|
| 51 |
+
Python packages (requirements.txt)
|
| 52 |
+
gradio
|
| 53 |
+
pillow
|
| 54 |
+
pytesseract
|
| 55 |
+
transformers
|
| 56 |
+
torch
|
| 57 |
+
tesseract
|
| 58 |
+
|
| 59 |
+
System packages (packages.txt)
|
| 60 |
+
tesseract-ocr
|
| 61 |
+
tesseract-ocr-eng
|
| 62 |
+
|
| 63 |
+
|
| 64 |
+
These ensure Tesseract OCR runs correctly on HuggingFace Spaces.
|
| 65 |
+
|
| 66 |
+
βΆοΈ Running Locally
|
| 67 |
+
pip install -r requirements.txt
|
| 68 |
+
python app.py
|
| 69 |
+
|
| 70 |
+
πΈ Demo
|
| 71 |
+
|
| 72 |
+
Just upload an image β click Submit β done!
|
| 73 |
+
|
| 74 |
+
π Acknowledgements
|
| 75 |
+
|
| 76 |
+
Tesseract OCR
|
| 77 |
+
|
| 78 |
+
HuggingFace Transformers
|
| 79 |
+
|
| 80 |
+
Gradio for UI
|
| 81 |
+
|
| 82 |
+
π Try the live Space
|
| 83 |
+
|
| 84 |
+
πhttps://huggingface.co/spaces/prans-cs55/text_extractor
|