prans-cs55 commited on
Commit
a4d2f09
Β·
verified Β·
1 Parent(s): 0d7204a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +71 -0
README.md CHANGED
@@ -11,3 +11,74 @@ license: mit
11
  ---
12
 
13
  Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
11
  ---
12
 
13
  Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
14
+ ___________
15
+ .
16
+
17
+ 🧠 OCR Text Extractor + Summarizer
18
+
19
+ An AI-powered tool that extracts text from images using Tesseract OCR and then summarizes it using a transformer model.
20
+ Upload any image (screenshots, photos, scanned documents, notes) β†’ Get clean extracted text + an AI summary.
21
+
22
+ πŸš€ Features
23
+
24
+ πŸ“€ Upload an image with text
25
+
26
+ πŸ”Ž Extracts text using Tesseract OCR
27
+
28
+ ✨ Summarizes extracted text using HuggingFace transformers
29
+
30
+ ⚑ Fast, simple Gradio UI
31
+
32
+ πŸ› οΈ Works on CPU β€” no GPU required
33
+
34
+ 🧩 How it Works
35
+
36
+ Image is processed with Tesseract OCR
37
+
38
+ Extracted text is cleaned
39
+
40
+ Text is fed into a pretrained summarization model
41
+
42
+ Output summary is displayed instantly
43
+
44
+ πŸ—‚οΈ Project Structure
45
+ β”œβ”€β”€ app.py
46
+ β”œβ”€β”€ requirements.txt
47
+ β”œβ”€β”€ packages.txt
48
+ └── README.md
49
+
50
+ πŸ“¦ Dependencies
51
+ Python packages (requirements.txt)
52
+ gradio
53
+ pillow
54
+ pytesseract
55
+ transformers
56
+ torch
57
+ tesseract
58
+
59
+ System packages (packages.txt)
60
+ tesseract-ocr
61
+ tesseract-ocr-eng
62
+
63
+
64
+ These ensure Tesseract OCR runs correctly on HuggingFace Spaces.
65
+
66
+ ▢️ Running Locally
67
+ pip install -r requirements.txt
68
+ python app.py
69
+
70
+ πŸ“Έ Demo
71
+
72
+ Just upload an image β†’ click Submit β†’ done!
73
+
74
+ πŸ™Œ Acknowledgements
75
+
76
+ Tesseract OCR
77
+
78
+ HuggingFace Transformers
79
+
80
+ Gradio for UI
81
+
82
+ πŸ”— Try the live Space
83
+
84
+ πŸ‘‰https://huggingface.co/spaces/prans-cs55/text_extractor