Fei Zhang
commited on
Create README.md
Browse files
README.md
ADDED
|
@@ -0,0 +1,29 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: mit
|
| 3 |
+
---
|
| 4 |
+
# ConText
|
| 5 |
+
|
| 6 |
+
This is **[ConText](https://arxiv.org/abs/2506.03799)**, a powerful generalists that could do perfect text removal and segmentation framework.
|
| 7 |
+
We also first exploration of establishing a visual in-context learning (V-ICL) paradigm for fine-grained text recognition tasks, including text segmentation and removal. To achieve this, we sought a single-task-targeted baseline solution based on the prevailing V-ICL frameworks, which typically regulates in-context inference as a query-label-reconstruction process. Beyond simple task-specific fine-tuning, we proposed an end-to-end in-context generalist elicited from a task-chaining prompt that explicitly chaining up tasks as one enriched demonstration, leveraging inter-task correlations to improve the in-context reasoning capabilities. A Through quantitative and
|
| 8 |
+
qualitative experiments, we demonstrated the grounding
|
| 9 |
+
effectiveness and superiority of our framework across various
|
| 10 |
+
in-domain and out-of-domain text recognition tasks,
|
| 11 |
+
outperforming both current generalists and specialists. Overall,
|
| 12 |
+
we hope this pioneering work will encourage further
|
| 13 |
+
development of V-ICL in text recognition.
|
| 14 |
+
|
| 15 |
+
The code source is in [here](https://github.com/Ferenas/ConText).
|
| 16 |
+
|
| 17 |
+
# Model Weight & Usage
|
| 18 |
+
|
| 19 |
+
Here we provide the weights of ConText and ConTextV, you can download these checkpoints and follow the process in [here](https://github.com/Ferenas/ConText)
|
| 20 |
+
to perform OCR-level removal and segmentation. Have FUN!
|
| 21 |
+
|
| 22 |
+
## Model Performance
|
| 23 |
+
|
| 24 |
+
It reaches SOTA performance in all text segmentation and removal benchmarks.
|
| 25 |
+
|
| 26 |
+
## Model Card Contact
|
| 27 |
+
|
| 28 |
+
Feel free to contact [email protected] if you have any problem!
|
| 29 |
+
[More Information Needed]
|