AI & ML interests

None defined yet.

Recent Activity

Smith42 updated a dataset about 11 hours ago
hugging-science/mmu_legacysurvey_dr10_south_21
Smith42 updated a dataset 3 days ago
hugging-science/mmu_apogee_dr17
Smith42 published a dataset 3 days ago
hugging-science/mmu_apogee_dr17

Articles

AINovice2005 posted an update 3 days ago
Excited to share the release of dagster-hf-datasets, a Dagster-native integration that brings Hugging Face Datasets into Dagster's asset-oriented orchestration model.

The integration enables:

• 🤗 Dataset and DatasetDict assets
• 🐙 Dagster asset lineage and observability
• 📦 Parquet-backed materialization via HFParquetIOManager
• 🚀 Publishing curated datasets back to the Hugging Face Hub
• 📝 Automatic dataset card generation from pipeline metadata
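
To make the last bullet concrete, here is a minimal sketch of what rendering a dataset card from pipeline metadata could look like. It is not the dagster-hf-datasets implementation: the function name `render_dataset_card` and the metadata keys (`license`, `tags`, `num_rows`, `columns`, `asset_key`) are assumptions made for illustration. The only grounded detail is that a Hub dataset card is a README.md file that starts with a YAML metadata block.

```python
def render_dataset_card(name: str, metadata: dict) -> str:
    """Render a minimal dataset card (README.md text) from pipeline metadata.

    Hypothetical helper for illustration; not part of dagster-hf-datasets.
    """
    # YAML metadata block that the Hub reads from the top of README.md.
    front_matter = ["---", f"license: {metadata.get('license', 'unknown')}"]
    tags = metadata.get("tags", [])
    if tags:
        front_matter.append("tags:")
        front_matter.extend(f"- {tag}" for tag in tags)
    front_matter.append("---")

    # Human-readable body built from values a pipeline run would record.
    body = [
        f"# {name}",
        "",
        f"Rows: {metadata['num_rows']}",
        f"Columns: {', '.join(metadata['columns'])}",
        f"Materialized by asset: {metadata['asset_key']}",
    ]
    return "\n".join(front_matter + [""] + body) + "\n"
```

In a real pipeline the metadata dict would presumably come from the asset's materialization event rather than being written by hand.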

As the Hub continues to grow past 1M datasets, orchestration, reproducibility, and observability are becoming increasingly important parts of the dataset lifecycle. I'm also working on a longer article covering the architecture and data pipelines enabled by the integration.

More Soon!

https://github.com/dagster-io/community-integrations/tree/main/libraries/dagster-hf-datasets

https://github.com/dagster-io/community-integrations/tree/main/libraries/dagster-hf-datasets/docs

Shrijanagain posted an update 11 days ago
We are pleased to announce that the W-IMG Vision Dataset infrastructure is officially live.

The complete asset infrastructure is now accessible on Hugging Face for internal validation and architecture scaling targets.

Dataset Endpoint - sKT-Ai-Labs/W-IMG

#SovereignAI #ComputerVision #MachineLearning #OpenSource

AINovice2005 posted an update 27 days ago
AINovice2005 posted an update about 2 months ago
I've built a system to make open-source contributions easier to understand across repositories.

It:

• aggregates merged external PRs (reviewed by maintainers)
• structures them into a single contributions.md
• adds a lightweight AI layer to query patterns and impact
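
The first two steps could be sketched roughly as follows. This is an illustration under stated assumptions, not the author's code: the function name `build_contributions_md` and the record fields (`repo`, `repo_owner`, `author`, `merged`, `merged_at`, `title`, `number`) are hypothetical, and "external" is read here as "merged into a repository the author does not own". Fetching the PRs and the AI query layer are left out.

```python
from collections import defaultdict


def build_contributions_md(pull_requests: list[dict], author: str) -> str:
    """Group an author's merged external PRs by repository as markdown text.

    Hypothetical sketch; the record fields are assumed, not taken from the post.
    """
    by_repo = defaultdict(list)
    for pr in pull_requests:
        # Keep only merged PRs by the author to repositories they do not own.
        if pr["merged"] and pr["author"] == author and pr["repo_owner"] != author:
            by_repo[pr["repo"]].append(pr)

    lines = ["# Contributions", ""]
    for repo in sorted(by_repo):
        lines.append(f"## {repo}")
        # Newest first; ISO dates (YYYY-MM-DD) sort correctly as strings.
        for pr in sorted(by_repo[repo], key=lambda p: p["merged_at"], reverse=True):
            lines.append(f"- {pr['merged_at']}: {pr['title']} (#{pr['number']})")
        lines.append("")
    return "\n".join(lines)
```

Writing the returned string to contributions.md gives one section per repository, which is the "readable changelog" shape the post describes.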

The idea is to move from scattered PRs to a readable changelog of work.

Read about it: https://medium.com/@paragekbote23/from-commits-to-impact-building-an-automated-changelog-for-open-source-contributions-20cdfebcee58

Shrijanagain posted an update about 2 months ago
sKT-Ai-Labs

Join soon. We will be publishing tokens shortly, so join now and get started. We will also be turning off the join request button soon, so if you want to join, do it quickly.