Instructions to use Tiiny/ReluFalcon-40B-PowerInfer-GGUF with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Tiiny/ReluFalcon-40B-PowerInfer-GGUF with Transformers:
    # Load model directly
    from transformers import AutoModel
    model = AutoModel.from_pretrained("Tiiny/ReluFalcon-40B-PowerInfer-GGUF", dtype="auto")
- Notebooks
- Google Colab
- Kaggle
ReluFalcon-40B-PowerInfer-GGUF
- Original model: SparseLLM/ReluFalcon-40B
- Converted & distributed by: PowerInfer
This model is a downstream distribution of SparseLLM/ReluFalcon-40B in PowerInfer GGUF format. It packages the LLM weights together with the predictor weights.
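Because the weights ship as GGUF files, one way to fetch them for local use is through the `huggingface_hub` client. The following is a minimal sketch, not the distributor's documented procedure: the helper names `pick_gguf` and `download_gguf` and the `./ReluFalcon-40B` target directory are hypothetical, and the file names inside the repository are not assumed, only their `.gguf` extension.

```python
from typing import Iterable, List

REPO_ID = "Tiiny/ReluFalcon-40B-PowerInfer-GGUF"  # repository name as given on this page


def pick_gguf(filenames: Iterable[str]) -> List[str]:
    """Return the names that end in .gguf (case-insensitive), sorted."""
    return sorted(name for name in filenames if name.lower().endswith(".gguf"))


def download_gguf(repo_id: str = REPO_ID, local_dir: str = "./ReluFalcon-40B") -> List[str]:
    """Download every .gguf file in the repo and return the local paths.

    Assumes `pip install huggingface_hub`, network access, and enough disk
    space for a 40B-parameter model. The local_dir default is illustrative.
    """
    # Imported lazily so pick_gguf stays usable without the library installed.
    from huggingface_hub import hf_hub_download, list_repo_files

    paths = []
    for name in pick_gguf(list_repo_files(repo_id)):
        paths.append(hf_hub_download(repo_id=repo_id, filename=name, local_dir=local_dir))
    return paths


# Usage (performs a large download):
# for path in download_gguf():
#     print(path)
```

Filtering on the extension keeps the sketch independent of how the repository names its files. The downloaded files are intended for a PowerInfer build, whose own documentation covers how to run them.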