---
language: en
license: apache-2.0
base_model: Qwen/Qwen2.5-Coder-1.5B-Instruct
tags:
- qwen2.5
- cloud
- azure
- aws
- terraform
- docker
- kubernetes
- linux
- iac
---

# Cloud Expert Qwen

This is a version 1 of Qwen 2.5-Coder 1.5B fine-tuned for cloud computing & IaC

Fine-tuned on comprehensive cloud computing, Infrastructure as Code, containerization, and Linux system administration documentation.

## 🎯 What This Model Knows

- **Cloud Platforms**: Azure, AWS, GCP
- **Infrastructure as Code**: Terraform, CloudFormation, ARM templates
- **Containers & Orchestration**: Docker, Kubernetes
- **Linux**: System administration, troubleshooting, networking
- **DevOps**: CI/CD, monitoring, security best practices

## 🚀 Quick Start

### Option 1: Use with Transformers (Python)
```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel
import torch

# Load base model
base_model = "Qwen/Qwen2.5-Coder-1.5B-Instruct"
model = AutoModelForCausalLM.from_pretrained(
    base_model,
    torch_dtype=torch.float16,
    device_map="auto",
    trust_remote_code=True
)

# Load fine-tuned LoRA adapters
model = PeftModel.from_pretrained(model, "jsdjsdequinia/cloud-expert-qwen/lora-adapters")
tokenizer = AutoTokenizer.from_pretrained(base_model, trust_remote_code=True)

# Ask a question
question = "How do I troubleshoot SSH connection issues on Linux?"
prompt = f"<|im_start|>user\n{question}<|im_end|>\n<|im_start|>assistant\n"

inputs = tokenizer(prompt, return_tensors="pt").to("cuda")
outputs = model.generate(**inputs, max_new_tokens=200, temperature=0.7)
response = tokenizer.decode(outputs[0], skip_special_tokens=True)

print(response)
```

### Option 2: Use with Ollama (Recommended for CPU)

**Perfect for laptops without GPU!**

1. **Install Ollama**: https://ollama.ai/download

2. **Download the model**:
```bash
   huggingface-cli download jsdjsdequinia/cloud-expert-qwen cloud-expert-qwen-q8_0.gguf --local-dir ./
   huggingface-cli download jsdjsdequinia/cloud-expert-qwen Modelfile --local-dir ./
```

3. **Create Ollama model**:
```bash
   ollama create cloud-expert -f Modelfile
```

4. **Run it**:
```bash
   ollama run cloud-expert
```

5. **Or use in code**:
```python
   import ollama
   
   response = ollama.chat(model='cloud-expert', messages=[
       {'role': 'user', 'content': 'What is Azure Virtual Machine?'}
   ])
   print(response['message']['content'])
```

## 📦 Available Formats

| Format | Size | Use Case | Download |
|--------|------|----------|----------|
| **LoRA Adapters** | ~100MB | Fine-tuning, GPU inference | `lora-adapters/` |
| **Merged Model** | ~3GB | Full model, GPU inference | `merged-model/` |
| **GGUF (q8_0)** | ~1.5GB | CPU inference with Ollama | `*.gguf` |

## 📊 Training Details

- **Base Model**: Qwen/Qwen2.5-Coder-1.5B-Instruct
- **Training Method**: LoRA (Low-Rank Adaptation)
- **Training Data**: 43 examples
  - Manual Q&A pairs on cloud services
  - Scraped official documentation (Azure, Docker, Kubernetes, etc.)
  - Linux troubleshooting guides
- **Training Time**: ~20-30 minutes on RTX 3070
- **Trainable Parameters**: ~2% (LoRA efficient training)

## 💡 Example Questions
```
- What is Microsoft Azure?
- How do I deploy a Docker container?
- Explain Terraform state management
- How do I troubleshoot disk usage on Linux?
- Compare Azure VM vs AWS EC2
- What are Kubernetes best practices?
- How do I configure a Linux firewall?
```

## 🖥️ System Requirements

### For Training:
- GPU with 8GB+ VRAM
- 16GB RAM
- CUDA 12.1+

### For Inference:
**With Transformers (GPU):**
- GPU with 4GB+ VRAM
- 8GB RAM

**With Ollama (CPU - Recommended for work laptops):**
- Any modern CPU
- 4GB RAM
- No GPU needed! ✅

## ⚡ Performance

| Setup | Tokens/Second | Use Case |
|-------|---------------|----------|
| GPU (RTX 3070) | ~50 tok/s | Development, training |
| CPU (Ollama, 16GB RAM) | ~10-15 tok/s | Work laptop, portable |

## 🎓 Use Cases

✅ Learning cloud technologies  
✅ Quick reference for DevOps tasks  
✅ Understanding IaC best practices  
✅ Linux troubleshooting assistance  
✅ Comparing cloud services  
✅ Interview preparation  

## ⚠️ Limitations

- Knowledge cutoff: Training data as of 2024
- May not reflect very recent service updates
- Best for general concepts and established practices
- Always verify critical production decisions with official docs
- Not a replacement for hands-on experience

## 📜 License

Apache 2.0 - Free for commercial and personal use

## 🙏 Credits

- Base model: [Qwen/Qwen2.5-Coder-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-Coder-1.5B-Instruct)
- Fine-tuned by: jsdjsdequinia
- Documentation sources: Microsoft Azure, Docker, Kubernetes, DigitalOcean, HashiCorp

## 🐛 Feedback

Found an issue or have suggestions? Feel free to open an issue on the model page!

---

**Built with ❤️ for the DevOps community**