transformers accelerate torch Pillow requests torchvision torchaudio gradio==5.49.1 gradio_client spaces opencv-python-headless datasets qwen-vl-utils pre-commit matplotlib #flash-attn