Skip to content

Error: Qwen2.5 14B Accurate - 500 #11267

@TuncayGumusdere

Description

@TuncayGumusdere

Error Details

Model: Qwen2.5 14B Accurate
Provider: ollama
Status Code: 500

Error Output

"llama runner process has terminated: error loading model: unable to allocate CUDA0 buffer\nllama_model_load_from_file_impl: failed to load model"

Additional Context
Please add any additional context about the error here

Metadata

Metadata

Assignees

Labels

kind:bugIndicates an unexpected problem or unintended behavioros:linuxHappening specifically on Linux

Type

No type

Projects

Status

Todo

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions