metadata
license: cc-by-4.0
tags:
- requests
- gguf
- quantized
Welcome to my GGUF-IQ-Imatrix Model Quantization Requests card!
Read bellow for more information.
Requirements to request model quantizations:
For the model:
- Maximum model parameter size of 11B.
At the moment I am unable to accept requests for larger models due to hardware/time limitations.
Important:
- Fill the request template as outlined in the next section.
How to request a model quantization:
Open a New Discussion with a title of "
Request: Model-Author/Model-Name
", for example, "Request: Nitral-AI/Infinitely-Laydiculous-7B
".Include the following template in your message and fill the information (example request here):
**Model name:**
**Model link:**
**Brief description:**
**Additonal quants (if you want any):**
Default list quants for reference:
"Q4_K_M", "Q4_K_S", "IQ4_XS", "Q5_K_M", "Q5_K_S",
"Q6_K", "Q8_0", "IQ3_M", "IQ3_S", "IQ3_XXS"
]´]]
**An image to represent the model (square shaped):**