Rombo-Org
/

Rombo-LLM-V2.5-Qwen-0.5b

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

rombodawg commited on 8 days ago

Commit

1001345

·

verified ·

1 Parent(s): 83708db

Update README.md

Files changed (1) hide show

README.md +17 -42

README.md CHANGED Viewed

@@ -1,42 +1,17 @@
----
-base_model: []
-library_name: transformers
-tags:
-- mergekit
-- merge
----
-# Rombo-LLM-V2.5-qwen-2.5-0.5b
-This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
-## Merge Details
-### Merge Method
-This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using E:\Open_source_ai_chatbot\OOBA-13\text-generation-webui-main\models\unsloth_Qwen2.5-0.5B as a base.
-### Models Merged
-The following models were included in the merge:
-* E:\Open_source_ai_chatbot\OOBA-13\text-generation-webui-main\models\unsloth_Qwen2.5-0.5B-Instruct
-### Configuration
-The following YAML configuration was used to produce this model:
-```yaml
-models:
-  - model: E:\Open_source_ai_chatbot\OOBA-13\text-generation-webui-main\models\unsloth_Qwen2.5-0.5B-Instruct
-    parameters:
-      weight: 1
-      density: 1
-merge_method: ties
-base_model: E:\Open_source_ai_chatbot\OOBA-13\text-generation-webui-main\models\unsloth_Qwen2.5-0.5B
-parameters:
-  weight: 1
-  density: 1
-  normalize: true
-  int8_mask: false
-dtype: bfloat16
-```

+---
+library_name: transformers
+base_model:
+- Qwen/Qwen2.5-0.5B-Instruct
+license: apache-2.0
+---
+# Rombos-LLM-V2.5-Qwen-0.5b
+![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/642cc1c253e76b4c2286c58e/oL_yvvRsWj2C4niGgkT2A.jpeg)
+Rombos-LLM-V2.5-Qwen-0.5b is a continues finetuned version of Qwen2.5-0.5B. I noticed recently that the Qwen team did not learn from my methods of continuous finetuning, the great benefits, and no downsides of it. So I took it upon myself to merge the instruct model with the base model myself using the *Ties* merge method
+This version of the model shows higher performance than the original instruct and base models.
+Quants:
+GGUF: https://huggingface.co/bartowski/Replete-LLM-V2.5-Qwen-0.5b-GGUF