lucifertrj commited on
Commit
6d0cd10
·
verified ·
1 Parent(s): 5c5b42c

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +29 -0
README.md ADDED
@@ -0,0 +1,29 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ datasets:
4
+ - RedHenLabs/qa-news-2016
5
+ language:
6
+ - en
7
+ library_name: transformers
8
+ pipeline_tag: text-generation
9
+ ---
10
+
11
+
12
+ <h1 style="text-align: center;">Quantized GGUF version of News reporter 3B LLM</h1>
13
+ <p align="center">
14
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/630f3058236215d0b7078806/X-5xrU0p6EEVl-aKgnCXO.png" alt="Image" width="450" height="400">
15
+ </p>
16
+
17
+
18
+ ## Model Description
19
+
20
+ News Reporter 3B LLM is based on Phi-3 Mini-4K Instruct a dense decoder-only Transformer model designed to generate high-quality text based on user prompts. With 3.8 billion parameters, the model is fine-tuned using Supervised Fine-Tuning (SFT) to align with human preferences and question answer pairs.
21
+
22
+ ### Key Features:
23
+
24
+ - Parameter Count: 3.8 billion.
25
+ - Architecture: Dense decoder-only Transformer.
26
+ - Context Length: Supports up to 4,000 tokens.
27
+ - Training Data: 43.5K+ question and answer pairs curated from different News channel.
28
+
29
+ ## Model Benchmarking