yujiepan committed
Commit 7e34f4c
1 Parent(s): 250e770

Create README.md

Files changed (1)
  1. README.md +37 -0
README.md ADDED
@@ -0,0 +1,37 @@
+ ---
+ pipeline_tag: text-generation
+ inference: true
+ widget:
+   - text: 'Hello!'
+     example_title: Hello world
+     group: Python
+ library_name: transformers
+ ---
+
+ This model is randomly initialized, using the config from [Qwen/Qwen-VL-Chat](https://huggingface.co/Qwen/Qwen-VL-Chat/blob/main/config.json) but with a smaller size.
+ Note that the model is in float16.
+
+ Notable modifications:
+
+ ```python
+ config.fp16 = True
+ config.hidden_size = 8
+ config.intermediate_size = 16
+ config.kv_channels = 4
+ config.num_attention_heads = 2
+ config.num_hidden_layers = 2
+ config.seq_length = 2048
+
+ config.visual = {
+     "heads": 2,
+     "image_size": 448,
+     "image_start_id": 151857,
+     "layers": 2,
+     "mlp_ratio": 1.0,
+     "output_dim": 8,
+     "patch_size": 14,
+     "width": 8,
+ }
+ ```
+
+ We also changed the attention head dimension of the visual model. See `upload_model.py` for details.
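
As a quick sanity check on the values in the added README, the sketch below derives a few shapes from them. It is illustrative only and is not taken from `upload_model.py`: the helper name `derived_shapes` is hypothetical, and it assumes the usual conventions that the text head dimension is `hidden_size / num_attention_heads` and that the visual tower splits the image into non-overlapping square patches.

```python
# Values copied from the README's "Notable modifications" block.
text_config = {
    "hidden_size": 8,
    "kv_channels": 4,
    "num_attention_heads": 2,
}
visual_config = {
    "heads": 2,
    "image_size": 448,
    "patch_size": 14,
    "width": 8,
}


def derived_shapes(text, visual):
    """Hypothetical helper: derive shapes implied by the config values.

    Assumes head_dim = hidden_size / num_heads and a ViT-style grid of
    non-overlapping patches. The real model code may differ.
    """
    text_head_dim = text["hidden_size"] // text["num_attention_heads"]
    patches_per_side = visual["image_size"] // visual["patch_size"]
    return {
        "text_head_dim": text_head_dim,
        # kv_channels is expected to match the per-head dimension.
        "kv_channels_consistent": text_head_dim == text["kv_channels"],
        "patches_per_side": patches_per_side,
        "num_patches": patches_per_side ** 2,
        # Default width / heads value only. The README says the visual
        # head dim was changed, so the actual value may not equal this.
        "default_visual_head_dim": visual["width"] // visual["heads"],
    }


shapes = derived_shapes(text_config, visual_config)
print(shapes)
```

Under these assumptions `kv_channels = 4` agrees with `hidden_size / num_attention_heads`, and a 448-pixel image with 14-pixel patches gives a 32 x 32 patch grid. The actual visual head dimension is whatever `upload_model.py` sets, which is not shown in this commit.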