The license is cc-by-nc-sa-4.0.

πŸ»β€β„οΈYou_can_cry_Snowman-13BπŸ»β€β„οΈ

img

Model Details

Model Developers Seungyoo Lee(DopeorNope)

I am in charge of Large Language Models (LLMs) at Markr AI team in South Korea.

Input Models input text only.

Output Models generate text only.

Model Architecture
You_can_cry_Snowman-13B is an auto-regressive language model based on the SOLAR architecture.


Base Model

kyujinpy/Sakura-SOLAR-Instruct

Weyaxi/SauerkrautLM-UNA-SOLAR-Instruct

Implemented Method

I have merged two models by increasing the parameter size to create a larger model.

I wanted to check how much the performance of the SOLAR base model changes when the scale of the parameters is increased.


Implementation Code

Load model


from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

repo = "DopeorNope/You_can_cry_Snowman-13B"
OpenOrca = AutoModelForCausalLM.from_pretrained(
        repo,
        return_dict=True,
        torch_dtype=torch.float16,
        device_map='auto'
)
OpenOrca_tokenizer = AutoTokenizer.from_pretrained(repo)

Downloads last month
1,304
Safetensors
Model size
13.3B params
Tensor type
F32
Β·
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for DopeorNope/You_can_cry_Snowman-13B

Quantizations
4 models

Spaces using DopeorNope/You_can_cry_Snowman-13B 18