README.md · oshizo/japanese-sexual-moderation at f8ce541f1421a750ad428ebef32b00ec54d501dd

metadata

license: mit

japanese-sexual-moderation is a fine-tuned version of studio-ousia/luke-japanese-large-lite.
It scores whether a short sentence is sexual or not.
The version as of 20230/9/17 was trained on a limited number of data, and scoring trends may have biases due to the data set.
This model was created to calculate ERP scores for japanese-llm-roleplay-benchmark.

japanese-sexual-moderationは、studio-ousia/luke-japanese-large-liteをファインチューニングしたモデルです。
短文が性的かどうかをスコアリングします。
20230/9/17時点のバージョンは限られたデータ数で訓練されており、スコアリングの傾向にはデータセットに起因するバイアスがある可能性があります。
このモデルはjapanese-llm-roleplay-benchmarkでのERPスコアを算出するために作成されました。

Usage

from transformers import AutoModelForSequenceClassification, AutoTokenizer
import numpy as np

model_id = "oshizo/japanese-sexual-moderation"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSequenceClassification.from_pretrained(
    model_id,
    problem_type="multi_label_classification",
    num_labels=1
)

text = "富士山は日本で一番高い山です。"
with torch.no_grad():
    encoding = tokenizer(text, return_tensors="pt")
    score = model(**encoding).logits

# tensor([[-2.7863]])