metadata

base_model: mini1013/master_domain
library_name: setfit
metrics:
  - accuracy
pipeline_tag: text-classification
tags:
  - setfit
  - sentence-transformers
  - text-classification
  - generated_from_setfit_trainer
widget:
  - text: 질레트 퓨전 하이드라젤 센서티브스킨 195mgX3입 (질레트퓨전하이드라젤)-190mlx3통★특가★ 애니몰
  - text: 꽃을든남자 플러스유 포맨 스킨 250ml 1개 플러스유 포맨 로션 아주상사
  - text: 오니츠카타이거 MEXICO 66 SD 크림 남성 운동화 스니커즈 1183A838-100 27.5cm_- 77언니
  - text: 키엘 훼이셜 퓨얼 에너자이징 모이스처 트리트먼트 포 맨 125ml 옵션없음 주식회사 샌팅
  - text: 비오템옴므 포스 수프림 클렌저 125ml 비오템 포스 수프림 클렌저 125ml 메이비굿컴퍼니
inference: true
model-index:
  - name: SetFit with mini1013/master_domain
    results:
      - task:
          type: text-classification
          name: Text Classification
        dataset:
          name: Unknown
          type: unknown
          split: test
        metrics:
          - type: accuracy
            value: 0.561352657004831
            name: Accuracy

SetFit with mini1013/master_domain

This is a SetFit model that can be used for Text Classification. This SetFit model uses mini1013/master_domain as the Sentence Transformer embedding model. A LogisticRegression instance is used for classification.

The model has been trained using an efficient few-shot learning technique that involves:

Fine-tuning a Sentence Transformer with contrastive learning.
Training a classification head with features from the fine-tuned Sentence Transformer.

Model Details

Model Description

Model Type: SetFit
Sentence Transformer body: mini1013/master_domain
Classification head: a LogisticRegression instance
Maximum Sequence Length: 512 tokens
Number of Classes: 13 classes

Model Sources

Repository: SetFit on GitHub
Paper: Efficient Few-Shot Learning Without Prompts
Blogpost: SetFit: Efficient Few-Shot Learning Without Prompts

Model Labels

Label	Examples
13.0	'[토르셀렉샵] 랩시리즈 올인원 멀티 액션 훼이스 워시 100ml 100ml 토르컴퍼니' '랩시리즈 올인원 멀티 액션 훼이스 워시 100ml 1021844 올인원 멀티 액션 훼이스 워시 100ml 배스테인' '남자 폼 클렌징 미프 비타에너지 150ml x3개 세트 남성 세안제 비타민 트러블 보습 옵션없음 (주)라이프스타일프로젝트'
12.0	'라피크젤 포맨 남성 크림 50ml 1개 라파크젤' '남성 유니콘 롱타임 젤 60g 강한남자 옵션없음 에이플러스' '프리메라 맨 인더핑크 아쿠아 쉴드 파워 모이스처라이징 젤 50ml 옵션없음 달달하우스'
6.0	'니베아 맨 센서티브 쉐이빙 젤 200ml 옵션없음 네고장터' '니베아 맨 센서티브 쉐이빙 젤 면도젤 200ml 옵션없음 (주)에스티아이그룹' '질레트 포오미 레몬 라임 쉐이빙 크림 175g 옵션없음 라봉'
0.0	'더고래 롱타임스프레이 남자사타구니 쿨링스프레이 30ml 옵션없음 주식회사 버루디(Burudy Corp.)' '남성 청결제 샤워 위생 냄세제거기 청결 바디 목욕용품 200ML 옵션없음 나노나비' '남자 사타구니 습진 가려움 가려움증 세정제 청결제 옵션없음 강스맘'
5.0	'BIOTHERM 비오템 UV 디펜스 선스틱 20g 옵션없음 씨플랩' '[현대백화점][빌리프]빌리프 맨올로지 울트라 레스큐 에브리데이 선스크린 60ml 옵션없음 (주)현대백화점' '남자 선크림 자연스러운 화이트닝 남성 자외선 블록 4계절용 브리올 옴므'
2.0	'오디세이 로맨틱 에멀전 130ml (비치용)오디세이 로맨틱 스킨 130ml 리앤햇' '오딧세이 로맨틱 에멀젼130ml 옵션없음 야긴&보아스 무역' '헤라 옴므 파워부스팅 모이스춰라이저 110ml/N 에멀젼110ml+20ml(케이스 X) 케이상사'
3.0	'네이처바이 차콜 미네랄 아쿠아 콜라겐 마스크팩 포맨 50매 /남성전용팩 코코팜' '릴리프 맨테라피 마스크 18ml x4 엠도씨코리아' '마스크팩 매프디 남성 마스크 브라이트닝 오일 컨트롤 화이트닝 여드름 자국 수축 모공 리프팅 전용 바버스 Barbers'
8.0	'아라미스 애프터 쉐이브 스킨 200ml, 1개 옵션없음 지크(JIC)' '랩시리즈 레스큐 워터로션 200ml 토너 스킨 옵션없음 케이뷰티' '꽃을든남자 옴므 스킨 140ml 옵션없음 씀(SSM) 리테일'
10.0	'홀츠포맨 남자 에센스 100ml 종합 피부 영양제 멀티 비타민 주름개선 남성 화장품 쇼핑베토벤' '설화수 본윤 에센스 140ml 유통기한 27년 설화수' 'SatinNaturel 사틴내추럴 비어드 오일 100ml x 3개 옵션없음 명원박'
9.0	'오딧세이옴므 챕터파이브 안티파티그 아이 세럼 라이트하우스 15ml 오딧세이' '남자 아이크림 대용량 다크서클 화이트닝 안티에이징 남성 화장품 브리올 옴므' '토니모리 더 블랙 옴므 멀티 아이 크림 30ml 토니모리'
11.0	'아이디얼포맨 퍼펙트 올인원 밀크 기획 (올인원 150ml + 30ml) 니플밴드 스킨톤(64매입_32회분) 씨제이올리브영 주식회사' '아이디얼포맨 퍼펙트 올인원 밀크 150ml 30ml 1021872 옵션없음 배스테인' '라운드랩 포 맨 1025 독도 올인원 플루이드 200ml+200ml (1+1) 옵션없음 강렬유통'
1.0	'[갤러리아] [PRMR]맨 오가니언스 2종 기획세트 옵션없음 한화갤러리아(주)' '네오니스 바이옴 2종 세트 옵션없음 (주)랜디오션' '엘지 더후 공진향 군 자양 2종 스페셜 세트 옵션없음 베로베(주)'
4.0	'미프 미남크림 남자 썬 비비 커버 크림 50ml 미백 BB 잡티 커버 자외선 차단 미프' '다슈 맨즈 굿 룩스 아이브로우 펜슬 0.2g／2color 178904 내추럴블랙 빅토리커머스' '아이디얼포맨 베러톤 커버쿠션 비비쿠션 15g SPF50 1021891 22호 라이트 베이지 메가랜드'

Evaluation

Metrics

Label	Accuracy
all	0.5614

Uses

Direct Use for Inference

First install the SetFit library:

pip install setfit

Then you can load this model and run inference.

from setfit import SetFitModel

# Download from the 🤗 Hub
model = SetFitModel.from_pretrained("mini1013/master_cate_bt0_test")
# Run inference
preds = model("꽃을든남자 플러스유 포맨 스킨 250ml 1개 플러스유 포맨 로션 아주상사")

Training Details

Training Set Metrics

Training set	Min	Median	Max
Word count	4	8.8913	19

Label	Training Sample Count
0.0	12
1.0	25
2.0	20
3.0	19
4.0	17
5.0	18
6.0	22
8.0	19
9.0	10
10.0	11
11.0	22
12.0	18
13.0	17

Training Hyperparameters

batch_size: (512, 512)
num_epochs: (50, 50)
max_steps: -1
sampling_strategy: oversampling
num_iterations: 60
body_learning_rate: (2e-05, 1e-05)
head_learning_rate: 0.01
loss: CosineSimilarityLoss
distance_metric: cosine_distance
margin: 0.25
end_to_end: False
use_amp: False
warmup_proportion: 0.1
l2_weight: 0.01
seed: 42
eval_max_steps: -1
load_best_model_at_end: False

Training Results

Epoch	Step	Training Loss	Validation Loss
0.0370	1	0.4919	-
1.8519	50	0.3561	-
3.7037	100	0.0781	-
5.5556	150	0.0282	-
7.4074	200	0.0154	-
9.2593	250	0.0063	-
11.1111	300	0.0005	-
12.9630	350	0.0002	-
14.8148	400	0.0002	-
16.6667	450	0.0001	-
18.5185	500	0.0001	-
20.3704	550	0.0001	-
22.2222	600	0.0001	-
24.0741	650	0.0001	-
25.9259	700	0.0001	-
27.7778	750	0.0001	-
29.6296	800	0.0001	-
31.4815	850	0.0001	-
33.3333	900	0.0001	-
35.1852	950	0.0001	-
37.0370	1000	0.0001	-
38.8889	1050	0.0001	-
40.7407	1100	0.0001	-
42.5926	1150	0.0001	-
44.4444	1200	0.0001	-
46.2963	1250	0.0001	-
48.1481	1300	0.0001	-
50.0	1350	0.0001	-

Framework Versions

Python: 3.10.12
SetFit: 1.1.0
Sentence Transformers: 3.3.1
Transformers: 4.44.2
PyTorch: 2.2.0a0+81ea7a4
Datasets: 3.2.0
Tokenizers: 0.19.1

Citation

BibTeX

@article{https://doi.org/10.48550/arxiv.2209.11055,
    doi = {10.48550/ARXIV.2209.11055},
    url = {https://arxiv.org/abs/2209.11055},
    author = {Tunstall, Lewis and Reimers, Nils and Jo, Unso Eun Seo and Bates, Luke and Korat, Daniel and Wasserblat, Moshe and Pereg, Oren},
    keywords = {Computation and Language (cs.CL), FOS: Computer and information sciences, FOS: Computer and information sciences},
    title = {Efficient Few-Shot Learning Without Prompts},
    publisher = {arXiv},
    year = {2022},
    copyright = {Creative Commons Attribution 4.0 International}
}