Mert

Sengil

https://www.linkedin.com/in/mertsengil/

AI & ML interests

LLMs

Organizations

None yet

Sengil's activity

updated a model 1 day ago

Sengil/ModernBERT-NewsClassifier-EN-small

Text Classification • Updated 1 day ago • 2

published a model 2 days ago

Sengil/ModernBERT-NewsClassifier-EN-small

Text Classification • Updated 1 day ago • 2

liked a model 5 days ago

coqui/XTTS-v2

Text-to-Speech • Updated Dec 11, 2023 • 2.15M • 2.25k

liked a dataset 19 days ago

defunct-datasets/amazon_us_reviews

Updated Nov 2, 2023 • 5.48k • 71

liked a model 20 days ago

answerdotai/ModernBERT-base

Fill-Mask • Updated 6 days ago • 4.7M • 693

New activity in answerdotai/ModernBERT-base 20 days ago

ValueError: The checkpoint you are trying to load has model type `modernbert`

#37 opened 20 days ago by Sengil

liked a model 20 days ago

dbmdz/bert-base-turkish-uncased

Updated Feb 20, 2024 • 23.9k • 34

liked 3 datasets 21 days ago

liked a model 23 days ago

deepseek-ai/DeepSeek-V3

Updated 22 days ago • 173k • 2.11k

updated a model 25 days ago

Sengil/ABSA-Turkish-bert-based-small

Text Classification • Updated 25 days ago • 146 • 4

updated a model about 2 months ago

Sengil/ABSA-Turkish-bert-based-small

Text Classification • Updated 25 days ago • 146 • 4

updated a dataset about 2 months ago

Sengil/Turkish-ABSA-Wsynthetic

Viewer • Updated Dec 5, 2024 • 22.1k • 11 • 1

liked a model about 2 months ago

turkish-nlp-suite/tr_core_news_trf

Token Classification • Updated 15 days ago • 293 • 14

reacted to gabrielchua's post with 👀 about 2 months ago

Post

1313

Sharing my first paper!

Large Language Models (LLMs) are powerful, but they're prone to off-topic misuse, where users push them beyond their intended scope. Think harmful prompts, jailbreaks, and misuse. So how do we build better guardrails?

Traditional guardrails rely on curated examples or classifiers. The problem?
⚠️ High false-positive rates
⚠️ Poor adaptability to new misuse types
⚠️ Reliance on real-world data, which is often unavailable during pre-production

Our method skips the need for real-world misuse examples. Instead, we:
1️⃣ Define the problem space qualitatively
2️⃣ Use an LLM to generate synthetic misuse prompts
3️⃣ Train and test guardrails on this dataset

We apply this to the off-topic prompt detection problem, and fine-tune simple bi- and cross-encoder classifiers that outperform heuristics based on cosine similarity or prompt engineering.

Additionally, framing the problem as prompt relevance allows these fine-tuned classifiers to generalise to other risk categories (e.g., jailbreak, toxic prompts).

Through this work, we also open-source our dataset (2M examples, ~50M+ tokens) and models.

paper: A Flexible Large Language Models Guardrail Development Methodology Applied to Off-Topic Prompt Detection (2411.12946)

artifacts: govtech/off-topic-guardrail-673838a62e4c661f248e81a4
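
For context, the kind of cosine-similarity heuristic that the post says the fine-tuned classifiers outperform can be sketched as below. This is only an illustrative baseline and not the paper's method: the bag-of-words vectors, the function names, and the 0.2 threshold are all assumptions chosen to keep the example self-contained. A real baseline would more likely compare sentence embeddings.

```python
import math
import re
from collections import Counter


def bag_of_words(text: str) -> Counter:
    """Lowercase the text and count its alphanumeric tokens."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an empty prompt shares nothing with the system prompt
    return dot / (norm_a * norm_b)


def is_off_topic(system_prompt: str, user_prompt: str, threshold: float = 0.2) -> bool:
    """Flag a user prompt whose similarity to the system prompt is below the threshold.

    The threshold is a hypothetical value; in practice it would be tuned on data.
    """
    score = cosine_similarity(bag_of_words(system_prompt), bag_of_words(user_prompt))
    return score < threshold


SYSTEM_PROMPT = "You answer questions about booking flights and airline baggage rules."

if __name__ == "__main__":
    for prompt in (
        "What are the baggage rules for my flights?",
        "Write me a poem on medieval castles.",
    ):
        print(prompt, "->", "off-topic" if is_off_topic(SYSTEM_PROMPT, prompt) else "on-topic")
```

A heuristic like this depends entirely on surface word overlap and a hand-picked threshold, which may help explain why a trained bi- or cross-encoder is reported to do better.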

reacted to maxiw's post with 👍 about 2 months ago

Post

2093

You can now try out computer use models from the hub to automate your local machine with https://github.com/askui/vision-agent. 💻

import time
from askui import VisionAgent

with VisionAgent() as agent:
    agent.tools.webbrowser.open_new("http://www.google.com")
    time.sleep(0.5)
    agent.click("search field in the center of the screen", model_name="Qwen/Qwen2-VL-7B-Instruct")
    agent.type("cats")
    agent.keyboard("enter")
    time.sleep(0.5)
    agent.click("text 'Images'", model_name="AskUI/PTA-1")
    time.sleep(0.5)
    agent.click("second cat image", model_name="OS-Copilot/OS-Atlas-Base-7B")

These models are currently integrated through the Gradio Spaces API. Local inference is planned soon!

Currently supported:
- Qwen/Qwen2-VL-7B-Instruct
- Qwen/Qwen2-VL-2B-Instruct
- AskUI/PTA-1
- OS-Copilot/OS-Atlas-Base-7B

liked a Space about 2 months ago

HuggingFace Trending Board

reacted to csabakecskemeti's post with 👍 2 months ago

Post

1244

Some time ago, I built a predictive LLM router that routes chat requests between small and large LLMs based on prompt classification. It dynamically selects the most suitable model for the complexity of the user input while maintaining conversation context. I also fine-tuned a RoBERTa model to use with the package, but you can plug in any classifier of your choice.

Project's homepage:
https://devquasar.com/llm-predictive-router/
PyPI:
https://pypi.org/project/llm-predictive-router/
Model:
DevQuasar/roberta-prompt_classifier-v0.1
Training data:
DevQuasar/llm_router_dataset-synth
Git:
https://github.com/csabakecskemeti/llm_predictive_router_package

Feel free to check it out, and/or contribute.
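
The routing idea described in this post can be illustrated with a minimal sketch. It does not use the llm-predictive-router package or its API: the `PredictiveRouter` class, the keyword-based `toy_complexity_classifier`, and the model names are all hypothetical stand-ins. The sketch only shows a pluggable classifier choosing a model while one shared history is kept across turns.

```python
from typing import Callable, Dict, List, Tuple


def toy_complexity_classifier(prompt: str) -> str:
    """Hypothetical stand-in for a trained prompt classifier.

    Returns "large" for long or reasoning-heavy prompts, otherwise "small".
    """
    hard_markers = ("prove", "derive", "step by step", "analyze", "refactor")
    text = prompt.lower()
    if len(text.split()) > 40 or any(marker in text for marker in hard_markers):
        return "large"
    return "small"


class PredictiveRouter:
    """Pick a model per prompt while keeping one shared conversation history."""

    def __init__(self, classifier: Callable[[str], str], models: Dict[str, str]) -> None:
        self.classifier = classifier  # any callable mapping a prompt to a label
        self.models = models          # maps each label to a model name
        self.history: List[Tuple[str, str]] = []  # (prompt, chosen model) per turn

    def route(self, prompt: str) -> str:
        """Classify the prompt, record the turn, and return the chosen model name."""
        label = self.classifier(prompt)
        model_name = self.models[label]
        self.history.append((prompt, model_name))
        return model_name


if __name__ == "__main__":
    # Placeholder model names, not real checkpoints.
    router = PredictiveRouter(
        toy_complexity_classifier,
        {"small": "small-model", "large": "large-model"},
    )
    print(router.route("What is the capital of France?"))
    print(router.route("Prove that the sum of two even numbers is even."))
```

Because the classifier is passed in as a plain callable, a fine-tuned model could replace the keyword heuristic without changing the router itself.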

liked a model 2 months ago

yeniguno/absa-turkish-bert-dbmdz

Text Classification • Updated Sep 22, 2024 • 31 • 4