Spaces:

stackofsugar
/

depression-causal-analysis

Sleeping

App Files Files Community

Christopher Digno commited on Oct 31, 2024

Commit

15e365d

1 Parent(s): c818632

Added application files

Browse files

Files changed (10) hide show

.gitignore +3 -0
app.py +161 -0
model/config.json +61 -0
model/model.safetensors +3 -0
requirements.txt +81 -0
tokenizer/merges.txt +0 -0
tokenizer/special_tokens_map.json +1 -0
tokenizer/tokenizer.json +0 -0
tokenizer/tokenizer_config.json +1 -0
tokenizer/vocab.json +0 -0

.gitignore ADDED Viewed

	@@ -0,0 +1,3 @@

+*.ipynb
+coba.py
+.venv/

app.py ADDED Viewed

	@@ -0,0 +1,161 @@

+import streamlit as st
+import os.path
+import re
+import pandas as pd
+from transformers import pipeline, Pipeline
+from time import sleep
+ID2LABEL = {
+    'LABEL_0': {
+        "Cause": "No Reason",
+        "description": "There is no reason that identifies the cause of mental disorder, or the text does not reflect a mental disorder",
+    },
+    'LABEL_1': {
+        "Cause": "Bias / Abuse",
+        "description": """
+            A strong inclination of the mind or a preconceived opinion about something or someone. To avoid someone intentionally,
+            or to prevent someone from taking part in the social activities of a group because they dislike the person or disapprove
+            of their activities. It includes body shaming, physical, sexual, or emotional abuse.
+        """,
+    },
+    'LABEL_2': {
+        "Cause": "Jobs and Career",
+        "description": """
+            Financial loss can have catastrophic effects on mental illness, relationships and even physical health. Poor, meaningless
+            and unmanageable education, unemployment, un-affordable home loans, poor financial advice, and losing a job are some of
+            the major concerns. It includes gossiping and/or social cliques, aggressive bullying behavior, poor communication
+            and unclear expectations, dictatorial management techniques that don’t embrace employee feedback. The educational problems
+            like picking up courses under some external pressure and poor grades are also part of this category.
+        """,
+    },
+    'LABEL_3': {
+        "Cause": "Medication",
+        "description": """
+            The general drugs and other antiviral drugs can increase the risk of depression. The habit of using substances and alcohols
+            can aggravate the problem of mental disorders. Moreover, medical problems like tumors, cancer, and other prolonged diseases
+            can boost the presence of mental depression.
+        """,
+    },
+    'LABEL_4': {
+        "Cause": "Relationships",
+        "description": """
+            When two people or a group of people fight, it may lead to a relationship or friendship drifting apart, for example, regular
+            fights, breakups, divorce, mistrust, jealousy, betrayal, difference in opinion, inconsistency, conflicts, bad company,
+            noncommitment, priority, envy. Problems like bad parenting and childhood trauma are also part of this category.
+        """,
+    },
+    'LABEL_5': {
+        "Cause": "Alienation",
+        "description": """
+            Alienation is the feeling of life being worthless even after doing everything. There may be indicators of meaninglessness,
+            loneliness, tired of daily routines, powerlessness, normlessness, isolation, and cultural estrangement.
+        """,
+    },
+}
+EXAMPLES = [
+    """
+        Same dad, different day. I can't believe that my dad have no sense of humanity. He hit me in the head this morning.
+        I felt nauseous all day, including when I take my exam just before this. If I kill myself, I will make sure that my dad
+        got the most blame.
+    """,
+    """
+        My boss laid me off today. He said that my company was downsizing. I don't believe his cr*p though, I think he just hates
+        me as the only women in my department. Now I don't know how to feed my four kids. I am officially, totally, completely, out
+        of money and will to live.
+    """,
+    """
+        Last month, my doctor prescribed me some alprazolam to calm my nerves down. I churn through a bottle of the pill in a month.
+        My doctor today told me that I'm quite healthy to go without some calming drugs, but I just can't stop consuming them. I
+        just bought like 4 bottles of it through some shady middle-man and I'll go crazy if I went for half a day without
+        swallowing one.
+    """,
+    """
+        My girlfriend dumped me because of some stupid nerdy dude at her office, my mother disowns me for not enlisting to the
+        millitary like my brother did. It all just keeps on pinning me to the ground. When I asked my friends to go out, they all
+        refused because my ex-girlfriend was spreading lies of how I cheated on her yada yada. I'm totally f*cked.
+    """,
+    """
+        Everything is worthless, everything is meaningless. All the things that I do literally contribute nothing to the
+        society. I just want to go somewhere I can just lie down, sleep, eat, with no negative consequences for me.
+    """,
+    """
+        Yesterday, I bought an ice cream for myself at the city. It was really good. I will definitely tell everyone
+        I know about this ice cream place.
+    """,
+]
+@st.cache_resource(show_spinner=False)
+def load_model() -> Pipeline:
+    model = "AIMH/mental-longformer-base-4096"
+    tokenizer = "AIMH/mental-longformer-base-4096"
+    if os.path.isfile("model/model.safetensors"):
+        model = "model"
+    if os.path.isfile("tokenizer/tokenizer.json"):
+        tokenizer = "tokenizer"
+    return pipeline("text-classification", model=model, tokenizer=tokenizer)
+@st.cache_data(show_spinner=False)
+def predict(text: str, _pipe: Pipeline):
+    res = _pipe(text, return_all_scores=True)[0]
+    res = sorted(res, key=lambda d: d['score'], reverse=True)
+    res = [
+        dict(ID2LABEL[x["label"]], **{"Confidence": round(x["score"] * 100)})
+    for x in res]
+    return pd.DataFrame(res)
+    # return res
+if __name__ == "__main__":
+    st.markdown('<h1 align="center">✨ Depression Causal Analysis ✨</h1>', unsafe_allow_html=True, )
+    st.error("""
+        DISCLAIMER: This project was only intended for research showcase purposes only.
+        If you believe that you have mental health issues, please consult your physician.\n
+        This project also contains triggering example words from social media that might not
+        fit everybody. Continue with caution. Please love yourself and don't hesitate to reach
+        out for professional help. ❤
+    """)
+    left, center, right = st.columns(3)
+    left.link_button("Go to project on GitHub", "https://github.com/stackofsugar/", use_container_width=True)
+    center.link_button("Read the dataset's paper", "https://arxiv.org/abs/2207.04674v1", use_container_width=True)
+    right.link_button("Read the model's paper", "https://arxiv.org/abs/2304.10447v1", use_container_width=True)
+    example = st.selectbox("Load an example", EXAMPLES, index=None)
+    with st.form("main_prediction"):
+        text = st.text_area(
+            "Text to analyze (tip: this model is better for long texts)",
+            value=(re.sub(' +', ' ', example).strip().replace("\n", "") if example else ""),
+            height=200
+        )
+        left, middle, right = st.columns(3)
+        submitted = middle.form_submit_button("Predict!", use_container_width=True)
+        if submitted and text.strip():
+            pipe = None
+            with st.spinner("Loading model..."):
+                pipe = load_model()
+            with st.spinner("Predicting..."):
+                preds = predict(text.strip(), pipe)
+            st.markdown(f"**Result: {preds.at[0, "Cause"]}**")
+            col1, col2 = st.columns([1,2])
+            # col1, col2 = st.columns(2)
+            col1.dataframe(preds[["Cause", "Confidence"]],
+                        column_config={
+                            "Confidence": st.column_config.NumberColumn(
+                                format="%d%%"
+                            )
+                        },
+                        hide_index=True,
+                        use_container_width=True)
+            col2.markdown(f"Explanation of **{preds.at[0, "Cause"]}**:")
+            col2.write(preds.at[0, "description"])
+    st.write(
+        """Developed with :heart: by [stackofsugar](https://github.com/stackofsugar/). For more information on the
+        project, please visit the project's GitHub page with the button above."""
+        )

model/config.json ADDED Viewed

	@@ -0,0 +1,61 @@

+{
+  "_name_or_path": "allenai/longformer-base-4096",
+  "architectures": [
+    "LongformerForSequenceClassification"
+  ],
+  "attention_mode": "longformer",
+  "attention_probs_dropout_prob": 0.1,
+  "attention_window": [
+    512,
+    512,
+    512,
+    512,
+    512,
+    512,
+    512,
+    512,
+    512,
+    512,
+    512,
+    512
+  ],
+  "bos_token_id": 0,
+  "eos_token_id": 2,
+  "gradient_checkpointing": false,
+  "hidden_act": "gelu",
+  "hidden_dropout_prob": 0.1,
+  "hidden_size": 768,
+  "id2label": {
+    "0": "LABEL_0",
+    "1": "LABEL_1",
+    "2": "LABEL_2",
+    "3": "LABEL_3",
+    "4": "LABEL_4",
+    "5": "LABEL_5"
+  },
+  "ignore_attention_mask": false,
+  "initializer_range": 0.02,
+  "intermediate_size": 3072,
+  "label2id": {
+    "LABEL_0": 0,
+    "LABEL_1": 1,
+    "LABEL_2": 2,
+    "LABEL_3": 3,
+    "LABEL_4": 4,
+    "LABEL_5": 5
+  },
+  "layer_norm_eps": 1e-05,
+  "max_position_embeddings": 4098,
+  "model_type": "longformer",
+  "num_attention_heads": 12,
+  "num_hidden_layers": 12,
+  "onnx_export": false,
+  "pad_token_id": 1,
+  "position_embedding_type": "absolute",
+  "sep_token_id": 2,
+  "torch_dtype": "float32",
+  "transformers_version": "4.44.2",
+  "type_vocab_size": 1,
+  "use_cache": true,
+  "vocab_size": 50265
+}

model/model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:3da077d487c964259ec9f47b5b58f39b9db45f2964e3763e3fc91695ddb0728b
+size 594690488

requirements.txt ADDED Viewed

	@@ -0,0 +1,81 @@

+accelerate==1.0.1
+altair==5.4.1
+asttokens==2.4.1
+attrs==24.2.0
+blinker==1.8.2
+cachetools==5.5.0
+certifi==2024.8.30
+charset-normalizer==3.4.0
+click==8.1.7
+colorama==0.4.6
+comm==0.2.2
+debugpy==1.8.7
+decorator==5.1.1
+executing==2.1.0
+filelock==3.16.1
+fsspec==2024.10.0
+gitdb==4.0.11
+GitPython==3.1.43
+huggingface-hub==0.26.2
+idna==3.10
+inquirerpy==0.3.4
+ipykernel==6.29.5
+ipython==8.29.0
+jedi==0.19.1
+Jinja2==3.1.4
+jsonschema==4.23.0
+jsonschema-specifications==2024.10.1
+jupyter_client==8.6.3
+jupyter_core==5.7.2
+markdown-it-py==3.0.0
+MarkupSafe==3.0.2
+matplotlib-inline==0.1.7
+mdurl==0.1.2
+mpmath==1.3.0
+narwhals==1.12.1
+nest-asyncio==1.6.0
+networkx==3.4.2
+numpy==2.1.2
+packaging==24.1
+pandas==2.2.3
+parso==0.8.4
+pfzy==0.3.4
+pillow==10.4.0
+platformdirs==4.3.6
+prompt_toolkit==3.0.48
+protobuf==5.28.3
+psutil==6.1.0
+pure_eval==0.2.3
+pyarrow==18.0.0
+pydeck==0.9.1
+Pygments==2.18.0
+python-dateutil==2.9.0.post0
+pytz==2024.2
+pywin32==308
+PyYAML==6.0.2
+pyzmq==26.2.0
+referencing==0.35.1
+regex==2024.9.11
+requests==2.32.3
+rich==13.9.3
+rpds-py==0.20.0
+safetensors==0.4.5
+setuptools==75.3.0
+six==1.16.0
+smmap==5.0.1
+stack-data==0.6.3
+streamlit==1.39.0
+sympy==1.13.1
+tenacity==9.0.0
+tokenizers==0.20.1
+toml==0.10.2
+torch==2.5.1
+tornado==6.4.1
+tqdm==4.66.6
+traitlets==5.14.3
+transformers==4.46.1
+typing_extensions==4.12.2
+tzdata==2024.2
+urllib3==2.2.3
+watchdog==5.0.3
+wcwidth==0.2.13

tokenizer/merges.txt ADDED Viewed

The diff for this file is too large to render. See raw diff

tokenizer/special_tokens_map.json ADDED Viewed

	@@ -0,0 +1 @@


1	+ {"bos_token": "<s>", "eos_token": "</s>", "unk_token": "<unk>", "sep_token": "</s>", "pad_token": "<pad>", "cls_token": "<s>", "mask_token": {"content": "<mask>", "single_word": false, "lstrip": true, "rstrip": false, "normalized": false}}

tokenizer/tokenizer.json ADDED Viewed

The diff for this file is too large to render. See raw diff

tokenizer/tokenizer_config.json ADDED Viewed

	@@ -0,0 +1 @@


1	+ {"unk_token": "<unk>", "bos_token": "<s>", "eos_token": "</s>", "add_prefix_space": false, "errors": "replace", "sep_token": "</s>", "cls_token": "<s>", "pad_token": "<pad>", "mask_token": "<mask>", "model_max_length": 4096, "special_tokens_map_file": null, "name_or_path": "allenai/longformer-base-4096", "tokenizer_class": "LongformerTokenizer"}

tokenizer/vocab.json ADDED Viewed

The diff for this file is too large to render. See raw diff