A subset of XLM-RoBERTa that keeps only the top 30k Portuguese tokens (obtained from por-pt_web_2015_1M.tar.gz, found <a href="https://wortschatz.uni-leipzig.de/en/download/Portuguese">here</a>).
All credit for the methodology goes to David Dale/avidale/cointegrated. This model was created by following an adaptation of their guide, which can be found in the comments section <a href="https://gist.github.com/avidale/44cd35bfcdaf8bedf51d97c468cc8001">here</a>.
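
The general idea of keeping only the top tokens can be illustrated with a minimal sketch. It is not the code used to build this model and does not reproduce the linked guide: the function names `select_top_token_ids` and `shrink_embeddings`, the toy vocabulary, and the whitespace tokenizer are all hypothetical stand-ins, and rebuilding the real tokenizer is left out. The sketch counts how often each token id occurs in a corpus, keeps the special tokens plus the most frequent ids up to a budget, and copies only the matching embedding rows.

```python
from collections import Counter
from typing import Callable, Dict, Iterable, List, Sequence


def select_top_token_ids(
    sentences: Iterable[str],
    encode: Callable[[str], List[int]],
    num_special: int,
    top_k: int,
) -> List[int]:
    """Return the old token ids to keep: special tokens first, then the
    most frequent corpus tokens, up to top_k ids in total."""
    counts: Counter = Counter()
    for sentence in sentences:
        counts.update(encode(sentence))

    kept = list(range(num_special))  # assumption: special tokens occupy the first ids
    seen = set(kept)
    for token_id, _ in counts.most_common():
        if len(kept) >= top_k:
            break
        if token_id not in seen:
            kept.append(token_id)
            seen.add(token_id)
    return kept


def shrink_embeddings(
    embeddings: Sequence[Sequence[float]], kept_ids: Sequence[int]
) -> List[List[float]]:
    """Copy only the embedding rows that belong to the kept token ids."""
    return [list(embeddings[i]) for i in kept_ids]


# Toy stand-ins for a real tokenizer and embedding matrix (hypothetical data).
toy_vocab: Dict[str, int] = {
    "<s>": 0, "</s>": 1, "the": 2, "cat": 3, "o": 4, "gato": 5, "dorme": 6,
}
toy_corpus = ["o gato dorme", "o gato", "o"]
toy_embeddings = [[float(i)] for i in range(len(toy_vocab))]


def toy_encode(sentence: str) -> List[int]:
    # Whitespace tokenizer; a real model would use its subword tokenizer here.
    return [toy_vocab[word] for word in sentence.split()]


kept_ids = select_top_token_ids(toy_corpus, toy_encode, num_special=2, top_k=4)
old_to_new = {old: new for new, old in enumerate(kept_ids)}
small_embeddings = shrink_embeddings(toy_embeddings, kept_ids)

print(kept_ids)          # [0, 1, 4, 5]
print(small_embeddings)  # [[0.0], [1.0], [4.0], [5.0]]
```

In this toy run the ids for "the", "cat", and "dorme" are dropped, and `old_to_new` records how the surviving ids are renumbered. With a real model the same mapping would also have to be applied to the tokenizer vocabulary so that token ids and embedding rows stay aligned.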