Subsection of XLM_RoBERTa, 30k top Portuguese tokens (obtained from por-pt_web_2015_1M.tar.gz found <a href="https://wortschatz.uni-leipzig.de/en/download/Portuguese">here</a>) | |
All credits for methodology go to David Dale/avidale/cointegrated. Created following an adaptation of their guide, which can be found in the comments section <a href="https://gist.github.com/avidale/44cd35bfcdaf8bedf51d97c468cc8001">here</a>. |