social.3dots.lv/@pixel (2017-08-06 22:15:56) |
After presenting a paper on building a trilingual corpus, I'm thinking of the next steps. https://t.co/OKa3HYtSPF | ||
social.3dots.lv/@pixel (2017-08-06 22:17:32) |
The paper is here https://t.co/oC2dtyMeAK | ||
social.3dots.lv/@pixel (2017-08-06 22:19:53) |
One direction would be to get Latvian and Russian word embeddings and look for the nearest neighbours of words such as "enemy". | ||
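A minimal sketch of what such a nearest-neighbour lookup could look like, assuming the embeddings are already loaded as a plain word-to-vector mapping. The `toy` vectors and the helper names `cosine` and `nearest_neighbours` are hypothetical and only illustrate the idea; real Latvian and Russian embeddings would be loaded from trained models.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def nearest_neighbours(word, embeddings, k=3):
    """Return the k words most similar to `word`, best match first."""
    target = embeddings[word]
    scored = [
        (other, cosine(target, vec))
        for other, vec in embeddings.items()
        if other != word  # skip the query word itself
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Hypothetical 3-dimensional vectors, made up for illustration only.
toy = {
    "enemy": [0.9, 0.1, 0.0],
    "foe": [0.8, 0.2, 0.1],
    "friend": [-0.7, 0.3, 0.1],
    "table": [0.0, 0.1, 0.9],
}

for neighbour, score in nearest_neighbours("enemy", toy):
    print(f"{neighbour}\t{score:.3f}")
```

Running the same lookup over a Latvian and a Russian embedding space separately would give two neighbour lists for the same concept that could then be compared by hand.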
social.3dots.lv/@pixel (2017-08-06 22:21:19) |
Another direction could be to try to align tweets in Latvian with Russian. The trick is that it would be nice to align controversial tweets | ||
social.3dots.lv/@pixel (2017-08-06 22:21:46) |
... so that opposite opinions are put together. | ||
social.3dots.lv/@pixel (2017-08-06 22:25:19) |
And in the spirit of doing things openly, everything will be on @github. Let me know if you are interested. | ||
social.3dots.lv/@pixel (2017-08-06 22:26:44) |
@github And have I mentioned that I've been collecting some tweets from Kiev, Munich, Bolzano, Amsterdam, Singapore and Montreal? | ||
social.3dots.lv/@pixel (2017-08-06 22:28:00) |
@github Having "domain experts" who speak the local languages and are familiar with the current affairs there would help me a lot. |