Character Encoding and You�

Why does your text output have all those black boxes in it? Why can't it handle Portuguese? The answer is most likely "character encoding". This talk will cover some of the common character encoding gotchas and cover some defensive programming practices to help your code handle multiple encodings.

About Rachael Tatman

I have a PhD in linguistics, with a focus on computational sociolinguistics. My research focuses on highlighting how speech and text applications can better serve the diversity of language varieties in the world.

I'm always down to talk about fairness, accountability and transparency in machine learning, NLP ethics, and linguistic variation and change. (I've also been known to do the odd project on emoji and internet linguistics.)