Add a bit of a hack to better detect UTF-8 in the wild, versus ISO88591