On branch centos, I had to add these parameters in my.cnf in order to have all the characters correctly encoded in the database:
collation-server = utf8_general_ci character-set-server = utf8