hhsuitedb.py UnicodeDecodeError

Open eli1199 opened this issue 5 years ago • 2 comments

Expected Behavior

Builds a database from user-supplied .a3m file(s)

Current Behavior

Error: UnicodeDecodeError: 'utf-8' codec can't decode byte 0xff in position 36: invalid start byte

Steps to Reproduce (for bugs)

python hhsuitedb.py --ia3m=og4320.a3m -o my_db --cpu=8 --force

HH-suite Output (for bugs)

Unlinking entries from '/tmp/tmprv97qam3/files.dat'
Unlinking entries from '/tmp/tmprv97qam3/files.dat'
Traceback (most recent call last):
  File "hhsuitedb.py", line 482, in <module>
    main()
  File "hhsuitedb.py", line 478, in main
    check_database(options.output_basename, options.nr_cores, options.force_mode)
  File "hhsuitedb.py", line 376, in check_database
    calculate_hhm(threads, output_basename+"_a3m", output_basename+"_hhm")
  File "hhsuitedb.py", line 100, in calculate_hhm
    large_a3ms = get_large_a3ms(a3m_base_path)
  File "hhsuitedb.py", line 76, in get_large_a3ms
    entries = ffindex.read_index(a3m_base_path+".ffindex")
  File "ffindex.py", line 20, in read_index
    for line in fh:
  File "/cvmfs/soft.computecanada.ca/easybuild/software/2017/Core/python/3.7.0/lib/python3.7/codecs.py", line 322, in decode
    (result, consumed) = self._buffer_decode(data, self.errors, final)
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xff in position 36: invalid start byte
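
The traceback ends in ffindex.read_index, which iterates over the .ffindex file as UTF-8 text and fails on a 0xff byte. One way to narrow this down is to read the index in binary mode and report which lines cannot be decoded. The sketch below is only a diagnostic aid and not part of HH-suite: the function name find_undecodable_lines is hypothetical, and the sample index contents (tab-separated name, offset, length) are an assumption made for illustration.

```python
import os
import tempfile


def find_undecodable_lines(path, encoding="utf-8"):
    """Return (line_number, byte_offset) pairs for lines that fail to decode.

    byte_offset is the position of the first offending byte within that line.
    """
    bad = []
    # Binary mode: iterating over the file cannot raise UnicodeDecodeError.
    with open(path, "rb") as fh:
        for line_number, raw in enumerate(fh, start=1):
            try:
                raw.decode(encoding)
            except UnicodeDecodeError as err:
                bad.append((line_number, err.start))
    return bad


if __name__ == "__main__":
    # Hypothetical index contents; the second entry carries a stray 0xff byte.
    with tempfile.NamedTemporaryFile(suffix=".ffindex", delete=False) as tmp:
        tmp.write(b"seq1\t0\t120\nseq2\xff\t120\t80\n")
    try:
        for line_number, offset in find_undecodable_lines(tmp.name):
            print(f"line {line_number}: undecodable byte at offset {offset}")
    finally:
        os.remove(tmp.name)
```

Running this against the real index file would show which entry is affected, which may help decide whether a file name or the index itself is at fault.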

Your Environment

  • Version/Git commit used: 3.2.0
  • Server specifications (especially CPU support for AVX2/SSE and amount of system memory): can request as much as needed (compute cluster)
  • Operating system and version: Linux

eli1199 • Jun 20 '19 00:06

Hello eli1199,

I am having the same issue. Did you find a way to solve this?

Thanks!

ApollineBruley • Feb 26 '20 14:02

If anyone else runs into this issue: for me, the problem came from the .a3m and .hhm file names. I shortened them and removed the '_' and '.' characters (I'm not sure what exactly caused the problem), and it works perfectly now!
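
A minimal sketch of that renaming workaround, in case it helps: it keeps the file extension, strips everything except ASCII letters and digits from the rest of the name, and truncates the result. The function name simplify_filename and the max_len default of 30 are hypothetical choices made for illustration, not values taken from HH-suite, and since the root cause is unknown this may not fix every case.

```python
import os
import re


def simplify_filename(filename, max_len=30):
    """Return filename with a shortened, alphanumeric-only stem.

    The extension (e.g. ".a3m" or ".hhm") is kept unchanged. max_len is an
    arbitrary illustrative limit on the stem length.
    """
    stem, ext = os.path.splitext(filename)
    # Drop '_', '.', and any other character that is not an ASCII letter or digit.
    cleaned = re.sub(r"[^A-Za-z0-9]", "", stem)
    if not cleaned:
        raise ValueError(f"nothing left of {filename!r} after cleaning")
    return cleaned[:max_len] + ext


if __name__ == "__main__":
    # Hypothetical file names used only to demonstrate the function.
    for name in ["my_long.protein_name.a3m", "og4320.a3m"]:
        print(name, "->", simplify_filename(name))
```

Truncating names can make two different files collide, so it is worth checking that the simplified names are still unique before renaming anything on disk.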

ApollineBruley • Feb 28 '20 08:02