[DIYbio] File format detection and parser validation

Does anyone know of a tool for detecting file formats for common biology files, e.g. FASTA, FASTQ, GenBank, SBOL, AB1, etc.

The *nix file command / libmagic does a terrible job of this.

I'm also looking for a library of samples that showcase the diversity of formats and *ahem* variants of those formats for the purpose of ensuring that parsers don't fail on edge cases.

--
marc/juul

--
-- You received this message because you are subscribed to the Google Groups DIYbio group. To post to this group, send email to diybio@googlegroups.com. To unsubscribe from this group, send email to diybio+unsubscribe@googlegroups.com. For more options, visit this group at https://groups.google.com/d/forum/diybio?hl=en
Learn more at www.diybio.org
---
You received this message because you are subscribed to the Google Groups "DIYbio" group.
To unsubscribe from this group and stop receiving emails from it, send an email to diybio+unsubscribe@googlegroups.com.
To post to this group, send email to diybio@googlegroups.com.
Visit this group at https://groups.google.com/group/diybio.
To view this discussion on the web visit https://groups.google.com/d/msgid/diybio/CAL4ejvS-u6dJNyCJ2Ac4MCiFb_4zh9VQqF41MjC_cHE83W71rQ%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

  • Digg
  • Del.icio.us
  • StumbleUpon
  • Reddit
  • RSS

0 comments:

Post a Comment