load_unaligned_seqs
- load_unaligned_seqs(filename: str | Path, format_name: str | None = None, moltype: Literal['dna', 'rna', 'protein', 'protein_with_stop', 'text', 'bytes'] | None = None, label_to_name: Callable[[str], str] | None = None, parser_kw: dict | None = None, info: dict | None = None, **kw) → SequenceCollection
loads unaligned sequences from file
- Parameters:
- filename
path to a sequence file, or a glob pattern. If a glob pattern is given, each matching file is assumed to contain a single sequence, and all sequences are returned in one SequenceCollection.
- format_name
sequence file format. If not specified, it is guessed from the path suffix.
- moltype
the moltype, e.g. DNA, PROTEIN, 'dna', 'protein'
- label_to_name
function for converting the original sequence name into another name.
- parser_kw
optional arguments for the parser
- info
a dict from which to make an info object
- **kw
other keyword arguments passed to SequenceCollection, or show_progress. When filename is a glob pattern, show_progress displays a progress bar for the number of files processed.
- Returns:
SequenceCollection
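A minimal sketch of how the parameters above might be combined, assuming cogent3 is installed and the file is in FASTA format. The helper names `first_word`, `write_demo_fasta` and `load_demo`, the file name `demo.fasta` and its contents are hypothetical, made up for illustration. Only `load_unaligned_seqs` and its `format_name`, `moltype` and `label_to_name` arguments come from the signature above.

```python
from pathlib import Path


def first_word(label: str) -> str:
    """Hypothetical label_to_name callable: keep the text before the first whitespace."""
    return label.split()[0]


def write_demo_fasta(directory: Path) -> Path:
    """Write a small two-sequence FASTA file (made-up data) and return its path."""
    path = directory / "demo.fasta"
    path.write_text(">seq1 sample A\nACGTACGT\n>seq2 sample B\nACGGT\n")
    return path


def load_demo(path: Path):
    """Load the sequences in `path`, renaming each one with first_word."""
    # Imported lazily so the helpers above work without cogent3 installed.
    from cogent3 import load_unaligned_seqs

    return load_unaligned_seqs(
        path,
        format_name="fasta",  # explicit here; otherwise guessed from the suffix
        moltype="dna",
        label_to_name=first_word,
    )
```

Calling `load_demo(write_demo_fasta(some_directory))` should return a SequenceCollection in which the two sequences are named by the first word of their labels. A glob pattern could be passed in place of a single path, in which case each matching file is assumed to hold one sequence.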