load_unaligned_seqs

load_unaligned_seqs(filename: str | Path, format_name: str | None = None, moltype: Literal['dna', 'rna', 'protein', 'protein_with_stop', 'text', 'bytes'] | None = None, label_to_name: Callable[[str], str] | None = None, parser_kw: dict | None = None, info: dict | None = None, **kw) → SequenceCollection

Loads unaligned sequences from a file.

Parameters:
filename

path to a sequence file, or a glob pattern. For a glob pattern, each file is assumed to contain a single sequence, and all sequences are returned in one SequenceCollection.

format_name

sequence file format. If not specified, the format is guessed from the path suffix.

moltype

the molecular type of the sequences, e.g. DNA, PROTEIN, 'dna', 'protein'

label_to_name

function that converts each original sequence label into the name to use

parser_kw

optional arguments for the parser

info

a dict from which to make an info object

**kw

other keyword arguments passed to SequenceCollection, or show_progress. Setting show_progress displays a progress bar for the number of files processed when filename is a glob pattern.

Returns:
SequenceCollection
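
The parameters above can be combined as in the following minimal sketch. It assumes cogent3 is installed and that the function is importable from the top-level `cogent3` package. The helper names (`first_token`, `write_demo_fasta`, `load_demo`), the file name `demo.fasta` and the sequence data are all hypothetical and exist only for illustration.

```python
from pathlib import Path
import tempfile


def first_token(label: str) -> str:
    """Hypothetical label_to_name callback: keep the text before the first whitespace."""
    return label.split()[0]


def write_demo_fasta(directory: Path) -> Path:
    """Write a small two-sequence FASTA file (made-up data) and return its path."""
    path = directory / "demo.fasta"
    path.write_text(">seq1 sample A\nACGTACGT\n>seq2 sample B\nACGTTT\n")
    return path


def load_demo() -> list:
    """Load the demo file and return the names of the loaded sequences."""
    # Imported here so the helpers above remain usable without cogent3 installed.
    from cogent3 import load_unaligned_seqs

    with tempfile.TemporaryDirectory() as tmp:
        path = write_demo_fasta(Path(tmp))
        seqs = load_unaligned_seqs(
            path,
            format_name="fasta",        # stated explicitly rather than guessed from the suffix
            moltype="dna",
            label_to_name=first_token,  # applied to each original label
        )
        return list(seqs.names)
```

Calling `load_demo()` returns the names of the sequences in the resulting SequenceCollection. To load many single-sequence files at once, `filename` could instead be a glob pattern (for example a hypothetical `"data/*.fasta"`), optionally with `show_progress=True` to display the progress bar described above.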