load_unaligned_seqs

load_unaligned_seqs(filename: str | Path, format=None, moltype=None, label_to_name=None, parser_kw: dict | None = None, info: dict | None = None, new_type: bool = False, **kw) → SequenceCollection

Loads unaligned sequences from a file.

Parameters:
filename

path to a sequence file, or a glob pattern. If a glob pattern is given, each matching file is assumed to contain a single sequence, and all sequences are returned in one SequenceCollection.

format

sequence file format. If not specified, it is guessed from the path suffix.

moltype

the molecular type, e.g. DNA, PROTEIN, ‘dna’, ‘protein’

label_to_name

function for converting the original sequence name into another name.

parser_kw

optional arguments for the parser

info

a dict from which to make an info object

new_type

if True, the returned SequenceCollection will be of the new type (cogent3.core.new_sequence.SequenceCollection). The default will change to True in 2024.12, and support for the old type will be removed as of 2025.6.

**kw

other keyword arguments passed to SequenceCollection, or show_progress. The latter displays a progress bar for the number of files processed when filename is a glob pattern.

Returns:
SequenceCollection
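
As a minimal sketch of how the parameters above might be combined, the following wraps load_unaligned_seqs in a small helper. The names first_token and load_dna are hypothetical and not part of cogent3; the sketch also assumes the input is FASTA-formatted DNA and that label_to_name is called with each original label as a string.

```python
from __future__ import annotations

from pathlib import Path


def first_token(label: str) -> str:
    """Return the text before the first whitespace in a sequence label.

    Hypothetical helper intended for the label_to_name argument.
    """
    return label.split(maxsplit=1)[0]


def load_dna(path: str | Path, show_progress: bool = False):
    """Load unaligned DNA sequences from a FASTA file or a glob pattern.

    Hypothetical convenience wrapper, not part of cogent3.
    """
    # Imported here so first_token stays usable without cogent3 installed.
    from cogent3 import load_unaligned_seqs

    return load_unaligned_seqs(
        path,
        format="fasta",  # state the format rather than rely on the suffix
        moltype="dna",
        label_to_name=first_token,
        show_progress=show_progress,  # passed through **kw
    )
```

With a glob pattern such as "data/*.fasta" (an illustrative path), load_dna("data/*.fasta", show_progress=True) would be expected to return one SequenceCollection containing a sequence from each matching file, with names shortened by first_token.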