Bases: builtins.object
Represents an alignment matrix, used to determine the alignment score between two Sequences
Creates an AlignMatrix object that uses scoreMatrix as a scoring system between amino acids.
Returns all best global alignment between ‘seqA’ and ‘seqB’ with the provided initial and extended gap penalties. If ‘semiGlobal’ is True, allows alignment to discard the start and end of either Sequence. (this allows for better alignments when Sequences overlap only partially)
Returns n best local alignment between ‘seqA’ and ‘seqB’ with the provided initial and extended gap penalties, where n equals ‘resultCount’ (-1 for all). If ‘subOptimalDepth’ equals m, looks for m best suboptimal alignments as well.
Returns n best local M.S.A. of ‘Sequence’ against the ScoreMatrix (which must be position-specific and provide gap penalties) where n equals ‘resultCount’ (-1 for all). If ‘subOptimalDepth’ equals m, looks for m best suboptimal alignments as well.
Bases: builtins.object
Represents two aligned Sequences with some metadata about the alignment. Alignments can be two-sequences alignments or multiple sequence alignments.
Returns True if there’s at least ‘minMatches’ matches between outSequence and any sequence from ‘group’; False otherwise. Size of all sequences is assumed to be equal.
Creates and returns a ScoreMatrix for all sequences in the provided ‘filepaths’, using the BLOSUM approach with an identity of at least ‘requiredIdentityPercent’. Each file is grouped independently and only then their weighted probabilities are merged. :param requiredIdentityPercent: :param filepaths:
Fills and returns a Score according to the BLOSUM algorithm, from the probabilities of AA pairs and singletons provided in ‘probPairs’ and ‘probSingle’.
Evaluates the frequencies of AminoAcids within columns of groups in ‘groupValues’. Frequencies are weighted according to group sizes in ‘groupSizes’. Returns two dictionaries and a number: -‘freqPairs’ maps pairs of AminoAcids to their frequencies -‘freqSingle’ maps single AminoAcids to their frequencies -‘freqSum’ is the sum of all frequencies
Loads Sequences from file at ‘path’ and separate them in groups with an identity of at least ‘requiredIdentityPercent’. Returns a list representing the groups as lists of Protein objects.
Bases: builtins.object
Position Specific Score Matrix. Creates a profile for a series of aligned Sequences, and gives a score to each AA subsitution in a given column.
Bases: builtins.object
Represents a scoring matrix, used to determine the score between two Amino Acids