Association of two genomic intervals, used to represent the composition of one interval by another
Find contigs with deletions and return tuple containing list of idicies of contigs to be replaced along with replacement. e.g.:
([2, 3],
{'seq': 'chr', 'start': 3, 'end': 5, 'contig': 'contig1',
'contig_start': 5, 'contig_end': 10, 'strand': '+', 'contig_size': 20})
replaces contig_composition_list[2:3] with the new contig composition specified
Return length of the reference_species.sequence_id
Return alignment created by stitching MAF blocks along entire sequence (including gaps) Also returns a list of intervals relative to the alignment that indicate the MAF block, block start, and block end of the source of that piece of the alignment
Read contig composition summary and return attributes
Return updated alignment, interval
Subtract two intervals, return list of resulting intervals
Summarize the contig composition in a list of tuples (seq, start, end, contig, contig_start, contig_end, strand, contig_size)
Update list of ContigComposition objects with replacements Replacements are a list of tuples containing a list of indicies of contigs to be replaced along with replacements The replacements must be non-overlapping and sorted
Update Seq object with replacements Replacements must be non-overlapping and sorted
Convert coordinates from GFF or BED file using multi-fasta alignments
Extract fasta sequences from regions defined in GFF/BED file and output fasta to stdout
SYNOPSIS
TODO summarize_alignments multi_align_fasta reference_sequence [-h,–help] [-v,–verbose] [–version]
DESCRIPTION
TODO This describes how to use this script. This docstring will be printed by the script if there is an error or if the user requests help (-h or –help).
EXAMPLES
TODO: Show some examples of how to use this script.
EXIT STATUS
TODO: List exit codes
AUTHOR
TODO: lparsons <lparsons@princeton.edu>
LICENSE
BSD 2-Clause License http://www.opensource.org/licenses/BSD-2-Clause
VERSION
$Id$
Parse a simple event with reference_position, reference_base, and new_base and determine the type and add padding if necessary (for VCF compatibility)
Summarizes changes in given alignment (pairwise only) Input: alignment = Bio.AlignIO object
reference_index = index of the reference sequence in alignment (default is 1)
Utility classes and methods for working with sequence data
A genomic region specified by chr:start-end, using 1-based cooredinates
Take position with gaps and return position without gaps Uses 0-based positions
Take position without gaps and return position with gaps Uses 0-based positions