Preprocessing: pp#
ATAC-seq matrix#
| 
 | Binarize the data matrix in an AnnData or MuData object | 
ATAC-seq fragment#
| 
 | Calculate genome depth for a given chromosome. | 
| Collapse consecutive identical values in an array. | |
| 
 | Convert fragment data to genome coverage signal. | 
| 
 | Convert fragment file to BigWig format. | 
| Splits a fragment file into multiple group-specific fragment files based on cell barcodes. | 
Gene#
| 
 | Extract transcription start site (TSS) coordinates for genes from a GTF file. | 
| 
 | Add the TSS coordinates of genes to mdata[mod_names].uns. | 
Motif#
| Fetch transcription factor motifs from the JASPAR database. | |
| 
 | Filter motifs by matching their names to expressed gene names in the RNA modality. | 
| 
 | Matches transcription factor motifs to accessible DNA sequences and links them with expressed genes. | 
| 
 | Link transcription factors (TFs) to target genes based on TF binding sites. | 
Peaks#
| 
 | Add peak metadata to an ATAC-seq modality in a MuData object. | 
| 
 | Link peaks to genes based on proximity to transcription start sites (TSS). | 
Sequences#
| 
 | Add sequences to peak metadata in a MuData object. | 
| Add genomic variants to DNA sequences from peak regions to generate personalized haplotype sequences. | |
| Shuffle a one-hot encoded DNA sequence while preserving its dinucleotide composition. | |
| Shuffle a DNA sequence while preserving its dinucleotide composition. | |
| 
 | Converts a one-hot encoded DNA matrix back to a nucleotide sequence. | 
| 
 | Generate a random nucleotide sequence of a specified length. | 
| One-hot encodes a DNA sequence while handling unknown bases. | |
| Update reference DNA sequences with genomic variants based on genotype information. |