4.1.1.3.1.3. datanator.data_source.process_rna_seq package

4.1.1.3.1.3.1. Submodules

4.1.1.3.1.3.2. datanator.data_source.process_rna_seq.command_line_core module

class datanator.data_source.process_rna_seq.command_line_core.App(label=None, **kw)[source]

Bases: cement.core.foundation.App

class Meta[source]

Bases: object

base_controller = 'base'[source]
handlers = [<class 'datanator.data_source.process_rna_seq.command_line_core.BaseController'>, <class 'datanator.data_source.process_rna_seq.command_line_core.DownloadCDNA'>, <class 'datanator.data_source.process_rna_seq.command_line_core.DownloadFASTQ'>, <class 'datanator.data_source.process_rna_seq.command_line_core.ProcessCDNA'>, <class 'datanator.data_source.process_rna_seq.command_line_core.ProcessFASTQ'>, <class 'datanator.data_source.process_rna_seq.command_line_core.DeleteCDNAFiles'>, <class 'datanator.data_source.process_rna_seq.command_line_core.DeleteFASTQFiles'>][source]
label = 'datanator'[source]
class datanator.data_source.process_rna_seq.command_line_core.BaseController(*args, **kw)[source]

Bases: cement.ext.ext_argparse.ArgparseController

class Meta[source]

Bases: object

description = 'Utilities for aggregating data for biochemical models'[source]
label = 'base'[source]
class datanator.data_source.process_rna_seq.command_line_core.DeleteCDNAFiles(*args, **kw)[source]

Bases: cement.ext.ext_argparse.ArgparseController

class Meta[source]

Bases: object

arguments = [(['strain_name'], {'type': <class 'str'>, 'help': 'Name of the strain'}), (['temp_directory'], {'type': <class 'str'>, 'help': 'location where strain is saved'})][source]
description = 'Delete CDNA files'[source]
label = 'delete-cdna'[source]
stacked_on = 'base'[source]
stacked_type = 'nested'[source]
class datanator.data_source.process_rna_seq.command_line_core.DeleteFASTQFiles(*args, **kw)[source]

Bases: cement.ext.ext_argparse.ArgparseController

class Meta[source]

Bases: object

arguments = [(['experiment_name'], {'type': <class 'str'>, 'help': 'Name of the experiment'}), (['sample_name'], {'type': <class 'str'>, 'help': 'Name of the sample'}), (['temp_directory'], {'type': <class 'str'>, 'help': 'location where strain is saved'})][source]
description = 'Delete fastq files'[source]
label = 'delete-fastq'[source]
stacked_on = 'base'[source]
stacked_type = 'nested'[source]
class datanator.data_source.process_rna_seq.command_line_core.DownloadCDNA(*args, **kw)[source]

Bases: cement.ext.ext_argparse.ArgparseController

class Meta[source]

Bases: object

arguments = [(['strain_name'], {'type': <class 'str'>, 'help': 'strain of the reference genome'}), (['url'], {'type': <class 'str'>, 'help': 'url to download the reference genome'}), (['temp_directory'], {'type': <class 'str'>, 'help': 'location to save the reference genome'})][source]
description = 'Download CDNA file into specified directory'[source]
label = 'download-cdna'[source]
stacked_on = 'base'[source]
stacked_type = 'nested'[source]
class datanator.data_source.process_rna_seq.command_line_core.DownloadFASTQ(*args, **kw)[source]

Bases: cement.ext.ext_argparse.ArgparseController

class Meta[source]

Bases: object

arguments = [(['experiment_name'], {'type': <class 'str'>, 'help': 'Name of the experiment'}), (['sample_name'], {'type': <class 'str'>, 'help': 'Name of the sample'}), (['temp_directory'], {'type': <class 'str'>, 'help': 'location to save the fastq files'}), (['fastq_urls'], {'type': <class 'str'>, 'help': 'space seperated list of fastq urls'})][source]
description = 'Download FASTQ files into specified directory'[source]
label = 'download-fastq'[source]
stacked_on = 'base'[source]
stacked_type = 'nested'[source]
class datanator.data_source.process_rna_seq.command_line_core.ProcessCDNA(*args, **kw)[source]

Bases: cement.ext.ext_argparse.ArgparseController

class Meta[source]

Bases: object

arguments = [(['strain_name'], {'type': <class 'str'>, 'help': 'strain of the reference genome'}), (['output_directory'], {'type': <class 'str'>, 'help': 'location to save index file'}), (['temp_directory'], {'type': <class 'str'>, 'help': 'location of the reference genomes'})][source]
description = 'Process a CDNA file into a kallisto index file and save it to a specified directory'[source]
label = 'process-cdna'[source]
stacked_on = 'base'[source]
stacked_type = 'nested'[source]
class datanator.data_source.process_rna_seq.command_line_core.ProcessFASTQ(*args, **kw)[source]

Bases: cement.ext.ext_argparse.ArgparseController

class Meta[source]

Bases: object

arguments = [(['experiment_name'], {'type': <class 'str'>, 'help': 'Name of the experiment'}), (['sample_name'], {'type': <class 'str'>, 'help': 'Name of the sample'}), (['strain_name'], {'type': <class 'str'>, 'help': 'strain name of the reference genome'}), (['num_fastq_files'], {'type': <class 'int'>, 'help': 'Number of FASTQ files for sample'}), (['read_type'], {'type': <class 'str'>, 'help': 'Read type of RNA-seq experiment'}), (['output_directory'], {'type': <class 'str'>, 'help': 'location to save the processed files'}), (['temp_directory'], {'type': <class 'str'>, 'help': 'location of the fastq files'})][source]
description = 'Process FASTQ files'[source]
label = 'process-fastq'[source]
stacked_on = 'base'[source]
stacked_type = 'nested'[source]
datanator.data_source.process_rna_seq.command_line_core.main()[source]

4.1.1.3.1.3.3. datanator.data_source.process_rna_seq.core module

datanator.data_source.process_rna_seq.core.delete_cdna_files(strain_name, temp_directory)[source]
datanator.data_source.process_rna_seq.core.delete_fastq_files(experiment_name, sample_name, temp_directory)[source]
datanator.data_source.process_rna_seq.core.download_cdna(ref_genome, strain_name, url, temp_directory)[source]
datanator.data_source.process_rna_seq.core.download_fastq(experiment_name, sample_name, temp_directory, fastq_urls)[source]
datanator.data_source.process_rna_seq.core.get_processed_data_samples(samples, output_directory, temp_directory)[source]
datanator.data_source.process_rna_seq.core.process_cdna(strain_name, output_directory, temp_directory)[source]
datanator.data_source.process_rna_seq.core.process_fastq(experiment_name, sample_name, strain_name, num_fastq_files, read_type, output_directory, temp_directory)[source]

4.1.1.3.1.3.4. datanator.data_source.process_rna_seq.download_cdna module

datanator.data_source.process_rna_seq.download_cdna.download_cdna(ensembl_info, top_dir)[source]
datanator.data_source.process_rna_seq.download_cdna.process_cdna(ensembl_info, top_dir)[source]
datanator.data_source.process_rna_seq.download_cdna.run(ensembl_info, top_dir)[source]
Downloads the CDNA for a given sample, and creates a kallisto index file.

The CDNA file is stored in a “CDNA” subdirectory within the top directory. The kalliso index files are stored within “kallisto_index_files” subdirectory within the top directory

Parameters
  • experiment (array_express.Experiment) – the array express experiment

  • top_dirname (str) – the name of the directory where the overall data is being stored

4.1.1.3.1.3.5. Module contents