v2.0.7 and v2.0.8 has the following updates and changes (November 20, 2021)
volcanoandinvolcanofunctions added toGeneExpressionclassmafunction for MA plot updated and added basemean and p value parameters. basemean parameter is useful for visualizing the results from DESeq2.
v2.0.6 has the following updates and changes (August 18, 2021)
- New function
Fasta.split_seqadded inanalysmodule for splitting the sequence into smaller subsequences
v2.0.5 has the following updates and changes (August 16, 2021)
- New function
HtsAna.merge_featureCountadded inanalysmodule for merging the counts for all samples obtained from featureCounts
v2.0.4 has the following updates and changes (May 10, 2021)
- New class
Fastareplaced with oldfasta Fasta.split_fastafunction added inanalysmodule
v2.0.3 has the following updates and changes (April 14, 2021)
- New class
GeneExpressionandGeneraladded - MA plot updated for adding the label names to genes and plot title
- New theme option added to set new background styles to plots (now work only for MA plot)
gene_expclass will be deprecated in future releases
v2.0.2 has the following updates and changes (March 13, 2021)
- PCA biplot updated for the y scale
- blood pressure dataset added
v2.0.1 has the following updates and changes (March 11, 2021)
- Dark theme added for image backgrounds
v2.0.0 has the following updates and changes (March 10, 2021)
analys.visuz.cluster.biplotfunction updated to create the biplot where variables are more than observations
v1.0.9 has the following updates and changes (March 07, 2021)
analys.visuz.marker.mhatfunction updated to handle the Fst values for Manhattan plot- Boolean
log-scaleparameter added for choice of minus log10 conversion of p values - New marker dataset with Fst values added. This dataset is provided by the Vincent Appiah, which is downloaded from
the Pf3K Project (pilot data release 5). This dataset can be accessed using
analys.get_data('fst').data.
v1.0.8 has the following updates and changes (February 14, 2021)
- Function for regression metrics added (
bioinfokit.analys.stat.reg_metric) - It calculates Root Mean Square Error (RMSE), Mean squared error (MSE), Mean absolute error (MAE), and Mean absolute percent error (MAPE) from regression fit
v1.0.7 has the following updates and changes (January 30, 2021)
- Plant species richness dataset added for regression analysis
v1.0.6 has the following updates and changes (January 29, 2021)
- Individual log fold change and p value cutoff added for volcano and inverted volcano plot
(
bioinfokit.visuz.gene_exp.volcanoandbioinfokit.visuz.gene_exp.involcano)
v1.0.5 has the following updates and changes (December 22, 2020)
analys.gff.gff_to_gtffunction updated to handle dot value for phase in CDS features
v1.0.4 has the following updates and changes (November 22, 2020)
- `Breast Cancer Wisconsin (Diagnostic) Data Set added
visuz.stat.rocfunction added for visualizing the ROC
v1.0.3 has the following updates and changes (November 06, 2020)
bartlettandlevenefunction added toanalys.statclass for checking the ANOVA assumptions for datasets in stacked formattukey_hsdfunction updated for grouping order- Pandas series added as input for
fasta.extract_seqfunction
v1.0.2 has the following updates and changes (October 26, 2020)
extract_seqfunction moved tofastaclassextract_seqfunction deprecated fromanalys- visualization for single and multiple statistical bar charts updated for future releases
v1.0.1 has the following updates and changes (October 24, 2020)
- Tukey HSD test updated for interaction effect. Pairwise comparison for interaction effect can be calculated.
gff_to_gtffunction updated for the GFF3 file for non-coding RNA transcripts. GFF3 files with non-coding transcripts (e.g. from miRBase GFF3) can be converted to GTF
v1.0.0 has the following updates and changes (October 10, 2020)
- genFam enrichment analysis function added (
bioinfokit.analys.genfam.fam_enrich) - genfam test added
v0.9.9 has the following updates and changes (October 04, 2020)
- Tukey HSD test added to perform multiple pairwise comparisons (
bioinfokit.analys.stat.tukey_hsd)
v0.9.8 has the following updates and changes (September 25, 2020)
- new option
mrna_feature_nameadded inanalys.gff.gff_to_gtfif the name of the feature (column 3 of GFF3 file) of protein coding mRNA is other than 'mRNA' or 'transcript' (e.g. some GFF3 file has this feature named as protein_coding_gene ) dimoption added tovisuz.cluster.screeplot,visuz.cluster.pcaplotandvisuz.cluster.biplotto control the figure size
v0.9.7 has the following updates and changes (September 18, 2020)
seqcovmoved tofastqclasssra_dbfunction added underfastqclass for batch download of FASTQ files from NCBI SRA database
v0.9.6 has the following updates and changes (August 22, 2020)
- In t-test, the one sample t and paired t-test added
- Two sample t-test switched to class based method
- t-test function name changed to
ttestfromttsam - programmatic access to chi-squared independence test dataset added
- boxplot removed from t-test
- 'adjustText' module added in
setup.py(issue #12)
v0.9.5 has the following updates and changes (August 14, 2020)
- In chi-squared test, the sum of probabilities is rounded to 10 for exact sum in case of floats
v0.9.4 has the following updates and changes (August 13, 2020)
- chi-squared goodness of fit test added under the
stat.chisq - chi-squared independence test updated for output as class attributes and mosaic plot removed
mergevcfrenamed toconcatvcfto keep with conventional naming (issue # 9)- programmatic access to chi-squared independence test dataset added
marker.vcf_anotfunction updated for tab-delimited text output
v0.9.3 has the following updates and changes (August 08, 2020)
- The error message for volcano, inverted volcano, and MA plot updated when there are no significant or non-significant genes (issue # 7)
- The
vcf_anotfunction output updated for strand information
v0.9.2 has the following updates and changes (July 30, 2020)
- The manhatten plot updated to add the lables in sorted order for numerical strings
- The manhatten plot updated to add figname option
v0.9.1 has the following updates and changes (July 30, 2020)
- TPM normalization function added
v0.9 has the following updates and changes (July 28, 2020)
- RPKM normalization function added
v0.8.9 has the following updates and changes (July 28, 2020)
- gene expression raw count normalization class added as 'analys.norm'
- CPM normalization function added
v0.8.8 has the following updates and changes (July 02, 2020)
- check for lfc_thr and pv_thr added
v0.8.7 has the following updates and changes (July 01, 2020)
- legend labels, position, and figname parameters added in volcano plot
- utility to check the non-numeric values added for
ma,volcanoandinvolcano - plotlegend parameter added to
ma
v0.8.6 has the following updates and changes (June 27, 2020)
- the parameter for log fold change threshold lines added in MA plot
- legend labels, position, and figname parameters added in MA plot
v0.8.5 has the following updates and changes (June 22, 2020)
tsneplotadded for t-SNE visualization- in
bardotdrop NA value function added to ignore missing values to plot dots - scRNA-seq dataset added (PBMC and Arabidopsis root cells)
v0.8.4 has the following updates and changes (June 17, 2020)
fasta_readerandrev_commoved to newly createdfastaclasstsneplotandvcf_anotinitialized for future release- more parameters added in
biplot(cluster coloring, datapoints) fignameadded inhmap
v0.8.3 has the following updates and changes (June 03, 2020)
mafunction updated for absolute expression countssvgfigures addedpcafunction will be deprecated in future release
v0.8.1 and v0.8.2 has the following updates and changes (May 31, 2020)
- 2D and 3D loadings plot, biplot and scree plot functions added under the
clusterclass for PCA - programmatic access to iris and cotton dataset added
pcafunction will be deprecated in future release
v0.8 has the following updates and changes (May 24, 2020)
- GFF3 to GTF file conversion utility added and updated under class
gff
v0.7.3 has the following updates and changes (May 14, 2020)
- In manhatten plot (
visuz.marker.mhat), the labeling issue withmarkernamesparameter corrected (see issue # 4 on github for details; thanks to mkirchler for reporting) gstyleparameter added in manhatten plot for box style annotation
v0.7.2 has the following updates and changes (May 08, 2020)
splitvcffunction added for splitting VCF file into individual VCF files for each chromosomemergevcfmoved toanalys.markerclass
v0.7.1 has the following updates and changes (April 24, 2020)
reg_linfunction updated for multiple regression- degree of freedom fixed for t-test for regression coefficients
- VIF calculation for MLR updated
- functions
fastq_readerandfqreadcountermoved tofastqclass
v0.7 has the following updates and changes
split_fastqfunction added for splitting individual (left and right) paired-end fastq files from single interleaved paired-end file- GFF3 to GTF file conversion utility added under class
gff - two-sample and Welch's t-test updated for CI and alpha parameter added
- module termcolor removed
- Programmatic access of dataset for
ttsamadded
v0.6 has the following updates and changes
- Programmatic access of dataset added (class
get_data) - More features for figures added (
figtype,axtickfontsize,axtickfontname,axxlabel,axylabel,xlm,ylm,yerrlw,yerrcw) - In volcano plot, the typo for xlabel corrected (-log2(FoldChange) to log2(FoldChange))
helpwill be deprecated in future release- VIF calculation for MLR updated
- adjustText removed
v0.5 has the following updates and changes
- Linear regression analysis added in
analys.statclass volcano,involcano,maandheatmapfunctions moved to newvisuz.gen_expclass- In
volcano, parameters for new box type labelling and threshold grid lines added corr_matupdated for new colormaps and moved to stat class- To visualize the graph in console itself (e.g. Jupyter notebook), show parameter added
- Pandas dataframe input added for
volcano,involcano,corr_mat,ma,ttsam, andchisq ttsamandchisqmoved toanalys.statclass- graph control parameters added for
volcano,involcano,ma, andheatmap - documentation can also be accessed at https://reneshbedre.github.io/blog/howtoinstall.html
helpwill be deprecated in future release- fixed the numpy bug in
visuz.stat.bardot. Theintcast added to generate number of samples, which does not accept float (See details of numpy bug: numpy/numpy#15345)
v0.4 has the following updates and changes
- function
analyis.format.fq_qual_var()added for detecting the FASTQ quality encoding format helpmodule added command-line help message- class
fastqadded for FASTQ related functions
v0.3 has the following updates and changes