JWES: Pipeline for WGS & WES data processing, and gene-variant discovery, annotation, prediction, and genotyping

Whole genome and exome sequencing (WGS/WES) are the most popular next-generation sequencing (NGS) methodologies, and are at present often used to detect rare and common genetic variants of clinical significance. We emphasize that automated sequence data processing, management, and visualization should be an indispensable component of modern WGS and WES data analysis for sequence assembly, variant detection (SNPs, SVs), imputation, and resolution of haplotypes. In this manuscript, we present a newly developed findable, accessible, interoperable, and reusable (FAIR) bioinformatics-genomics pipeline (JWES: Java based Whole Genome/Exome Sequence Data Processing Pipeline) for efficient variant discovery and interpretation, and big data modelling and visualization. JWES is a cross-platform, user-friendly, product line application, that entails three modules: (i) data processing, (ii) storage, and (iii) visualization. The data processing module performs a series of different tasks for variant calling, the data storage module efficiently manages high-volume gene-variant data, and the data visualization module supports variant data interpretation with Circos graphs. The performance of JWES was tested and validated in-house with different experiments, using Microsoft Windows, macOS Big Sur and UNIX operating systems. JWES is an open-source and freely available pipeline, allowing scientists to take full advantage of all the computing resources available, without requiring much computer science knowledge. We have successfully applied JWES for processing, management, and gene-variant discovery, annotation, prediction, and genotyping of WGS and WES data to analyze variable complex disorders. In summary, we report the performance of JWES with some reproducible case studies, using open access and in-house generated, high quality datasets.

Publication:

Ahmed, Z., Renart., E., Mishra, D., Zeeshan, S. (2021). JWES: A new pipeline for whole genome/exome sequence data processing, management, and gene-variant discovery, annotation, prediction, and genotyping. FEBS Open Bio (Wiley, Federation of European Biochemical Societies). PMID: 34370400