
The Indian Cancer Genome Atlas (ICGA) provides a comprehensive, clinically annotated, multi-omics data visualization platform that enables an integrative understanding of cancer in the Indian population.
Powered by the cBioPortal framework, the ICGA Portal allows researchers to explore and analyze genomic, transcriptomic, proteomic, and clinical datasets through an intuitive and interactive interface. The platform currently hosts datasets beginning with Indian breast cancer cohorts and will expand to additional cancer types over time.
The ICGA Portal offers researchers a powerful gateway for accessing processed cancer genomic dataset curated by ICGA. It enables researchers to perform exploratory and hypothesis-driven analyses while ensuring compliance with ethical, legal, and data governance frameworks.
The portal has been developed with philanthropic support from Strand Life Sciences Ltd. and reflects ICGA’s commitment to responsible data sharing in cancer research.
ICGA adheres to strict ethical and regulatory standards in data sharing.
The portal provides secondary-level processed datasets, including:
The portal visualizes highly processed, curated, and harmonized multidimensional cancer genomics data.
The ICGA Breast Cancer Cohort represents a foundational dataset within the ICGA program.
Metadata of the ICGA’s cohort on breast cancer patients
The ICGA follows a controlled access model governed by the Data Access Committee (DAC) in alignment with ICGA Data Policy and DBT PRIDE guidelines.
The ICGA Foundation is committed to aligning its data governance framework with the Digital Personal Data Protection (DPDP) Act 2023 and the Digital Personal Data Protection Rules 2025, notified by the Ministry of Electronics and Information Technology (MeitY) in November 2025. The Rules provide for a phased implementation, with full substantive compliance required by May 2027. ICGA’s governance policies and data access framework are being updated accordingly during this period. Researchers with queries about data governance or compliance may write to suveera@icga.co.in.
ICGA data is accessible through two routes. Both routes require a completed application and approval by the DAC before access is granted.
Route A — ICGA Data Portal (cBioPortal)
Interactive, browser-based access to processed and visualised datasets within the secure ICGA portal environment. Please remember no raw data would be available here. Researchers can explore somatic mutation profiles, gene expression patterns, proteomics summaries, and associated clinical metadata using the portal’s built-in analysis tools. Data remains within ICGA’s secure infrastructure at all times. [The portal visualizes highly processed, curated, and harmonized multidimensional cancer genomics data. ]
Route B — AWS Controlled Access
Programmatic access to processed data files — including VCF/MAF files, expression matrices, and proteomics outputs — for researchers requiring computational analysis beyond what the portal interface supports. Access is provided within ICGA-managed, India-based infrastructure. Applicants are responsible for all associated AWS infrastructure costs. Contact ICGA for details.
Both routes are subject to a single consolidated application reviewed by the DAC.
Note: Commercial and industry applications are subject to additional review including execution of a Commercial Data Licensing Agreement. Contact suveera@icga.co.in before submitting any data request.
Researchers seeking access must submit a single consolidated application that includes:
All applications are reviewed by the ICGA Data Access Committee (DAC). Incomplete applications will not be considered. Full and final approval is followed by the signing of a Data User Agreement (DUA) with ICGA.
In cases where there is clear scientific justification and upon DAC approval, researchers may be granted access to processed data files within an ICGA-approved secure compute environment. Data does not leave ICGA’s governed infrastructure unless the DAC exceptionally approves it. The modality of access will be determined by ICGA on a case-by-case basis following DAC’s review.
Such requests must demonstrate:
Eligible file types include processed somatic variant data (VCF/MAF), normalised expression matrices, and proteomics outputs.
The following are not available for external access at this stage:
Requests for the above will not be considered under the current policy framework.
All approved users must comply with the following conditions throughout the approved access period:
Suggested citation:“The results [published or shown] here are based, in whole or in part, on data generated by the Indian Cancer Genome Atlas (ICGA) Network: https://icga.in, https://icga.net.in“
ICGA is dedicated to advancing cancer research through a rigorous, end-to-end process that involves:
Due to legal and ethical considerations, ICGA is unable to accommodate requests for biological samples, analytes, or tissue materials. All cases within the ICGA programme have been consented exclusively for ICGA use, and the redistribution of materials to outside parties is prohibited. Additionally, the majority of tissue samples have been depleted through the multiple assays performed for ICGA research.
The portal provides access to processed, secondary-level data, including:
Most visualizations allow download of the underlying data.
Yes, provided your specific access request was approved with download. Data can be downloaded from:
Users can also define custom cohorts (“virtual studies”) using clinical or genomic filters and download the corresponding datasets.
No. The portal does not host raw sequencing data or raw count-level datasets.
It is designed for access to curated and processed data only.
Access to controlled datasets requires application through the ICGA Data Access Committee (DAC).
This includes:
For queries, contact:
No. Since the portal provides processed, gene-level summarized data rather than raw count matrices, workflows requiring raw count reprocessing are not supported directly from portal downloads.
The portal supports:
Users can query specific genes or define cohorts before exporting results.
No. The ICGA Portal is an independent instance of the cBioPortal platform and hosts ICGA-specific datasets only.
No. The interface is designed to be intuitive. However, familiarity with cBioPortal workflows may be helpful for advanced queries and cohort analyses.
Yes. Users can apply clinical and genomic filters to create custom cohorts (“virtual studies”) for downstream exploration and data export.
Users are required to acknowledge the Indian Cancer Genome Atlas (ICGA) in any publications, presentations, or outputs derived from ICGA data.
Suggested citation:
“The results published or shown here are based, in whole or in part, on data generated by the Indian Cancer Genome Atlas (ICGA) Network: https://icga.in and https://icga.net.in.”
Where applicable, users should also cite associated ICGA publications relevant to the dataset used.
For citation-related queries, contact:
Yes. ICGA datasets are periodically updated as new data is generated, processed, and curated.
Breast cancer study data, for example, is routinely updated.
Users should record:
for reproducibility and future reference.
For dataset version queries, contact: