Downloads

All data

Users can also download data for the viewed protein or a position within the protein. Data can also be accessed through the REST API. Tables can also be downloaded.


Basic
Genome
Comlexity
SNP & Disease
Structure
Disorder
TCGA
COSMIC
Motifs & PTM
Uniprot
Bindings
Phase Separation
Conservation
Sig. Mut.

Protein specific data


Protein specific information can be downloaded by clicking on the icon on the left corner. The Button look like:


Positional data


The user can access detailed information for each position by clicking on the residues in the amino acid sequence. For example, TCGA mutational data contains all mutations with their sample id.

These data can be downloaded in two formats:

  • txt
  • json

REST API

To enable automated access, DisCanVis data is also available via REST API both at protein level or at a region level.

1. Fasta file and all annotation for a given protein

Users can access the sequence in fasta format and the whole annotation for the protein. Requests should be input following the syntax:

https://discanvis.elte.hu/rest/::identifier::.::response_type::

Where

::identifier:: is any valid identifier from DisCanVis and ::response_type:: is json or txt or fasta

Examples:

https://discanvis.elte.hu/rest/WNK3_ENST00000375159.fasta
https://discanvis.elte.hu/rest/WNK3_ENST00000375159.json
https://discanvis.elte.hu/rest/WNK3_ENST00000375159.txt


2. Specific annotation for a given protein

For specific annotation from the current protein users can access with the following syntax:

https://discanvis.elte.hu/rest/::identifier::/::annotation::

Examples:

https://discanvis.elte.hu/rest/WNK3_ENST00000375159/protein
https://discanvis.elte.hu/rest/WNK3_ENST00000375159/elm
https://discanvis.elte.hu/rest/WNK3_ENST00000375159/tcgam

Choose able annotations:

  • protein
  • exon
  • phastcons
  • complexity-seg
  • complexity-dust
  • complexity-trf
  • polymorphism
  • omim
  • clinvar
  • pdb
  • pfam
  • anchor
  • iupred
  • mobidb
  • alphafold
  • tcgam
  • tcgaf
  • tcgai
  • cosmicm
  • cosmicf
  • cosmici
  • roisig
  • elm
  • elmswitches
  • ptm
  • roi
  • binding
  • dibs
  • mfib
  • binding_domain
  • phasepro
  • conservation

3. Region for a specific protein with all annotation

The user can access all data with a given protein in specific region with a following syntax:

https://discanvis.elte.hu/visual/::identifier::/::positionstart::-::positionend::.::response_type::

Where

::identifier:: is any valid identifier from DisCanVis ::positionstart:: and ::positionend:: is a positive integer within range of the length of the sequence of protein coded by ::identifier:: . The positions should be in a valid range (start should be lower than end). The ::response_type:: is either json or txt

Examples:

https://discanvis.elte.hu/rest/WNK3_ENST00000375159/537-550.json
https://discanvis.elte.hu/rest/WNK3_ENST00000375159/537-550.txt


4. Position for a specific protein with all annotation

Requests should be input following the syntax:

https://discanvis.elte.hu/visual/::identifier::/::position::.::response_type::

Where

::identifier:: is any valid identifier from DisCanVis ::position:: is a positive integer within range of the length of the sequence of protein coded by ::identifier::. ::response_type:: is either json or txt

Examples:

https://discanvis.elte.hu/rest/WNK3_ENST00000375159/537.json
https://discanvis.elte.hu/rest/WNK3_ENST00000375159/537.txt



Access Protein annotations with different identifiers

Requests should be input following the syntax:

https://discanvis.elte.hu/rest/::identifier::/::target_arguments::?search_type=::identifier_search_type::

Where

::identifier_search_type:: currently can be:

  • uniprot
  • transcript

Examples:


import requests
import json
from io import StringIO
"""
Position specific request
"""

"""
TXT read example
"""
response = requests.get('https://discanvis.elte.hu/rest/WNK3_ENST00000375159/537.txt')
txt = response.text[5:-7]
data = StringIO(txt)
dct = {}
annotation = None
annotation_txt = ''
for line in data:
    if line.startswith('#'):
        if annotation:
            dct[annotation] = annotation_txt
            annotation_txt = ''
        annotation = line.strip('#\n')
    else:
        annotation_txt += line
print(dct['protein'])

                        

Output:

	Protein ID	UniProt Accession	Gene Name	Name	Transcript ID	Chromosome	Cancer Driver
0	WNK3_ENST00000375159	Q9BYP7	WNK3	Serine/threonine-protein kinase WNK3	ENST00000375159.6	chrX	Not Cancer Driver	
"""
Json read example
"""
response = requests.get('https://discanvis.elte.hu/rest/WNK3_ENST00000375159/537.json')
dct = json.loads(response.text)
print(dct['elm'])

Output:

[
    {
        "ELMIdentifier": "DEG_Kelch_KLHL3_1",
        "ELMType": "DEG",
        "ELM_Accession": "ELMI002836",
        "End": 546,
        "InstanceLogic": "true positive",
        "Methods": "coimmunoprecipitation; fluorescence polarization spectroscopy",
        "Organism": "Homo sapiens",
        "PDB": "nan",
        "References": "23838290",
        "Start": 537,
        "id": 1109,
        "protein_id": "WNK3_ENST00000375159"
    }
]