Access Data

Downloads Rest Api

Downloads

All data

Users can also download data for the viewed protein or a position within the protein. Data can also be accessed through the REST API. Tables can also be downloaded.

Basic

Genome

Comlexity

SNP & Disease

Structure

Disorder

TCGA

COSMIC

Motifs & PTM

Uniprot

Bindings

Phase Separation

Conservation

Sig. Mut.

Protein specific data

Protein specific information can be downloaded by clicking on the icon on the left corner. The Button look like:

Positional data

The user can access detailed information for each position by clicking on the residues in the amino acid sequence. For example, TCGA mutational data contains all mutations with their sample id.

These data can be downloaded in two formats:

txt
json

REST API

To enable automated access, DisCanVis data is also available via REST API both at protein level or at a region level.

1. Fasta file and all annotation for a given protein

Users can access the sequence in fasta format and the whole annotation for the protein. Requests should be input following the syntax:

https://discanvis.elte.hu/rest/::identifier::.::response_type::

Where

::identifier:: is any valid identifier from DisCanVis and ::response_type:: is json or txt or fasta

Examples:

https://discanvis.elte.hu/rest/WNK3_ENST00000375159.fasta
https://discanvis.elte.hu/rest/WNK3_ENST00000375159.json
https://discanvis.elte.hu/rest/WNK3_ENST00000375159.txt

2. Specific annotation for a given protein

For specific annotation from the current protein users can access with the following syntax:

https://discanvis.elte.hu/rest/::identifier::/::annotation::

Examples:

https://discanvis.elte.hu/rest/WNK3_ENST00000375159/protein
https://discanvis.elte.hu/rest/WNK3_ENST00000375159/elm
https://discanvis.elte.hu/rest/WNK3_ENST00000375159/tcgam

Choose able annotations:

protein
exon
phastcons
complexity-seg
complexity-dust
complexity-trf
polymorphism
omim
clinvar
pdb
pfam
anchor
iupred
mobidb
alphafold
tcgam
tcgaf
tcgai
cosmicm
cosmicf
cosmici
roisig
elm
elmswitches
ptm
roi
binding
dibs
mfib
binding_domain
phasepro
conservation

3. Region for a specific protein with all annotation

The user can access all data with a given protein in specific region with a following syntax:

https://discanvis.elte.hu/visual/::identifier::/::positionstart::-::positionend::.::response_type::

Where

::identifier:: is any valid identifier from DisCanVis ::positionstart:: and ::positionend:: is a positive integer within range of the length of the sequence of protein coded by ::identifier:: . The positions should be in a valid range (start should be lower than end). The ::response_type:: is either json or txt

Examples:

https://discanvis.elte.hu/rest/WNK3_ENST00000375159/537-550.json
https://discanvis.elte.hu/rest/WNK3_ENST00000375159/537-550.txt

4. Position for a specific protein with all annotation

Requests should be input following the syntax:

https://discanvis.elte.hu/visual/::identifier::/::position::.::response_type::

Where

::identifier:: is any valid identifier from DisCanVis ::position:: is a positive integer within range of the length of the sequence of protein coded by ::identifier::. ::response_type:: is either json or txt

Examples:

https://discanvis.elte.hu/rest/WNK3_ENST00000375159/537.json
https://discanvis.elte.hu/rest/WNK3_ENST00000375159/537.txt

Access Protein annotations with different identifiers

Requests should be input following the syntax:

https://discanvis.elte.hu/rest/::identifier::/::target_arguments::?search_type=::identifier_search_type::

Where

::identifier_search_type:: currently can be:

uniprot
transcript

Examples:

import requests
import json
from io import StringIO
"""
Position specific request
"""

"""
TXT read example
"""
response = requests.get('https://discanvis.elte.hu/rest/WNK3_ENST00000375159/537.txt')
txt = response.text[5:-7]
data = StringIO(txt)
dct = {}
annotation = None
annotation_txt = ''
for line in data:
    if line.startswith('#'):
        if annotation:
            dct[annotation] = annotation_txt
            annotation_txt = ''
        annotation = line.strip('#\n')
    else:
        annotation_txt += line
print(dct['protein'])

Output:

	Protein ID	UniProt Accession	Gene Name	Name	Transcript ID	Chromosome	Cancer Driver
0	WNK3_ENST00000375159	Q9BYP7	WNK3	Serine/threonine-protein kinase WNK3	ENST00000375159.6	chrX	Not Cancer Driver

"""
Json read example
"""
response = requests.get('https://discanvis.elte.hu/rest/WNK3_ENST00000375159/537.json')
dct = json.loads(response.text)
print(dct['elm'])

Output:

[
    {
        "ELMIdentifier": "DEG_Kelch_KLHL3_1",
        "ELMType": "DEG",
        "ELM_Accession": "ELMI002836",
        "End": 546,
        "InstanceLogic": "true positive",
        "Methods": "coimmunoprecipitation; fluorescence polarization spectroscopy",
        "Organism": "Homo sapiens",
        "PDB": "nan",
        "References": "23838290",
        "Start": 537,
        "id": 1109,
        "protein_id": "WNK3_ENST00000375159"
    }
]