Downloads
All data
Users can also download data for the viewed protein or a position within the protein. Data can also be accessed through the REST API. Tables can also be downloaded.
Basic
Genome
Comlexity
SNP & Disease
Structure
Disorder
TCGA
COSMIC
Motifs & PTM
Uniprot
Bindings
Phase Separation
Conservation
Sig. Mut.
Protein specific data
Protein specific information can be downloaded by clicking on the icon on the left corner. The Button look like:
Positional data
The user can access detailed information for each position by clicking on the residues in the amino acid sequence. For example, TCGA mutational data contains all mutations with their sample id.
These data can be downloaded in two formats:
- txt
- json
REST API
To enable automated access, DisCanVis data is also available via REST API both at protein level or at a region level.
1. Fasta file and all annotation for a given protein
Users can access the sequence in fasta format and the whole annotation for the protein. Requests should be input following the syntax:
https://discanvis.elte.hu/rest/::identifier::.::response_type::
Where
::identifier::
is any valid identifier from DisCanVis and::response_type::
is json or txt or fasta
Examples:
https://discanvis.elte.hu/rest/WNK3_ENST00000375159.fasta https://discanvis.elte.hu/rest/WNK3_ENST00000375159.json https://discanvis.elte.hu/rest/WNK3_ENST00000375159.txt
2. Specific annotation for a given protein
For specific annotation from the current protein users can access with the following syntax:
https://discanvis.elte.hu/rest/::identifier::/::annotation::
Examples:
https://discanvis.elte.hu/rest/WNK3_ENST00000375159/protein https://discanvis.elte.hu/rest/WNK3_ENST00000375159/elm https://discanvis.elte.hu/rest/WNK3_ENST00000375159/tcgam
Choose able annotations:
- protein
- exon
- phastcons
- complexity-seg
- complexity-dust
- complexity-trf
- polymorphism
- omim
- clinvar
- pdb
- pfam
- anchor
- iupred
- mobidb
- alphafold
- tcgam
- tcgaf
- tcgai
- cosmicm
- cosmicf
- cosmici
- roisig
- elm
- elmswitches
- ptm
- roi
- binding
- dibs
- mfib
- binding_domain
- phasepro
- conservation
3. Region for a specific protein with all annotation
The user can access all data with a given protein in specific region with a following syntax:
https://discanvis.elte.hu/visual/::identifier::/::positionstart::-::positionend::.::response_type::
Where
::identifier::
is any valid identifier from DisCanVis ::positionstart::
and ::positionend::
is a
positive integer within range of the length of the sequence of protein coded by ::identifier::
.
The positions should be in a valid range (start should be lower than end). The ::response_type::
is either json or txt
Examples:
https://discanvis.elte.hu/rest/WNK3_ENST00000375159/537-550.json https://discanvis.elte.hu/rest/WNK3_ENST00000375159/537-550.txt
4. Position for a specific protein with all annotation
Requests should be input following the syntax:
https://discanvis.elte.hu/visual/::identifier::/::position::.::response_type::
Where
::identifier::
is any valid identifier from DisCanVis
::position::
is a positive integer within range of the length of the sequence of protein coded by ::identifier::
.
::response_type::
is either json or txt
Examples:
https://discanvis.elte.hu/rest/WNK3_ENST00000375159/537.json https://discanvis.elte.hu/rest/WNK3_ENST00000375159/537.txt
Access Protein annotations with different identifiers
Requests should be input following the syntax:
https://discanvis.elte.hu/rest/::identifier::/::target_arguments::?search_type=::identifier_search_type::
Where
::identifier_search_type::
currently can be:
- uniprot
- transcript
Examples:
import requests
import json
from io import StringIO
"""
Position specific request
"""
"""
TXT read example
"""
response = requests.get('https://discanvis.elte.hu/rest/WNK3_ENST00000375159/537.txt')
txt = response.text[5:-7]
data = StringIO(txt)
dct = {}
annotation = None
annotation_txt = ''
for line in data:
if line.startswith('#'):
if annotation:
dct[annotation] = annotation_txt
annotation_txt = ''
annotation = line.strip('#\n')
else:
annotation_txt += line
print(dct['protein'])
Output:
Protein ID UniProt Accession Gene Name Name Transcript ID Chromosome Cancer Driver
0 WNK3_ENST00000375159 Q9BYP7 WNK3 Serine/threonine-protein kinase WNK3 ENST00000375159.6 chrX Not Cancer Driver
"""
Json read example
"""
response = requests.get('https://discanvis.elte.hu/rest/WNK3_ENST00000375159/537.json')
dct = json.loads(response.text)
print(dct['elm'])
Output:
[
{
"ELMIdentifier": "DEG_Kelch_KLHL3_1",
"ELMType": "DEG",
"ELM_Accession": "ELMI002836",
"End": 546,
"InstanceLogic": "true positive",
"Methods": "coimmunoprecipitation; fluorescence polarization spectroscopy",
"Organism": "Homo sapiens",
"PDB": "nan",
"References": "23838290",
"Start": 537,
"id": 1109,
"protein_id": "WNK3_ENST00000375159"
}
]