Notes
The 'drug-targets' and 'disease-genes' files are needed when calculating the proximity, these files contain the relationships of the drug to its targets and the disease to its genes respectively.
Each file contains at lease these columns:
chemical,symbol,entrez_id
lepirudin,F2,2147
cetuximab,C1QA,712
cetuximab,C1QB,713
cetuximab,EGFR,1956
denileukin diftitox,IL2RA,3559
...
disease,entrez_id,symbol
abdominal pelvic pain,1128,CHRM1
abnormal micturition,1128,CHRM1
mild neurocognitive disorder,148,ADRA1A
pain,148,ADRA1A
post-traumatic stress disorder,148,ADRA1A
prostate cancer,148,ADRA1A
...
Select the files, enter an email address and then click the submit button
A webpage will show to The results can be downloaded in the page using the ID (e.g., testtestcom37138276).
Important notes:
-
In drug-targets or disease-genes files, values in different columns are separated by comma
-
The first line of the drug-targets file must contains 'disease', 'entrez_id' (without the single quote), 'symbol' is an option
-
The first line of the disease-gene file must contains 'disease', 'entrez_id' (without the single quote), 'symbol' is an option
-
In both files, entrez_id represents the gene ID (a sequence of number, e.g. 1386), as shown in NCBI
site, the Gene ID in the Gene database.
-
The name of drug or disease are treated as character sequences, in which lowercase and uppercase are seemed as different, so please make sure all the characters in the name are in lowercase
-
Unify the drug or disease names, and strip the name if non-ASCII characters are included
-
If there are commas in the name of the drug or disease, please quote the whole name with double quotes, such as "1,4-dithiothreitol"