Virus interactions with human signal transduction pathways

Viruses depend on their hosts at every stage of their life cycles and must therefore communicate with them via Protein-Protein Interactions (PPIs). To investigate the mechanisms of communication by different viruses, we overlay reported pairwise human-virus PPIs on human signalling pathways. Of 671 pathways obtained from NCI and Reactome databases, 355 are potentially targeted by at least one virus. The majority of pathways are linked to more than one virus. We find evidence supporting the hypothesis that viruses often interact with different proteins depending on the targeted pathway. Pathway analysis indicates overrepresentation of some pathways targeted by viruses. The merged network of the most statistically significant pathways shows several centrally located proteins, which are also hub proteins. Generally, hub proteins are targeted more frequently by viruses. Numerous proteins in virus-targeted pathways are known drug targets, suggesting that these might be exploited as potential new approaches to treatments against multiple viruses. Copyright © 2011 Inderscience Enterprises Ltd. *Corresponding author. NIH Public Access Author Manuscript Int J Comput Biol Drug Des. Author manuscript; available in PMC 2012 July 29. Published in final edited form as: Int J Comput Biol Drug Des. 2011 ; 4(1): 83–105. doi:10.1504/IJCBDD.2011.038658. N IH PA Athor M anscript N IH PA Athor M anscript N IH PA Athor M anscript


Introduction
Infectious diseases cause millions of deaths every year.New infectious diseases appear regularly in diverse parts of the globe, even as we fail to eradicate established diseases such as Acquired Immune Deficiency Syndrome (AIDS), malaria and tuberculosis.Virulent strains of known pathogens increasingly appear, as in the recent swine flu outbreak.Current treatments and vaccinations are focused on the pathogen.However, pathogens are able to adapt and rapidly evolve mechanisms to evade anti-bacterial and anti-viral drugs and vaccines.Moreover, once the pathogen has evaded the immune system, the infection may become chronic.Recent efforts in vaccine development therefore include targeting the host by combining pathogen-derived antigens with adjuvants, for example to boost the host immune system (Lederman, 1995).New avenues for drug discovery may arise from study of the interactions between the pathogens and their hosts.When a pathogen invades a cell, it depends on the host's resources at all stages of its life cycle.To subvert the host cellular machinery for its purposes, the pathogen must communicate with the host, via chains of interactions referred to as signal transduction pathways.These allow cells to respond to the environment, and pathogens to take over the host.
Recently, more than 2500 interactions involving ~1000 human and 17 Human Immunodeficiency Virus (HIV) proteins have been catalogued (Fu et al., 2009).While not all of these interactions may be based on physical contacts, this large number provides a glimpse into the extent of the communication between HIV and its human host, which is particularly remarkable considering the small number of HIV proteins.Small genome size is general for other viral genomes also, which typically encode less than 20 proteins.Increasingly, the interactions involving viruses other than HIV are also being catalogued, e.g., by the Pathogen Interaction Gateway (PIG) (Driscoll et al., 2009) and VirusMINT (Chatr-Aryamontri et al., 2009) efforts.These databases store information on PPIs between multiple hosts and pathogens.The VirusMINT database (Chatr-Aryamontri et al., 2009) integrates interactions involving proteins from 149 virus strains and their human binding partners, obtained from sources such as MINT (Zanzoni et al., 2002), IntAct (Kerrien et al., 2007) and the HIV-1, Human Protein Interaction Database (Fu et al., 2009).PIG, in addition to interactions with viruses, also includes bacteria (Driscoll et al., 2009).These efforts now allow a global view of the PPI networks between pathogens and their hosts.Earlier analysis of proteins targeted by pathogens including bacteria has revealed some common features that appear to be shared amongst several host-pathogen systems (Dyer et al., 2008).The topological properties of proteins in the human PPI network indicate that pathogens target hub proteins, i.e., proteins that have many interacting partners, or bottlenecks, i.e., proteins that have many 'shortest paths' going through them (Dyer et al., 2008).A common set of host proteins was identified that are frequently targeted by multiple pathogens, suggesting that at least some of the proteins targeted in humans are conserved across pathogens.
In this study, we strive to understand the functional significance of the interactions between pathogens and their hosts in the context of signal transduction pathways that may be disrupted or altered by PPIs involving the pathogens.The focus here is on viruses.A comparison of the potential effects of various pathogenic viruses on different signal transduction pathways in the host cell will allow addressing questions such as which viruses target the same pathways, which viruses target pathways that are less or not at all targeted by other viruses and which proteins are common targets for many viruses given the current interaction data.Addressing the above-mentioned questions is expected to advance our fundamental understanding of virus-host communication mechanisms and may help antiviral drug discovery.

Pathway data and path identification
Pathway data was collected from the NCI and Reactome protein pathway databases (Matthews et al., 2009;Schaefer et al., 2009) in November 2008 and processed as described previously (Balakrishnan et al., 2009).Only pathways under "Biomolecular interactions and cellular processes assembled into authoritative human signaling pathways" were downloaded from the PID website, http://pid.nci.nih.gov/.Thus, 671 pathways were analysed, representing 102 and 569 pathways from the NCI and Reactome databases, respectively.

Interactions data
Information on the interactions involving multiple viruses with human proteins was obtained from the VirusMINT database, which includes manually curated data as well as interactions downloaded from other databases (Chatr-Aryamontri et al., 2009).The VirusMINT database contains PPI data for 557 proteins and 149 different viral strains.This corresponds to 2007 unique interactions supported by 5483 experimental evidences (Chatr-Aryamontri et al., 2009).The interactions were downloaded on 4 August 2009.Data in VirusMINT that was imported from MINT and IntAct contained full experimental details while the HIV-1 Human Protein Interactions Database (Schaefer et al., 2009) does not provide a description of the experimental evidences.Therefore, VirusMINT reports only the subset of data from the HIV-1 Human Protein Interactions Database, which represents enzymatic reactions, physical associations and co-localisation (Chatr-Aryamontri et al., 2009).Some of the viruses reported in the VirusMINT database were very close relatives.We, therefore, grouped strains of the same virus as one category for each virus.The membership of different strains in their respective group is listed in Table 2, along with the number of interactions that has been reported in the VirusMINT database for each group.
For hub analysis, two databases were used.First, to analyse human, HIV interactions, we used the NIAID database (Fu et al., 2009) and defined a subset of those interactions relating to direct physical interactions as described (Tastan et al., 2009).Human PPIs were retrieved from the HPRD (Keshava Prasad et al., 2009).Second, to compare all viruses, human PPI data was obtained from the Protein Interaction Network Analysis (PINA) platform (4 March 2010 release) (Wu et al., 2009).

Curve fitting
Protein-pathway size dependency curves were fit by Matlab Inc. software using various standard regression techniques, including polynomial, linear, exponential, Weibull, fractional and Gaussian.A comparison of the performance of the fit for different functions and degrees was carried out by plotting the residual error and mean squared error for each fit.The smoothest fit was obtained by a polynomial function of degree 5, which is shown in Figure 2.

Drug target and protein data
For all virus-targeted human proteins, we checked if they are listed as drug targets in DrugBank (Wishart et al., 2008).The list of drug targets was obtained from DrugBank version 2.5.Additional functional information about the targeted proteins was obtained from the Protein knowledgebase (UniProtKB) (Apweiler et al., 2004).

Network analysis
We used the Network Modules function in the Ingenuity Pathway Analysis (IPA, Ingenuity® Systems, http://www.ingenuity.com) to identify the enriched networks for the proteins targeted by viruses.These proteins (genes) were clustered into several networks based on their PPI, regulation and other relationships.Enrichment of networks was tested by right-tailed Fisher's exact test based on the hypergeometric distribution, and its p value was transformed to score by −log 10 p (Jia et al., 2011;Sun et al., 2010).We used the Merge Networks function in IPA to merge top networks to have a more systematic view into the PPI features for virus targeting.Specifically in this study, we merged the top two networks to gain more insights into the function of genes and their relationships.

Results and discussion
An overall summary of the numbers of proteins and pathways investigated in this study with respect to their targeting by viruses with reference to the number of drug targets in the respective groups is provided in Table 1.

Global virus-pathway interactions
To obtain a general view of how many pathways are potentially targeted by known virus, human interactions, we enumerated the numbers of virus-interacting human proteins present in each pathway.The result is displayed in Figure 1. Figure 1(A) shows the number of pathways vs. the number of proteins targeted in them.A large number of pathways (>80) contain a single known virus binding partner, but the majority of pathways contain more than one protein interacting with a virus.Most pathways include between 2 and 42 virus binding partners, but two pathways display an exceptionally large number of proteins known to interact with viruses.These are the TGF-β and the canonical NF-κB pathways, with 50 and 78 proteins that are known virus binding partners, respectively.These proteins account for 23% and 10% of the proteins present in the corresponding pathway, respectively.Figure 1(B) shows the number of pathways targeted by one or more viruses.The number of pathways targeted by a single virus is of course equal to the number of pathways targeted by a single protein (>80), as in Figure 1(A).Most pathways contain proteins interacting with 2-5 different viruses, but some pathways are potentially targeted by up to 18 of the 31 viruses analysed.The TGF-β signalling pathway and canonical NF-κB pathway are targeted by 18 and 16 viruses, respectively.
To investigate if pathway targeting is correlated with the size of the pathways, we grouped the pathways according to the number of proteins in the pathways in bins of size 3.Then, we calculated the average number of viruses targeting at least one protein in each pathway group.The results are shown in Figure 2. The results indicate that -as expected -pathways containing more proteins are more likely to be targeted by viruses.This statement is substantiated by fitting of the curve.The best result was obtained with a polynomial fit of degree 5.

Pathway targeting by different viruses
The 31 virus groups reported to have interactions with human proteins according to VirusMINT are listed in Table 2.For CRPV, HEV and SeV, there are only 5, 3 and 1 interactions reported.These viruses were, therefore, omitted in our analysis.The remaining 28 viruses cover a range of species: 16 have a double-stranded DNA genome, 2 have a single-stranded RNA negative strand genome, 3 have a single-stranded RNA positive strand genome, 6 are retro-transcribing virus and one is a deltavirus.Nine virus groups, HIV, Human papillomavirus (HPV), EBV, Human adenovirus (HadV), Human herpes virus (HHV), Simian virus (SV), Vaccinia virus (VACV), Bovine papillomavirus (BPV) and FLUAV, have the maximum number of known interactions with human proteins.They interact with proteins in 289, 153, 121, 106, 139, 121, 62, 68 and 57 pathways, respectively.In the following, we briefly discuss the PPI of these nine groups, with particular emphasis on the cell surface receptors used by these viruses and the pathways most frequently targeted.
Human immunodeficiency virus-1 (HIV), the causative agent of AIDS, encodes 15 proteins (Frankel and Young, 1998).The maximum number of interactions with human proteins has been reported for this virus.These interactions potentially affect 289 human pathways (Table 2).The largest number of interactions ( 48) is with the proteins in the canonical NF-κB pathway.HIV initiates infection via the CD4 receptor (Maddon et al., 1986), which participates in several pathways, including the canonical NF-κB pathway, Vpu-mediated degradation of CD4, IL12 signalling mediated by STAT4, IL23-mediated signalling events, downstream TCR signalling, Nef-Mediated CD4 down-regulation, ADP-ribosylation factor 1 pathway, binding and entry of HIV virion, translocation of ZAP-70 to immunological synapse, phosphorylation of CD3 and TCR zeta chains and generation of second messenger molecules pathways.In all of these pathways, HIV targets additional proteins apart from CD4.
Human papillomavirus (HPV) leads to cervical cancer (Schiffman and Castle, 2003).We grouped 10 types of HPV together (Table 2).HPV encodes for 8 or 9 proteins depending on HPV subtype.HPV proteins are reported to be involved in 243 PPIs that potentially perturb 153 signal transduction pathways.TGF-β receptor signalling is the most frequently targeted pathway by HPV. 17 proteins of this pathway have been reported to interact with HPV.Other receptors such as α 6 integrin and laminin 5 are also involved in HPV infection.α 6 integrin participates in a6b1 and a6b4 Integrin signalling and Integrin cell surface interactions pathways.Laminin 5 is involved in integrin cell surface interactions pathways.
Bovine papillomavirus (BPV) encodes 8 proteins that have been shown to undergo 29 interactions with human proteins.These are predicted to perturb 68 pathways.BPV has its maximum number of interactions with proteins from the Sumoylation by RanBP2 regulates transcriptional repression pathway.5 proteins of this pathway are targeted by BPV.
Epstein-Barr virus (EBV) belongs to the Herpesviridae family and causes mononucleosis.
The EBV strain GD1 genome encodes 30 proteins while EBV strain B95-8 codes for 73 proteins.Taken together, the proteins of the two strains have 145 interactions with human proteins, potentially affecting 121 signal transduction pathways.23 proteins of GTP hydrolysis and joining of the 60S ribosomal subunit pathway are targeted by EBV.This is the highest number of proteins of any pathway involved in interaction with EBV proteins.CR2 (Complement receptor type 2) and β1 family integrins act as receptors for EBV infection and these receptors participate in a number of different pathways.However, none of the other proteins in them are reported to be binding partners of an EBV protein.
Human adenovirus (HAdV), a virus with 27 proteins, causes respiratory diseases, including croup, bronchitis, pneumonias, keratoconjunctivitis, cystitis and gastroenteritis.97 interactions of HAdV proteins with human proteins have been reported and are involved in 106 pathways.Among the pathways studied, the TGF-β receptor signalling pathway has the maximum number of interactions with HAdV involving 10 proteins.Coxsackievirus and adenovirus receptor (CXADR) is the cellular receptor for HAdV types A, C, D, E and F, while CD80 and CD86 are receptors for type B and CD46 is the receptor for HAdV B2 and HAdV-3.CXADR appears in the PECAM1 interactions pathway, immunoregulatory interactions between a lymphoid and a non-lymphoid cell pathway, and the formation of platelet plug pathway.These pathways have additional proteins being targeted by other viruses but only CXADR is targeted by HAdV in all these pathways.CD80 and CD86 participate in the following pathways: canonical NF-κB, IL12 signalling mediated by STAT4 and downstream signalling in naive CD8+ T cells.Note that CD80 and CD86 are reported in VirusMINT to interact with HIV but not HAdV.
Human herpesvirus (HHV) encodes 73 proteins and 73 interactions with human proteins have been reported.HHV has maximum interactions with proteins involved in the regulation of telomerase pathway.15 proteins of this pathway are reported to interact with HHV.Besides heparan sulphate, TNFRSF14 (Tumour necrosis factor receptor superfamily member 14), PVRL1 & PVRL2 (Poliovirus-receptor-related proteins 1 & 2) and Integrin αV-β3 serve as receptors for different HHV subtypes [Note that VirusMINT does not include the interactions with these receptors.].Integrin αV participates in VEGFR1-specific signals, integrins in angiogenesis, osteopontin-mediated events, S1P3 pathway, PDGFRalpha signalling pathway, integrin cell surface interactions, S1P1 pathway and PECAM1 interactions pathway.These pathways include other virus-targeted proteins.
Simian virus (SV) encodes 6 proteins, which are known to interact with 47 human proteins across 121 pathways.SV40 is a tumour-causing virus.The TGF-β receptor signalling pathway is the most frequently SV-targeted pathway involving interactions with 7 proteins.
Vaccinia virus (VACV) with 254 proteins participates in 62 interactions with host proteins affecting 70 of analysed pathways.Proteins of canonical NF-κB pathway have the maximum number of interactions ( 5) with VACV proteins.
Influenza A (FLUAV) causes flu.Its genome encodes 11 proteins that undergo 12 interactions with human proteins affecting 57 pathways.The maximum number of interactions ( 5) is with proteins of the activation of the pre-replicative complex pathway.

Comparison of pathways across viruses
The above-mentioned analysis indicates that despite the differences in cell surface receptors recognised and required for entry, the different viruses subsequently may target overlapping signal transduction pathways.This is also supported by the relatively similar distribution of the numbers of pathways potentially affected by different viruses, shown in Figure 3.There are some significant outliers, however.The graph is sorted according to pathway interactions with HIV.The comparison of the pathways affected by HIV with all other viruses highlights that EBV interacts with proteins in pathways, which other viruses, including HIV, may not affect or for which the corresponding interactions have not yet been studied.Major pathways, which are potentially affected by EBV but not other viruses, are listed in the legend of Figure 3.Further work is required to identify if these outliers reflect a true difference between EBV and other viruses, or if the respective pathways have not yet been investigated for other viruses.

Analysis at the pathway and protein level
Table 3 lists the proteins that interact with at least 2 viruses and shows the number of pathways in which these proteins participate.If a particular protein is present in many pathways, then its interaction with viral proteins may disrupt multiple pathways simultaneously.Interaction of a viral protein with such human proteins may thus up-or down-regulate many pathways allowing the virus to subvert the host's cellular machinery for its purpose at a very broad scale.These proteins include proto-oncogenes such as tyrosine-protein kinases LCK, Fyn and Lyn, proteasome subunit β type-6, CREB-binding protein, histone acetyltransferase p300, paxillin and TATA-box-binding protein.Their study can thus be of pharmacological significance.Of the 93 proteins that are targeted by at least 2 proteins, SV interacts with 23 of these and the SV and HPV together have interactions with 17 (from 23) of these proteins.BPV also interacts with a large number (14) of proteins from the set of 93 proteins.BPV and HPV together interact with 13 (from 14) of these proteins.These high overlaps of proteins likely reflect conserved mechanisms of viral communication with their hosts.
Figure 4 gives the distribution of the number of pathways and number of proteins targeted by more than one virus.One can see that the number of proteins targeted by different viruses is smaller when compared with the number of pathways.This finding suggests that different viruses target a number of the same pathways by binding to different proteins.This implies that interactions may be conserved at the pathway level, rather than the protein level.It may be more critical for a virus to disrupt a particular pathway by any PPI that may do so, rather than restricting itself to a specific PPI.Pathway disruption may be conserved irrespective of the molecular mechanism of disruption.
The above-mentioned analysis indicates that some viral proteins interact more with several host proteins when compared with others.A prime example of this is the TGF-β receptor signalling pathway, which is targeted by the maximum number of viruses (18).One possible explanation is because with 212 proteins this pathway is large (see Figure 2 for the general trend).Even more proteins participate in the canonical NF-κB pathway (739), and this pathway is also frequently targeted (by 16 viruses).However, there are other large pathways that are not frequently targeted, for example the electron transport chain has 78 proteins, yet only one has been shown to interact with a viral protein.Thus, it is likely that the frequent targeting of some pathways is functionally significant.To further investigate this observation, we screened the proteins targeted by viruses (among the 413 proteins with Uniprot ID, 411 could be mapped to gene symbols) using the IPA system (see Section 2).The network analysis resulted in 25 enriched networks whose statistical tests were significant (p < 10 −5 , score > 5).Table 4 lists 12 most significant networks whose scores were at least 20.We merged networks 1 and 2, both of which had functions related to infectious disease.Figure 5 shows the combined molecular network.As can be seen in the figure, NFKBIA, NFKB1 and NFKB2 are centrally located in the network, providing quantitative support for the finding described earlier, that viruses often target the NF-κB pathway.For many viruses, the largest number of interactions is with the proteins in the canonical NF-κB pathway (see earlier).

Targeting of human hub proteins by viruses
NFKBIA, NFKB1 and NFKB2 are proteins that are generally referred to as hub proteins.Hub proteins are proteins that interact with many other proteins.According to the PINA database (Wu et al., 2009), these three proteins interact with 119, 192 and 195 other human proteins, respectively.To investigate if viruses generally target hub proteins, we first investigated HIV.A comparison between the degree of proteins randomly sampled when compared with those of the proteins interacting with HIV is shown in Figure 6.This comparison shows that especially for proteins with degree above 10 the difference becomes very large.Since there is no standard definition of hub in a network, we defined three groups: hubs, intermediate hubs and non-hubs by degree >20, 10 ≤ degree < 20 and degree < 10, respectively.We then investigated these groups with respect to all virus groups.The results are shown in Table 5.We found a clear inverse relationship between the percentage of proteins (hubs vs. non-hubs) in the pathways targeted by virus or not.This finding confirms that hub proteins are more likely targeted by viruses.

Drug targets in virus-targeted pathways
The 671 signal transduction pathways are composed of 3039 unique proteins including 710 drug targets (Table 1).Figure 7 gives a distribution of the known drug targets in the pathways along with the drug targets that have a viral interacting partner.HIV has the maximum interactions with 219 human proteins, followed by HPV.Of the 671 signal transduction pathways investigated, 355 may be targeted by at least one virus.These 355 pathways have 413 proteins that interact with 28 different viruses.Amongst the 413 targeted proteins, 95 are known drug targets with 59 of those interacting with HIV viral proteins.Amongst the 355 targeted pathways, there are 268 that have at least one protein targeted by more than one virus.These 268 pathways have 93 unique proteins with which the various virus proteins interact.
Thus, amongst all 671 pathways, there are 93 human proteins that interact with at least two pathogenic viruses.These 93 proteins are involved in a total of 774 interactions reported across the 268 pathways.We hypothesise that these 93 proteins may be potential drug targets for antiviral drugs against multiple viruses.According to the Drug Bank data, 23 proteins from these 93 proteins are known drug targets, i.e., there are ligands known to interact with these proteins.It would be interesting to test if these compounds have antiviral activity.MAV, HPV, HAdV, FLUAV, EBV, SV, VACV, ASV, RSV, FAdV, MPyV, HCV, HHV and HIV interact with the 23 drug targets from DrugBank.

Conclusion
In this paper, we have explored human signal transduction pathways that are potentially targeted by virus-human PPI across different viruses.We find an increase in the number of pathways targeted with the size of the pathway targeted, but some pathways are overrepresented in being a virus target.We find that viruses have the potential to target the same pathways, albeit not necessarily via the same PPIs, indicating commonalities in their hijack strategies.One such common strategy is to target hub proteins.It has previously been noted that the EBV targets high degree human proteins (Calderwood et al., 2007) and pathogens in general tend to interact with host proteins with high degrees and betweenness centrality (Dyer et al., 2008).Finally, we need to note that the validity of any global analysis on the protein interaction data is dependent on the quality of the experimental data.The interaction data, in general, are derived through different experiments and experimental conditions and the resulting sets of protein interactions are more likely to include false positive and negative pairs as well as sample bias.Because of this lack of gold standard data sets, any conclusion that is based on protein interaction data sets should be taken with a grain of salt.However, this type of global views has still the potential to lead us to valuable findings regarding the general phenomena of how organisms work and communicate.Distribution of the average number of viruses that target at least one protein in a pathway.Bin size was 3, until the number of proteins reached 123, which only includes 7 pathways.The minimum sum of residuals was found for polynomial fit at degree = 5, shown here    HIV proteins target human hub proteins.The number of proteins a protein is interacting with is referred to as degree.HIV interactions were retrieved from the NIAID database (Fu et al., 2009) but only those interactions that are more likely related to direct physical interactions are included in this analysis, referred to as 'group 1' (for details, see Tastan et al. ( 2009))

Description Counts
No. of proteins in all pathways 3039 No. of proteins in all pathways targeted by the 28 viruses studied 413 No. of proteins in all pathways targeted by HIV 219 No. of proteins in all pathways targeted by two or more viruses 93 No. of proteins in all pathways that are known drug targets 710 No. of proteins in all pathways that are known drug targets and are targeted by any virus 95 No. of proteins in all pathways that are known drug targets and are targeted by HIV

Figure 1 .
Figure 1.Global analysis of pathways potentially targeted by viruses.(A) Distribution of the number of pathways as a function of virus-interacting human proteins present in each pathway.The number of proteins in a pathway that are targeted by viruses varies for different pathways.This figure quantifies this variation in the number of pathways against the number of proteins that are targeted in those pathways.(B) Number of pathways being targeted by one or more viruses.This graph quantifies the variation in the extent to which different pathways are targeted by viruses Figure 2.

Figure 3 .
Figure 3.Variation in proteins targeted by HIV and EBV in pathways.The number of proteins that are targeted by different viruses in the pathways varies.HIV and EBV targeted pathways stand out.In particular, pathways labelled 'A' are Activation of the mRNA upon binding of the cap-binding complex and eIFs and subsequent binding to 43S, L13a-mediated translational silencing of Ceruloplasmin expression, Formation of a pool of free 40S subunits, Eukaryotic Translation Termination, Viral mRNA Translation, Translation initiation complex formation, Formation of the ternary complex and subsequently the 43S complex and Ribosomal scanning and start codon recognition.Pathways 'B' are GTP hydrolysis and Figure 4.

Figure 5 .
Figure 5.The merged molecular network for proteins targeted by viruses.Node shapes represent different molecule types (horizontal diamonds: peptidases; vertical diamonds: enzymes; circles: 'other').Solid lines denote protein-protein interactions and dashed lines denote regulation relationships.A few proteins such as NFKBIA, NFKB1 and NFKB2 are centrally located in the network Figure 6.

Figure 7 .
Figure 7. Distribution of drug targets and drug targets with virus interacting partners in pathways.Approximately half of the drug targets in the pathways appear to be targeted by viruses.However, some pathways have significant higher number of drug targets compared with drug targets with reported virus interactions namely, Pathway 'A' Canonical NF-kappaB pathway, Pathway 'B' HIF-1-alpha transcription factor network, Pathway 'C' Vif-mediated degradation of APOBEC3G, Pathway 'D' Electron Transport Chain and Pathway 'E' Signalling mediated by p38-alpha and p38-beta Biol Drug Des.Author manuscript; available in PMC 2012 July 29.Biol Drug Des.Author manuscript; available in PMC 2012 July 29.
Zhongming Zhao received his PhD in Human and Molecular Genetics from the University of Texas Health Science Center at Houston and MD Anderson Cancer Center, Houston, Texas, in 2000.Currently, he is an Associate Professor in the Departments of Biomedical Informatics, Psychiatry, and Cancer Biology at the Vanderbilt University Medical Center.His research interests include bioinformatics and systems biology approaches to studying complex diseases, genome-wide or large-scale analysis of genetic variation and methylation patterns, next-generation sequencing data analysis, comparative genomics and biomedical informatics.Junfeng Xia received his PhD in Bioinformatics from the University of Science and Technology of China in 2010.He is currently a Postdoctoral Researcher in the Department of Biomedical Informatics, Vanderbilt University School of Medicine.His research interests include computational biology, intelligent computing and development and application of bioinformatics methods and algorithms in genomic data sets for complex disease studies.Oznur Tastan obtained a BSc in Biological Sciences and Bioengineering from Sabanci University, Turkey, in 2004.She received an MSc from the Language Technologies Institute, School of Computer Science at Carnegie Mellon University in 2007.She was a PhD student in the same programme.She is currently a Postdoctoral Researcher with Microsoft Research, New England.Her interests are in machine learning, computational biology and systems biology.Irtisha Singh received her MSc in Computational Biology from Carnegie Mellon University in 2010.She is currently working as a Bioinformatics Analyst at the Department of Biomedical Informatics at the University of Pittsburgh School of Medicine.Her research includes systems biology, application of bioinformatics methods for drug discovery against HIV and analysis of whole-genome and microRNA array cancer datasets.Meghana Kshirsagar received her MSc in Computer Science from the Indian Institute of Technology, Bombay.She is currently a Master's student at the Language Technologies Institute at Carnegie Mellon University and is working on Computational Proteomics.Before joining Carnegie Mellon University, she was working at Yahoo! Labs in the Search Sciences group.Her research interests include machine learning and computational biology.Jaime Carbonell is the Director of the Language Technologies Institute and Allen Newell Professor of Computer Science at Carnegie Mellon University.He received BS in Physics and Mathematics from Massachusetts Institute of Technology, and MS and PhD in Computer Science from Yale University.His current research includes machine learning, computational proteomics, data mining, text mining and machine translation.He recently invented Proactive Machine Learning, including underlying decision-theoretic framework.He has served on multiple governmental advisory committees and is the Chairman of the Carnegie Speech Company.Judith Klein-Seetharaman received her PhD in Chemistry from the Massachusetts Institute of Technology in 2000.She is currently Associate Professor at the Department of Structural Biology at the University of Pittsburgh School of Medicine with secondary faculty appointments at the Language Technologies Institute, School of Computer Science, Carnegie Mellon University and Institute for Structural Biology, Research Center Jülich, Germany.Her research is on structure, function and interactions of proteins.Her major research activities include the Biological Language Modelling Project (www.cs.cmu.edu/~blmt), the Data Integration and Bioinformatics Core of the Pittsburgh Center for HIV Protein Interactions (www.hivppi.pitt.edu)and SHIPREC, the Salmonella Host Interactions Project European Consortium (www.erasysbio.net/index.php?index=279).

Table 2
Taxon grouping and summary of data.The table details the grouping of different strains of viruses and lists statistics on their interactions and the pathways they are linked to.Column 1: Taxon.The abbreviation indicated in brackets is used throughout the text.Column 2: Strains grouped together as VirusMINT with human proteins for each strain in the grouped taxons.Column 4: Total number of interactions for the taxon group and the number of human proteins involved in the interactions, given in brackets.Column 5: Number of pathways each virus taxon group is linked to by way of PPI.Column 6: Percentage of the human proteins present in pathways that are targeted by the virus in at least one pathway

Table 3
List of proteins most frequently targeted by more than two viruses.Column 1: Protein name.Column 2: Uniprot id of the protein.Column 3: Indicates (yes/no) whether the protein is listed as drug target in DrugBank.Column 4: Number of pathways in which the protein participates.Column 5: Number of viruses that interact with the protein.Column 6: List of the viruses targeting the protein via PPI