From 1ebd3509c68140dce2759e97c8b02d0f8713fa14 Mon Sep 17 00:00:00 2001 From: Marc Gouw <marc.r.gouw@gmail.com> Date: Wed, 9 Nov 2016 13:39:50 +0100 Subject: [PATCH] Added ELM material. --- README.md | 2 +- TeachingMaterials/elm.md | 114 +++++++++++++++++++++++++++++++++++++++ 2 files changed, 115 insertions(+), 1 deletion(-) create mode 100644 TeachingMaterials/elm.md diff --git a/README.md b/README.md index ce8c258..14648ab 100644 --- a/README.md +++ b/README.md @@ -53,7 +53,7 @@ - [TMHMM](http://www.cbs.dtu.dk/services/TMHMM/) - [IUPRED](http://iupred.enzim.hu/) and [Anchor](http://anchor.enzim.hu/) 5. Intrinsically disordered region: Marc - - ELM + - [ELM](https://git.embl.de/sharan/protein-bioinformatics-nov-2016/blob/master/TeachingMaterials/elm.md) - [DisProt](https://git.embl.de/sharan/protein-bioinformatics-nov-2016/blob/master/TeachingMaterials/disprot.md) diff --git a/TeachingMaterials/elm.md b/TeachingMaterials/elm.md new file mode 100644 index 0000000..9b86c82 --- /dev/null +++ b/TeachingMaterials/elm.md @@ -0,0 +1,114 @@ +# Short Linear Motifs + +This text was largely adapted from a [tutorial written by Holger +Dinkel][elm_tutorial] for the [EMBO Practical Course on computational analysis +of protein-protein interactions][embo_course_ppi] + +[elm_tutorial]: http://aidanbudd.github.io/course_EMBO_at_TGAC_PPI_Sep2015/trainingMaterial/holgerDinkel/linear_motifs/ +[embo_course_ppi]: http://aidanbudd.github.io/course_EMBO_at_TGAC_PPI_Sep2015// + +## Eukaryotic Linear Motifs + +Eukaryotic Linear Motifs (or ELMS) sometimes also known as short Linear Motifs +(SLiMs) are short sequences typically found in disordered regions that have +important roles in the function of a protein. + +## The ELM database + +The [ELM database][elm] is a project who's ultimate goal it is to all occurences +of ELMs and their function in all known proteins(!). + +It consists of manually annotated entries carefully curated by experts in a +particular field, working in a certain protein, or a particular motif. These +annotators are responsible for contributing ELM **classes**, which represent +linear motifs with a known function, and experimentally verified **instances** +of this motif. + +- **types** There are 6 types of motifs: LIG: ligand binding, MOD: + modification, TRG: targeting, DOC: docking, DEG: degradation, CLV: cleavage. + +- **class** is a sequence of amino acids with a given function, based on + binding partner, modifying enzyme, acting peptidase and targeted subcellular + localisation. Each **class** is defined by a **regular expression** + +- **instance** an manually annotated occurrence of a **class** in a protein, + verified by a literature citable experiment. + +[elm]: http://elm.eu.org + +## Browsing content + +## Exercise 1: Browsing content + +There are two main ways in which the ELM database content can be browsed. + +Click on "ELM DB" -> "ELM Classes", or follow the link to the ELM classes page: +http://elm.eu.org/elms to browse the ELM **classes** that have been annotated. + +Use the search (or side filters) to find the ELM motif: **DOC_CYCLIN_1** + +- **Question 1:** What does this motif do? +- **Question 2:** How many instances are annotated in the database? +- **Question 3:** Which Gene Ontology terms is this motif associated with? + +This motif was identified in P53 in the sequence: **KKLMF** + +- **Question 4:** What is the starting and finishing position of this sequence + in P53? +- **Question 5:** Which experimental protocols were used to infer the existence + of this instance? +- **Question 6:** How certain are we about this annotation? +- **Question 7:** What activates P53 in the pathway to induce apoptosis? + +## Exercise 2: The ELM Prediction tool + +Navigate to the "ELM predictions" page. + +Search protein **SRC_HUMAN** (accession P12931) for ELMs using the following parameters: + +- Cell Compartment: Not specified +- Motif Probability Cutoff: 100 +- Context information: (leave blank) + +- **Question 1:** How many instances do you find? +- **Question 2:** What can you say about the globularity of the protein? Does + it have globular and/or disordered regions? + +Redo the above search, this time using the following parameters: + +- Cell Compartment: cytosol +- Motif Probability Cutoff: 0.01 +- Context information: Homo sapiens + +- **Question 3:** How many instances do you find now? +- **Question 4:** How many of the instances are manually annotated? +- **Question 5:** Do the structural predictors/filters (SMART, GlobPlot, + IUPRED, Secondary Structure) agree in terms of which regions are +structured/disordered? +- **Question 6:** Compare the location of the annotated instances with + structural information at hand (IUPRED, Secondary Structure). +- **Question 7:** How many deteced instances were removed by the + SMART/Structure filter? +- **Question 8:** For the annotated instances, which of the ELM classes require + a phosphorylation at a certain residue of the motif? (Hint: This information +can be found in the description of the ELM class) +- **Question 9:** Which residue in SRC_HUMAN corresponds to this and can you + find evidence for a phosphorylation of this residue (using Phospho.ELM)? + + +## Exercise 3: The ELM Prediction tool + +Search ELM using the protein name **MDM4_HUMAN** and look for the ‘USP binding motif’ **DOC_USP7_MATH_1** + +- **Question 1:** How many such motif instances are found in this protein sequence? +- **Question 2:** How many of these have been exprimentally validated (i.e., are manually annotated?), and what are the "FP" annotations? + +## Exercise 4: Switches + +Use the ELM "global search box" (on the top right) to search for the class +**LIG_SH3_2**. (Just start typing, and wait for the autocomplete to finish). + +Click on "LIG_SH3_2" to visit the class page. + +- **Question 1:** How many switches are annotated for this class? +- **Question 2:** What is the mechanism that results in the switching event in **SYNJ2_RAT**? -- GitLab