Question

Part 1: Smith-Waterman Algorithm Instructions: 1. Copy the "STS protein query sequence" from the week 3 links page (This is the one letter code for the protein encoded by the Resveratrol

synthase gene, the gene that catalyzes the final step in the resveratrol synthesis pathway). Be sure to include the ">STS query" on the first line, or the program won't accept it. 2. Go to https://www.ebi.ac.uk/Tools/sss/ and choose SSEARCH, then "protein". This will do a Smith-Waterman local alignment on your protein sequence. 3. Choose the UniProtKB/TrEMBL database (under Step 1 on the page) 4. Paste the STS protein query sequence into the paste window 5. Choose SSEARCH under step 3, then click "More options". Here you will find a number of parameters including the substitution matrix. Search UniProt using three different substitution matrices: • BLOSUM 50 • PAM 120 • PAM 250 Be patient, the calculation will take a while. Questions: 1. What is the name and score of the best hit for each matrix? 2. What is the e-value of the best hit for each matrix? 3. Why do you think the results may have changed? 4. What do e-values mean and how do we interpret them?

Fig: 1