GENSCAN 1.0 Date run: 2-Nov-116 Time: 21:33:10 Sequence gi568815596r:179845132_180088649 : 243518 bp : 37.27% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 4060 4185 126 0 0 63 38 123 0.383 2.10 1.02 PlyA + 4200 4205 6 1.05 2.07 PlyA - 5114 5109 6 1.05 2.06 Term - 5496 5228 269 1 2 81 48 117 0.328 1.67 2.05 Intr - 7474 7339 136 1 1 78 78 71 0.377 4.32 2.04 Intr - 16101 16033 69 0 0 26 49 149 0.002 3.26 2.03 Intr - 51630 51526 105 0 0 96 42 72 0.285 2.89 2.02 Intr - 55505 55413 93 2 0 31 100 58 0.053 0.54 2.01 Init - 57113 56967 147 2 0 83 25 92 0.075 2.54 2.00 Prom - 57315 57276 40 -3.65 3.00 Prom + 61107 61146 40 -3.75 3.01 Init + 84464 84514 51 1 0 68 90 31 0.428 2.23 3.02 Term + 84891 85016 126 2 0 97 44 123 0.879 6.10 3.03 PlyA + 85765 85770 6 1.05 4.00 Prom + 91263 91302 40 -2.75 4.01 Init + 93452 93522 71 1 2 96 91 76 0.787 9.27 4.02 Term + 95978 96008 31 2 1 67 38 51 0.184 -5.65 4.03 PlyA + 96483 96488 6 1.05 5.14 PlyA - 97258 97253 6 1.05 5.13 Term - 100584 99998 587 1 2 63 31 326 0.999 18.09 5.12 Intr - 105601 105381 221 0 2 -2 72 231 0.774 9.52 5.11 Intr - 105795 105694 102 2 0 109 83 63 0.771 6.27 5.10 Intr - 107467 107340 128 1 2 98 80 68 0.994 5.56 5.09 Intr - 109226 109074 153 2 0 59 84 75 0.719 3.55 5.08 Intr - 113951 113891 61 1 1 59 45 51 0.087 -4.28 5.07 Intr - 119497 119416 82 2 1 93 116 49 0.981 5.98 5.06 Intr - 120851 120747 105 0 0 90 92 253 0.999 25.17 5.05 Intr - 125432 125370 63 0 0 62 95 69 0.872 2.77 5.04 Intr - 125725 125519 207 2 0 79 83 296 0.994 26.33 5.03 Intr - 125945 125810 136 2 1 46 115 57 0.846 3.42 5.02 Intr - 128671 128503 169 0 1 79 71 137 0.854 10.13 5.01 Init - 133171 133059 113 1 2 47 115 31 0.936 1.43 5.00 Prom - 136008 135969 40 -4.75 6.04 PlyA - 136083 136078 6 1.05 6.03 Term - 136866 136587 280 2 1 42 44 361 0.984 21.03 6.02 Intr - 141674 141564 111 1 0 49 61 108 0.786 2.78 6.01 Init - 146082 145751 332 0 2 57 14 142 0.134 0.58 6.00 Prom - 148126 148087 40 -7.55 7.00 Prom + 148700 148739 40 -7.05 7.01 Init + 161581 161683 103 0 1 74 82 83 0.612 6.75 7.02 Intr + 164948 165129 182 0 2 54 100 117 0.618 8.17 7.03 Term + 165986 166048 63 1 0 104 44 74 0.928 1.51 7.04 PlyA + 166743 166748 6 1.05 8.06 PlyA - 167011 167006 6 1.05 8.05 Term - 168845 168774 72 2 0 131 43 23 0.559 -0.97 8.04 Intr - 173829 173746 84 0 0 68 98 66 0.829 4.70 8.03 Intr - 175716 175617 100 2 1 65 98 24 0.044 0.29 8.02 Intr - 215579 215418 162 1 0 83 37 81 0.054 0.67 8.01 Intr - 225977 225902 76 0 1 102 82 76 0.534 6.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 17706 17839 134 2 2 72 111 109 0.844 11.26 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:179845132_180088649|GENSCAN_predicted_peptide_1|41_aa HHSGFVVIQENTMQTKRTKGLQKTFEGKRGYELQNLRLTEP >gi568815596r:179845132_180088649|GENSCAN_predicted_CDS_1|126_bp catcattctggcttcgtggttattcaggagaataccatgcaaacgaaaaggacaaaagga cttcagaaaacctttgaagggaaacgaggttatgaattacagaatttaagactgactgag ccttaa >gi568815596r:179845132_180088649|GENSCAN_predicted_peptide_2|272_aa MDEAGSHNSQQTNTGMENQTPYVLTYKWELKNENTWTQGRKHHTLGPVGVHQEVLQKKAL IQGITASCGVLPLKVFQWDKHKQLDEIVGSDSGLIFFCGDEIVSWVVICLSIKKNQIAGE RLREAPVQPWRDRSNHRSVLVHDEFPCQREPLDACRSIYVSSLMVYGFQGPNLTLQNLKE VVGARGQTKHVPPHPVPNKKALNRHGSCLQTLEGMIRGQGSRFIFPKFQQAQPDSVGGII GRQISTQYEKELSDDSLSSNGMAALKFHAPSS >gi568815596r:179845132_180088649|GENSCAN_predicted_CDS_2|819_bp atggatgaagctggaagccataattctcagcaaactaacacaggaatggaaaaccaaaca ccatatgttctcacttacaagtgggagttgaagaatgagaacacatggacacaaggaagg aaacatcacacactggggcctgtcggggttcatcaggaggtattgcagaagaaggcattg atacaagggataacagcttcatgtggggtattgcccctgaaagtcttccagtgggacaag cataaacagttggatgaaatagttggatcagattcaggtttaatttttttttgtggggac gagattgtttcatgggtggtgatatgtttatccatcaagaagaaccagatcgcgggagaa cgtctgcgcgaagccccggtgcagccctggcgggaccgcagcaaccaccggagcgtttta gtccacgatgagtttccatgccaacgtgaaccactggacgcctgccgcagtatctatgtg agctcactcatggtatatggattccagggcccaaacctaaccctacaaaatctaaaagag gtggtaggagcaagaggacagacaaagcatgtgcctcctcatccggttccgaataagaag gctttgaacagacatggtagttgcctgcaaacacttgaagggatgatacgtggacaagga agtagattcatttttccaaagttccagcaggcacaaccagattcagtgggtgggatcata gggaggcaaatttcaactcaatatgagaaagaactttcagatgattcactgtccagtaat ggtatggctgccttgaagtttcatgctccctctagctga >gi568815596r:179845132_180088649|GENSCAN_predicted_peptide_3|58_aa MISFDAMSCIWVTLMQEAQHHMEAAKAWGFHPLKPQPELYIGPFQPWLKWLEHRAPGP >gi568815596r:179845132_180088649|GENSCAN_predicted_CDS_3|177_bp atgatctcctttgatgccatgtcttgcatctgggtcacactgatgcaagaggctcaacat cacatggaagctgccaaggcttggggcttccaccctctgaagccacagcctgagctctac attggcccctttcagccatggctgaagtggctggaacacagggcaccaggtccctag >gi568815596r:179845132_180088649|GENSCAN_predicted_peptide_4|33_aa MDAASGHYPKQITGTETKYHISRSKYTDMHTSV >gi568815596r:179845132_180088649|GENSCAN_predicted_CDS_4|102_bp atggatgcagccagtggccattatcctaagcagataacaggaacagaaaccaaataccac atttctcggtctaaatacacggacatgcacacctctgtgtga >gi568815596r:179845132_180088649|GENSCAN_predicted_peptide_5|708_aa MSWEALKKSINGLINKVNISNISIIIQELLQENIVRGRGLLSRSVLQAQSASPIFTHVYA ALVAIINSKFPQIGELILKRLILNFRKGYRRNDKAHEVLCLEMLTLLLERPTDDSVEVAI GFLKECGLKLTQVSPRGINAIFERLRNILHESEIDKRVQYMIEVMFAVRKDGFKDHPIIL EGLDLVEEDDQFTHMLPLEDDYNPEDVLNVFKMDPNFMENEEKYKAIKKEILDEGDTDSN TDQDAGSSEEDEEEEEEEGEEDEEGQKVTIHDKTEINLVSFRRTIYLAIQSSLDFEECAH KLLKMEFPESQTRFCMLKKEYMESFEGIFKEQYDTIHRLETNKLRNVAKMFAHLLYTDSL PWSVLECIKLSEETTTSSSRIFVKIFFQELCEYMGLPKLNARLKDETLQPFFEGLLPRDN PRNTRFAINFFTSIGLGGLTDELREHLKNTPKVIVAQKPDVEQNKSSPSSSSSASSSSES DSSDSDSDSSDSSSESSSEESDSSSISSHSSASANDVRKKGHGKTRSKEVDKLIRNQQTN DRKQKERRQEHGHQETRTERERRSEKHRDQNSSGSNWRDPITKYTSDKDVPSERNNYSRV ANDRDQEMHIDLENKHGDPKKKRGERRNSFSENEKHTHRIKDSENFRRKDRSKSKEMNRK HSGSRSDEDRYQNGAERRWEKSSRYSEQSRESKKNQDRRREKSPAKQK >gi568815596r:179845132_180088649|GENSCAN_predicted_CDS_5|2127_bp atgagttgggaggccctgaagaagtcaattaatggccttatcaacaaagtcaacatttcc aacataagtattattattcaagagcttcttcaagaaaatatagttagaggaagaggactg ctgtccaggtctgttttgcaagcacagagtgcttctccaatcttcacccatgtttatgca gcattagtggcaattatcaactcaaaatttccacaaattggagaattaatcctcaaaagg ttaattcttaattttcgaaaaggctatcgaagaaatgacaaggcacacgaagtattatgc ttagagatgctcactttgctcctggaaagaccaacagatgatagcgttgaagtagctatt ggttttcttaaggaatgtggcctcaaattaacacaagtgtcaccaagaggaatcaatgct atatttgaacgccttcgaaacattctgcatgagtctgaaattgacaaaagagttcaatat atgattgaagtgatgtttgctgtacggaaagatggattcaaggaccaccccattatccta gaaggtcttgatttggtggaagaagatgatcaattcactcatatgctccctctggaggat gactataatccagaagatgttcttaatgttttcaagatggatcctaattttatggagaat gaagagaagtacaaagctattaagaaagaaattcttgatgagggagatactgactcgaac acagaccaggatgctgggagtagtgaagaggacgaggaagaagaagaggaagagggagaa gaagatgaagaaggacaaaaagtaactattcatgacaaaacagaaattaacctggtctca tttcgtcgtacaatttatcttgctattcagtcaagtttagattttgaagaatgtgctcac aaattgctgaaaatggagtttcctgaaagccaaacacgattttgcatgctaaagaaagag tacatggaatcctttgaaggtatattcaaagaacagtatgataccatccatcgcttggaa acaaacaagttgcgaaatgttgctaagatgtttgctcaccttttatacactgattcactt ccatggagtgttcttgaatgtataaaactgagtgaagaaaccactacatcatccagtaga atttttgtcaaaatatttttccaggaactgtgtgaatacatgggtcttcctaaacttaat gcaagattaaaggatgaaactctgcagccattctttgaaggattattaccccgagataat ccaagaaacactcggtttgccatcaacttctttacttctataggtcttggaggtttaacg gatgaactgcgggagcatctcaaaaatacaccaaaggtcattgtggcgcagaaaccagat gttgagcaaaataaatcctccccatcctcttcctcttcagcgtcctcctcttcagagtct gactcatccgactctgattctgacagcagtgatagcagttcagagtcttccagtgaagag agcgactcttcatccatcagtagtcatagctctgcctcagctaatgatgtaagaaagaag ggacatgggaagaccagaagtaaagaggtagataaattgatcagaaaccagcaaacaaat gataggaaacaaaaagaaagaagacaagaacacgggcaccaggaaacaaggactgagaga gaaagaaggtcagaaaaacacagagatcaaaattcaagtggttcaaattggagagatcct ataacaaagtacacatcagacaaagatgttccttctgaacgaaataactacagtagagtt gcgaatgacagagaccaagaaatgcatatagatttggaaaataagcatggtgatcccaaa aagaagagaggagagagaagaaattctttttctgaaaatgagaagcatacacaccgaatt aaagacagtgaaaatttcagaagaaaagatagatcaaagtcaaaggaaatgaatagaaag cactcaggctcaagaagtgatgaagatagatatcaaaatggtgccgagagacgatgggaa aaatctagcagatactctgaacaatccagagaatcaaagaaaaatcaggaccggcgaaga gaaaagtctccagcaaaacaaaaataa >gi568815596r:179845132_180088649|GENSCAN_predicted_peptide_6|240_aa MKATGLRLLVPLVSSYCSPYIVGDYELLHDVTIFVLLHEQFRKILKGNSDCIVFSTTALT YNLNSVIKEIGKFRFLIFDSYLKVILNLVKSYSSTKAKFKSPFSHFHPLALYEEQERSPR DRDYFDYSRSDYEHSRRGRSYDSSMESRNRDREKRRERERDTDRKRSRKSPSPGRRNPET SVTQSSSAQDEPATKKKKDELDPLLTRTGGAYIPPAKLRMMQEQITDKNRHVYINQYTKP >gi568815596r:179845132_180088649|GENSCAN_predicted_CDS_6|723_bp atgaaagccactggactcaggcttcttgttccccttgtctcctcgtactgctctccctac attgttggagactatgagttactacatgatgtcaccatatttgttttgcttcatgaacaa tttcgaaaaattttaaaaggtaattctgactgtatagtattttcaactactgcactgact tacaatctaaactctgtgataaaagaaatagggaaattcaggttcttaatatttgattca tacctgaaagtgattcttaacttggtgaaatcctattcatctaccaaagccaaattcaag tccccgttttcccactttcatccgttagcactatatgaagaacaagaacgatccccccgg gatagagattactttgattacagcagatcagactatgagcattcaagaagaggacgttct tatgatagtagcatggagtcacgaaacagggaccgagaaaaacgcagagaaagagaaaga gatacggatcggaaaaggtctcggaaatccccatctcctgggaggagaaacccagaaaca tcagtaactcagagttcctctgctcaggatgaacctgctacaaagaaaaagaaagatgag ctggatcctcttcttactcgcactggtggagcatatattccccctgcaaagctcaggatg atgcaggaacagattacagataaaaacaggcatgtttatatcaatcagtatacaaagcca tga >gi568815596r:179845132_180088649|GENSCAN_predicted_peptide_7|115_aa MDSRGSKYKTQCTRNNTAPALVQDFRQLQELHKEGTLHPERQYLILAEYCLLADTLTESS AGHSSALLGHQLLGMQHVIHPVEPMVTGLYHLCYELIDQDHTSAPYGHQYELKEY >gi568815596r:179845132_180088649|GENSCAN_predicted_CDS_7|348_bp atggactccagaggctctaagtataaaacgcagtgcacaagaaacaacacagcaccagcg ttggtgcaggatttcagacagctccaggaacttcacaaggaagggaccctgcatcctgag agacagtacctcattcttgcagaatattgtcttctagctgatactttaactgaatcttca gcaggtcattccagtgctctattaggccaccagcttctgggcatgcagcatgtgatacat ccagtggaacccatggtcacaggcttgtaccacctttgctatgaacttattgatcaagac cacacctctgctccatatggtcaccaatatgagctgaaggagtattga >gi568815596r:179845132_180088649|GENSCAN_predicted_peptide_8|164_aa XHWNSNLVAQELDVRTGPKGSFEGQRFNCLMAHAGGEPGPWLGSMGSPPPQSPQFYGRTH LEVIHTVRSSPGKSIPSISTGILLLMSQLVYTMCVHPVALFVKSYGDVMPHFTGDVTTSV TVSVQPDIIPSILGDVTPNFTVLLSNTLLTPLFLSQYLLLRGSN >gi568815596r:179845132_180088649|GENSCAN_predicted_CDS_8|495_bp ngacactggaactccaatcttgtagctcaggaacttgatgtgaggactggtcctaagggc agttttgaggggcaaagattcaactgtctcatggctcatgctgggggagaacccggtcca tggttgggatccatgggttcccccccgccccagtctccccaattctatggtcgtacacat cttgaggtcatccacacggttcgttcatctcctggcaaaagcataccctcaatctccacg gggatattactcctaatgtcacagttggtgtacaccatgtgtgttcaccctgtggcatta tttgtaaaatcctacggggatgttatgcctcatttcacaggggatgttactactagcgtc actgtgagtgtacaacctgatattattcctagtatcttaggggatgttactcctaatttc acagtattactttctaatacacttttgacacccctgtttctatctcagtatctgcttctt agaggatccaattga