GENSCAN 1.0 Date run: 5-Nov-116 Time: 17:08:17 Sequence gi568815581r:35883804_36086649 : 202846 bp : 44.07% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 16735 16783 49 0 1 90 94 57 0.672 5.93 1.02 Term + 17227 17237 11 2 2 124 45 1 0.588 -2.24 1.03 PlyA + 19674 19679 6 1.05 2.10 PlyA - 19833 19828 6 1.05 2.09 Term - 25199 24927 273 2 0 49 47 147 0.542 2.17 2.08 Intr - 25935 25655 281 1 2 33 105 114 0.647 4.80 2.07 Intr - 26429 26266 164 1 2 47 76 95 0.308 3.82 2.06 Intr - 31464 31346 119 0 2 59 50 88 0.271 1.36 2.05 Intr - 36469 36360 110 2 2 55 44 102 0.406 2.50 2.04 Intr - 40969 40801 169 1 1 39 66 93 0.800 1.72 2.03 Intr - 41834 41712 123 2 0 65 92 81 0.991 6.98 2.02 Intr - 46452 46273 180 0 0 74 98 83 0.967 7.96 2.01 Init - 46924 46829 96 1 0 91 58 96 0.970 7.21 2.00 Prom - 47007 46968 40 -7.66 3.00 Prom + 48101 48140 40 -3.86 3.01 Init + 48348 48366 19 2 1 81 117 6 0.255 3.24 3.02 Intr + 51002 51132 131 0 2 73 97 13 0.275 1.11 3.03 Term + 52551 52688 138 2 0 85 48 131 0.336 6.76 3.04 PlyA + 52720 52725 6 1.05 4.05 PlyA - 52784 52779 6 -1.95 4.04 Term - 53030 52948 83 0 2 129 55 55 0.641 4.16 4.03 Intr - 54113 53955 159 0 0 97 47 182 0.904 14.96 4.02 Intr - 55554 55415 140 2 2 80 74 145 0.422 12.41 4.01 Init - 69403 69258 146 1 2 79 98 123 0.807 11.99 4.00 Prom - 71337 71298 40 -3.26 5.00 Prom + 77527 77566 40 -6.36 5.01 Init + 77782 77859 78 0 0 83 39 93 0.687 4.96 5.02 Intr + 88019 88056 38 1 2 77 67 70 0.540 0.86 5.03 Term + 89706 89778 73 0 1 111 41 92 0.858 4.18 5.04 PlyA + 93535 93540 6 1.05 6.04 PlyA - 93642 93637 6 1.05 6.03 Term - 93928 93763 166 0 1 95 45 220 0.994 15.79 6.02 Intr - 94481 94340 142 0 1 49 77 73 0.586 1.71 6.01 Init - 97617 97542 76 0 1 67 89 118 0.911 9.07 6.00 Prom - 97886 97847 40 -5.46 7.04 PlyA - 98318 98313 6 1.05 7.03 Term - 100085 99998 88 1 1 120 48 138 0.996 10.13 7.02 Intr - 100649 100535 115 0 1 97 25 129 0.912 7.01 7.01 Init - 102846 102768 79 0 1 76 94 127 0.871 13.36 7.00 Prom - 107590 107551 40 -3.86 8.05 PlyA - 107864 107859 6 1.05 8.04 Term - 114057 113964 94 2 1 119 38 65 0.966 1.90 8.03 Intr - 114588 114477 112 1 1 88 103 84 0.973 9.34 8.02 Intr - 115122 115063 60 1 0 94 98 11 0.712 1.41 8.01 Init - 117689 117614 76 2 1 65 101 119 0.970 10.19 8.00 Prom - 118347 118308 40 -4.66 9.08 PlyA - 119667 119662 6 1.05 9.07 Term - 124612 124401 212 2 2 96 44 129 0.972 6.76 9.06 Intr - 124800 124722 79 0 1 79 93 34 0.411 2.12 9.05 Intr - 125828 125787 42 2 0 104 63 31 0.157 0.64 9.04 Intr - 129505 129448 58 0 1 113 41 48 0.248 1.49 9.03 Intr - 130106 129941 166 0 1 102 53 154 0.932 12.32 9.02 Intr - 130590 130531 60 1 0 116 98 30 0.968 5.51 9.01 Init - 134094 134019 76 0 1 65 113 104 0.962 9.94 9.00 Prom - 135576 135537 40 -7.26 10.03 PlyA - 135961 135956 6 1.05 10.02 Term - 137102 136195 908 0 2 -29 45 349 0.544 11.36 10.01 Init - 138857 138353 505 2 1 74 0 254 0.428 10.35 10.00 Prom - 140667 140628 40 -5.16 11.00 Prom + 145904 145943 40 -6.16 11.01 Init + 146145 146184 40 2 1 38 116 24 0.077 0.55 11.02 Intr + 153061 153136 76 2 1 84 72 29 0.040 -0.53 11.03 Intr + 157061 157161 101 2 2 44 95 50 0.089 1.05 11.04 Intr + 163198 163366 169 2 1 24 86 114 0.170 3.80 11.05 Intr + 163546 163713 168 1 0 54 68 112 0.135 4.86 11.06 Intr + 180536 180606 71 2 2 87 113 59 0.000 7.13 11.07 Intr + 186644 186755 112 0 1 108 103 76 0.986 10.54 11.08 Term + 187148 187238 91 2 1 128 55 146 0.999 12.49 11.09 PlyA + 188206 188211 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 180540 180606 67 2 1 86 113 127 0.973 14.26 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:35883804_36086649|GENSCAN_predicted_peptide_1|19_aa MPHPLHDALHPEATLGAAF >gi568815581r:35883804_36086649|GENSCAN_predicted_CDS_1|60_bp atgccccaccctctgcatgacgccctgcatcctgaagccaccctgggagctgctttttga >gi568815581r:35883804_36086649|GENSCAN_predicted_peptide_2|504_aa MAELVPFAVPIESDKTLLVWELSSGPTAEALHHSLFTAFSQFGLLYSVRVFPNAAVAHPG FYAVIKFYSARAAHRAQKACDRKQLFQKSPVKVRLGTRHKAVQHQALALNSSKCQELANY YFGFNGCSKRIIKLQELSDLEERENEDSMVPLPKQSLKFFCALEVVLPSCDCRSPGIGLV EEPMDKVEEESGKIAVEYRPSEDIVGVRCEEELHGLIQVCEDKNSGQFQHLKDQQEMIIQ QLNTPENDELPPVPQEPTTQSPAQTLAPSGSGTLSNSAKLSSSDSIPPMEAEPSPNQQEA TVQASEPPKNIELSSQQMVPENIFPPTMENSNQLPEPPTEVVAQLPPRYEVTIPTQGQDQ AQLSTLASVTLQPLDLGFIITPESTTEIELSPTMQETPTQPPKEFVPQPPVYQEHPEMTH PPPDKNQAQHPNLTQFTVQSLDLELTITTEPTTEVKTSPTMEETSTQPSDLGFAIVPELT IETEHSTGLDKTTAPHPDQVQTQH >gi568815581r:35883804_36086649|GENSCAN_predicted_CDS_2|1515_bp atggcggagttggtaccttttgcggttcccatcgagagtgacaaaaccttgctagtgtgg gagctgagctccggacccacggccgaggctttgcatcattctctgttcacagcattttct cagtttggccttctgtattcagtccgggtcttcccaaatgctgcagtggcccatcctggt ttctatgccgtcattaagttttattctgcaagggctgcccacagagcccaaaaggcatgc gaccggaagcagctttttcagaaatctccagtcaaggttcgtcttggcaccagacataag gcagttcaacatcaagcccttgccctgaacagttccaaatgccaagaactggcgaattac tactttggtttcaatgggtgttccaaaaggatcatcaagcttcaggagctttctgacctt gaagaaagggaaaatgaagatagcatggtgccacttccgaagcaaagcctgaagttcttc tgtgctttagaagtggtgttgccatcctgtgattgcaggagtcctggcattggcttggtg gaggagcctatggataaggtggaggaagaaagtggtaaaatagctgtggagtacagaccc agtgaagacatcgtaggtgtcagatgcgaagaagaactacacggtttaattcaagtatgt gaagataaaaactcagggcagtttcagcatttgaaagaccagcaagaaatgattattcag cagctaaatacccctgaaaatgatgaacttcctccagtccctcaagagcccacaactcag tcaccagctcagactttagctccctcaggaagtggaaccctctctaactcagcaaaactt tccagctcagactccataccccctatggaggcagagccttctccaaaccagcaggaggcc acagttcaggcttcagagccccccaagaatatagaactttcaagccagcagatggtccca gagaatatatttcctccaaccatggagaactcaaatcaacttccagaaccacctacggag gttgtagctcaacttccacctcgttatgaggtgacaattccaacacaaggtcaggatcaa gctcagctttcaacactggccagtgtcacacttcaacctttggacctggggtttatcatc actccagaatccactacagaaattgaactttctccaaccatgcaggagaccccaactcag cctcctaaggaatttgtaccccaacctccagtatatcaagagcatcctgaaatgacacat ccacctccagacaagaaccaggctcagcatccaaacctgactcaattcacagttcaatct ttggacctggagcttaccataactacagaacctactacagaggttaaaacttctccaacc atggaggagacctcaactcagccttcagacctgggatttgccatagttccagaactcacc atagagactgaacattctacaggcctggacaagactacagctccacatccagaccaagtt cagactcagcattga >gi568815581r:35883804_36086649|GENSCAN_predicted_peptide_3|95_aa MGIVGSGGILSASRRVAGLNSATSILPCGEKRHGSCLTLSNSSTHMTQFLTVEQRKSAER EVKQMHQEKLKPAMDSVLTISSPGSMLFLRPSCSP >gi568815581r:35883804_36086649|GENSCAN_predicted_CDS_3|288_bp atgggcatagtgggctccggcggcatcctgtcagccagtagaagagtggccggcctgaac agtgcaacctccattctaccctgcggagaaaaaagacatggttcttgcctcacactcagt aactcttccacacacatgacccagttcttgactgtggagcagagaaagtctgcggagaga gaagtgaagcaaatgcaccaagagaagctgaaacctgccatggacagtgtgctgacaatt tccagccctggttccatgctcttcctgaggcccagctgcagcccttga >gi568815581r:35883804_36086649|GENSCAN_predicted_peptide_4|175_aa MRKNQCKNAENSKNQNASSPPNDRNTAPARAQNWMENEIDKLTEVGFRRMTKALLIYLVS SFLALNQASLISRCDLAQVLQLEDLDGFEGYSLSDWLCLAFVESKFNISKINENADGSFD YGLFQINSHYWCNDYKSYSENLCHVDCQDLLNPNLLAGIHCAKRIVSGARGMNNW >gi568815581r:35883804_36086649|GENSCAN_predicted_CDS_4|528_bp atgaggaaaaaccaatgcaaaaacgctgaaaattccaaaaaccagaatgcctcttctcct ccaaatgatcgcaacaccgctccagcaagggcacaaaactggatggagaatgagattgac aaattgacagaagtaggcttcagaaggatgacaaaggcgctactcatctatttggtcagc agctttcttgccctaaatcaggccagcctcatcagtcgctgtgacttggcccaggtgctg cagctggaggacttggatgggtttgagggttactccctgagtgactggctgtgcctggct tttgtggaaagcaagttcaacatatcaaagataaatgaaaatgcagacggaagctttgac tatggcctcttccagatcaacagccactactggtgcaacgattataagagttactcggaa aacctttgccacgtagactgtcaagatctgctgaatcccaaccttcttgcaggcatccac tgcgcaaaaaggattgtgtccggagcacgggggatgaacaactggtga >gi568815581r:35883804_36086649|GENSCAN_predicted_peptide_5|62_aa MKNQIEKQYDIDETQKRTDRNFQENKILYQKLAVVTAALPLNSDSLALTLVLAKDFETSC RI >gi568815581r:35883804_36086649|GENSCAN_predicted_CDS_5|189_bp atgaagaatcaaatcgagaaacaatatgacatagatgaaacacagaagagaactgacagg aacttccaggagaacaagatcctgtaccagaagctggcagttgtcacagcagccctgcct ttgaactctgactccctggctttgaccctggtgctggctaaggactttgaaacttcttgc cgcatctga >gi568815581r:35883804_36086649|GENSCAN_predicted_peptide_6|127_aa MKVSEAALSLLVLILIITSASRSQPTCNVIPSEVPEWVNTPSTCCLKYYEKVLPRRLVVG YRKALNCHLPAIIFVTKRNREVCTNPNDDWVQEYIKDPNLPLLPTRNLSTVKIITAKNGQ PQLLNSQ >gi568815581r:35883804_36086649|GENSCAN_predicted_CDS_6|384_bp atgaaggtctccgaggctgccctgtctctccttgtcctcatccttatcattacttcggct tctcgcagccagccaacttgcaatgtgattccttcagaagttcctgagtgggtgaacacc ccatccacctgctgcctgaagtattatgagaaagtgttgccaaggagactagtggtggga tacagaaaggccctcaactgtcacctgccagcaatcatcttcgtcaccaagaggaaccga gaagtctgcaccaaccccaatgacgactgggtccaagagtacatcaaggatcccaaccta cctttgctgcctaccaggaacttgtccacggttaaaattattacagcaaagaatggtcaa ccccagctcctcaactcccagtga >gi568815581r:35883804_36086649|GENSCAN_predicted_peptide_7|93_aa MKISVAAIPFFLLITIALGTKTESSSRGPYHPSECCFTYTTYKIPRQRIMDYYETNSQCS KPGIVFITKRGHSVCTNPSDKWVQDYIKDMKEN >gi568815581r:35883804_36086649|GENSCAN_predicted_CDS_7|282_bp atgaagatctccgtggctgccattcccttcttcctcctcatcaccatcgccctagggacc aagactgaatcctcctcacggggaccttaccacccctcagagtgctgcttcacctacact acctacaagatcccgcgtcagcggattatggattactatgagaccaacagccagtgctcc aagcccggaattgtcttcatcaccaaaaggggccattccgtctgtaccaaccccagtgac aagtgggtccaggactatatcaaggacatgaaggagaactga >gi568815581r:35883804_36086649|GENSCAN_predicted_peptide_8|113_aa MKVSVAALSCLMLVAVLGSQAQFTNDAETELMMSKLPLENPVVLNSFHFAADCCTSYISQ SIPCSLMKSYFETSSECSKPGVIFLTKKGRQVCAKPSGPGVQDCMKKLKPYSI >gi568815581r:35883804_36086649|GENSCAN_predicted_CDS_8|342_bp atgaaggtctccgtggctgccctctcctgcctcatgcttgttgctgtccttggatcccag gcccagttcacaaatgatgcagagacagagttaatgatgtcaaagcttccactggaaaat ccagtagttctgaacagctttcactttgctgctgactgctgcacctcctacatctcacaa agcatcccgtgttcactcatgaaaagttattttgaaacgagcagcgagtgctccaagcca ggtgtcatattcctcaccaagaaggggcggcaagtctgtgccaaacccagtggtccggga gttcaggattgcatgaaaaagctgaagccctactcaatataa >gi568815581r:35883804_36086649|GENSCAN_predicted_peptide_9|230_aa MKVSVAALSCLMLVTALGSQARVTKDAETEFMMSKLPLENPVLLDMLWRRKIGPQMTLSH AAGFHATSADCCISYTPRSIPCSLLESYFETNSECSKPGVIFLTKKGRRFCANPSDKQVQ GLNLKAGIEFQTKKEGEHTDGKVQELGQALLGSGPMVASKGPCLTSWKNQVIVQEDLNDG ECGDFNELWMWLLWDEWGAGEWMEWEDDLPLEFSHPAADLLFNCPQPNFS >gi568815581r:35883804_36086649|GENSCAN_predicted_CDS_9|693_bp atgaaggtctccgtggctgccctctcctgcctcatgcttgttactgcccttggatcccag gcccgggtcacaaaagatgcagagacagagttcatgatgtcaaagcttccattggaaaat ccagtacttctggacatgctctggaggagaaagattggtcctcagatgaccctttctcat gctgcaggattccatgctactagtgctgactgctgcatctcctacaccccacgaagcatc ccgtgttcactcctggagagttactttgaaacgaacagcgagtgctccaagccgggtgtc atcttcctcaccaagaaggggcgacgtttctgtgccaaccccagtgataagcaagttcag ggcctcaacctgaaggctggaattgagtttcagacaaaaaaggagggggaacacacagat gggaaggtgcaggagctggggcaagcacttttgggctctggccccatggtagcatccaaa ggtccttgtctgacatcctggaagaatcaggtcatcgttcaagaagacttgaacgatggt gaatgtggagattttaatgagttatggatgtggctcttatgggatgaatggggagctgga gaatggatggagtgggaagatgatcttcccttggagttcagccatcccgcggctgatctc ctcttcaactgtccccagccgaacttctcctga >gi568815581r:35883804_36086649|GENSCAN_predicted_peptide_10|470_aa MKENLCKKAENSKNQNASSPPKDHNSSPAREQNWMENEFDESTEVGFRRWVLTNSSKLKE HVVTQCKEAKNLDKRLQELLTRIASLEKSINDLIELKNTARELREAYTSINSRIDQAEER ISETEDHLNEIKHEDKIREKRMKRNEQSLQEIWDYVKRPNLHLICVPEKLRIKKLTQNCT TTWKLNNLLLNDYWVNNKVKAEINKFFETNGNKDTTYQNLWDTAKVVFRGKFIALNVHIR KWERSKIDTLTSQLKELEKQEQTNSKANRRQEITKIRAELKEIETQKTLQKINESRCWFF EKINKIDRLLARLIKKKREKNQIDTIKNDKGDITTDPTEIQTAIREYDKHLYANKPENLE EMDKFLDTYTLPRLNQEEVESLNRPITSSEIEAVIDSLPTKKSTGPDGFTAEFYQRYKEE LVPFLLKLFQSTEKEGLLPNSFYEASIILIPKPGKDKTTTTTKEISGQYP >gi568815581r:35883804_36086649|GENSCAN_predicted_CDS_10|1413_bp atgaaggaaaacctttgcaaaaaggctgaaaattccaaaaaccagaatgcctcttctcct ccaaaggatcacaactcctcgccagcaagggaacaaaactggatggagaatgagtttgat gaatcgacagaagtaggcttcagaaggtgggtactaacaaactcctccaagctaaaggag catgttgtaacccaatgcaaggaagctaagaaccttgataaaaggttacaggaactgcta actagaatagccagtttagagaagagcataaatgatctgatagagctgaaaaacacagca cgagaacttcgtgaagcatacacaagtatcaatagccgaattgatcaagcagaagaaagg atatcagagactgaagatcaccttaatgaaataaagcatgaagacaagattagagaaaaa agaatgaaaaggaatgaacaaagcctccaagaaatatgggactatgtgaaaagaccaaac ctacatttgatttgtgtacctgaaaaactcaggattaagaaactcactcaaaactgcaca actacatggaaactgaacaacctgctcctgaatgactactgggtaaacaacaaagttaag gcagaaataaataagttctttgaaaccaatgggaacaaagacacaacgtaccagaatctc tgggacacggcaaaagtagtgtttagagggaaatttatagcactaaatgtccacattaga aagtgggaaagatccaaaattgacaccctaacatcacaattaaaagaactagagaagcaa gagcaaacaaattcgaaagctaacagaagacaagaaataactaagatcagagcagaactg aaggagatagagacacaaaaaaccctgcaaaaaatcaatgaatccaggtgctggtttttt gaaaagattaacaaaatagatagactgctagccagactaataaaaaagaaaagagagaag aatcaaatagacacaataaaaaatgataaaggggatatcaccactgatcccacagaaata caaactgccatcagagaatacgataaacacctctatgcaaataaaccagaaaatctagaa gaaatggataaattcctagacacatacaccctcccaagactaaaccaggaagaagttgaa tccctgaatagaccaataacgagttctgaaattgaggcagtaattgatagcctaccaacc aaaaaaagcacaggaccagatggattcacagccgaattctaccagaggtacaaagaggag ctggtaccattccttctgaaattattccaatcaacagaaaaagagggactcctccctaac tcattttatgaggccagcatcatcctgataccaaaacctggcaaagacaaaacaacaaca acaacaaaggaaatttctggccaatatccctga >gi568815581r:35883804_36086649|GENSCAN_predicted_peptide_11|275_aa MFAKLTSEEKGDKGMFRYTNTHHCVTTAYSILYSSMLHRPCRGLIPVADVFLDPHLAGVG IDLSSLLCGGSNERRSKLPVNGHDPNGKLREDSQDLSENPWEEAGAQERKAARVWQTLTY EELGVPWKRSYLKQQMPYLLFTHYGPLPCQGTSALTLHYQIPCKHTPELTLNLAPTGDQW VPGELIMKGLAAALLVLVCTMALCSCAQVGTNKELCCLVYTSWQIPQKFIVDYSETSPQC PKPGVILLTKRGRQICADPNKKWVQKYISDLKLNA >gi568815581r:35883804_36086649|GENSCAN_predicted_CDS_11|828_bp atgtttgctaagttgacaagtgaggaaaagggagacaaaggtatgtttaggtacacaaat actcaccactgtgttacaactgcctacagtattctgtacagtagcatgctgcacagacct tgccggggtttaattccagttgctgatgtattcctggacccacaccttgctggagttggc atagacctttccagtctcctctgtggaggaagcaatgaaagaagatcgaagttacctgtt aatggtcatgatccaaatggaaaactgagggaagacagccaggacctgtcagagaaccca tgggaagaagctggggcacaagaaaggaaagcagcaagagtctggcagacattgacctat gaggaacttggggtcccatggaaaaggtcctacctgaaacagcagatgccatacttgttg ttcacccattatgggccactgccctgccagggaacctcggcccttacgttgcattaccag atcccctgcaaacatactccagaactcactctgaatttggcacccacaggggatcagtgg gtccctggagagctcatcatgaagggccttgcagctgccctccttgtcctcgtctgcacc atggccctctgctcctgtgcacaagttggtaccaacaaagagctctgctgcctcgtctat acctcctggcagattccacaaaagttcatagttgactattctgaaaccagcccccagtgc cccaagccaggtgtcatcctcctaaccaagagaggccggcagatctgtgctgaccccaat aagaagtgggtccagaaatacatcagcgacctgaagctgaatgcctga