GENSCAN 1.0 Date run: 5-Nov-116 Time: 17:32:02 Sequence gi568815581r:35877569_36081420 : 203852 bp : 44.10% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 137 132 6 1.05 1.02 Term - 1071 956 116 1 2 116 47 164 0.817 13.83 1.01 Init - 2737 2662 76 1 1 88 103 136 0.999 14.36 1.00 Prom - 2826 2787 40 -3.26 2.11 PlyA - 3154 3149 6 1.05 2.10 Term - 17813 17745 69 2 0 105 47 53 0.157 0.84 2.09 Intr - 29118 29011 108 0 0 57 99 41 0.072 2.58 2.08 Intr - 32170 31890 281 2 2 33 105 114 0.647 4.80 2.07 Intr - 32664 32501 164 2 2 47 76 95 0.308 3.82 2.06 Intr - 37699 37581 119 1 2 59 50 88 0.271 1.36 2.05 Intr - 42704 42595 110 0 2 55 44 102 0.406 2.50 2.04 Intr - 47204 47036 169 2 1 39 66 93 0.800 1.72 2.03 Intr - 48069 47947 123 0 0 65 92 81 0.991 6.98 2.02 Intr - 52687 52508 180 1 0 74 98 83 0.967 7.96 2.01 Init - 53159 53064 96 2 0 91 58 96 0.970 7.21 2.00 Prom - 53242 53203 40 -7.66 3.00 Prom + 54336 54375 40 -3.86 3.01 Init + 54583 54601 19 0 1 81 117 6 0.255 3.24 3.02 Intr + 57237 57367 131 1 2 73 97 13 0.275 1.11 3.03 Term + 58786 58923 138 0 0 85 48 131 0.336 6.76 3.04 PlyA + 58955 58960 6 1.05 4.05 PlyA - 59019 59014 6 -1.95 4.04 Term - 59265 59183 83 1 2 129 55 55 0.641 4.16 4.03 Intr - 60348 60190 159 1 0 97 47 182 0.904 14.96 4.02 Intr - 61789 61650 140 0 2 80 74 145 0.422 12.41 4.01 Init - 75638 75493 146 2 2 79 98 123 0.807 11.99 4.00 Prom - 77572 77533 40 -3.26 5.00 Prom + 83762 83801 40 -6.36 5.01 Init + 84017 84094 78 1 0 83 39 93 0.687 4.96 5.02 Intr + 94254 94291 38 2 2 77 67 70 0.540 0.86 5.03 Term + 95941 96013 73 1 1 111 41 92 0.858 4.18 5.04 PlyA + 99770 99775 6 1.05 6.04 PlyA - 99877 99872 6 1.05 6.03 Term - 100163 99998 166 1 1 95 45 220 0.994 15.79 6.02 Intr - 100716 100575 142 1 1 49 77 73 0.586 1.71 6.01 Init - 103852 103777 76 1 1 67 89 118 0.911 9.07 6.00 Prom - 104121 104082 40 -5.46 7.04 PlyA - 104553 104548 6 1.05 7.03 Term - 106320 106233 88 2 1 120 48 138 0.996 10.13 7.02 Intr - 106884 106770 115 1 1 97 25 129 0.912 7.01 7.01 Init - 109081 109003 79 1 1 76 94 127 0.871 13.36 7.00 Prom - 113825 113786 40 -3.86 8.05 PlyA - 114099 114094 6 1.05 8.04 Term - 120292 120199 94 0 1 119 38 65 0.966 1.90 8.03 Intr - 120823 120712 112 2 1 88 103 84 0.973 9.34 8.02 Intr - 121357 121298 60 2 0 94 98 11 0.712 1.41 8.01 Init - 123924 123849 76 0 1 65 101 119 0.970 10.19 8.00 Prom - 124582 124543 40 -4.66 9.08 PlyA - 125902 125897 6 1.05 9.07 Term - 130847 130636 212 0 2 96 44 129 0.972 6.76 9.06 Intr - 131035 130957 79 1 1 79 93 34 0.411 2.12 9.05 Intr - 132063 132022 42 0 0 104 63 31 0.157 0.64 9.04 Intr - 135740 135683 58 1 1 113 41 48 0.248 1.49 9.03 Intr - 136341 136176 166 1 1 102 53 154 0.932 12.32 9.02 Intr - 136825 136766 60 2 0 116 98 30 0.968 5.51 9.01 Init - 140329 140254 76 1 1 65 113 104 0.962 9.94 9.00 Prom - 141811 141772 40 -7.26 10.03 PlyA - 142196 142191 6 1.05 10.02 Term - 143337 142430 908 1 2 -29 45 349 0.544 11.36 10.01 Init - 145092 144588 505 0 1 74 0 254 0.428 10.35 10.00 Prom - 146902 146863 40 -5.16 11.00 Prom + 152139 152178 40 -6.16 11.01 Init + 152380 152419 40 0 1 38 116 24 0.077 0.55 11.02 Intr + 159296 159371 76 0 1 84 72 29 0.040 -0.53 11.03 Intr + 163296 163396 101 0 2 44 95 50 0.089 1.05 11.04 Intr + 169433 169601 169 0 1 24 86 114 0.170 3.80 11.05 Intr + 169781 169948 168 2 0 54 68 112 0.135 4.86 11.06 Intr + 186771 186841 71 0 2 87 113 59 0.000 7.13 11.07 Intr + 192879 192990 112 1 1 108 103 76 0.986 10.54 11.08 Term + 193383 193473 91 0 1 128 55 146 0.999 12.49 11.09 PlyA + 194441 194446 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 186775 186841 67 0 1 86 113 127 0.973 14.26 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:35877569_36081420|GENSCAN_predicted_peptide_1|63_aa MKVSAAALAVILIATALCAPASASPYSSDTTPCCFAYIARPLPRAHIKEYFYTSGKCSNP AVV >gi568815581r:35877569_36081420|GENSCAN_predicted_CDS_1|192_bp atgaaggtctccgcggcagccctcgctgtcatcctcattgctactgccctctgcgctcct gcatctgcctccccatattcctcggacaccacaccctgctgctttgcctacattgcccgc ccactgccccgtgcccacatcaaggagtatttctacaccagtggcaagtgctccaaccca gcagtcgtgtga >gi568815581r:35877569_36081420|GENSCAN_predicted_peptide_2|472_aa MAELVPFAVPIESDKTLLVWELSSGPTAEALHHSLFTAFSQFGLLYSVRVFPNAAVAHPG FYAVIKFYSARAAHRAQKACDRKQLFQKSPVKVRLGTRHKAVQHQALALNSSKCQELANY YFGFNGCSKRIIKLQELSDLEERENEDSMVPLPKQSLKFFCALEVVLPSCDCRSPGIGLV EEPMDKVEEESGKIAVEYRPSEDIVGVRCEEELHGLIQVCEDKNSGQFQHLKDQQEMIIQ QLNTPENDELPPVPQEPTTQSPAQTLAPSGSGTLSNSAKLSSSDSIPPMEAEPSPNQQEA TVQASEPPKNIELSSQQMVPENIFPPTMENSNQLPEPPTEVVAQLPPRYEVTIPTQGQDQ AQLSTLASVTLQPLDLGFIITPESTTEIELSPTMQETPTQPPKEFVPQPPVYQESHRKRL DLYQFNRRLQLNLQNLLKMRTPLQYSRRLQGIQSQQYDISGTLTSPSQEASK >gi568815581r:35877569_36081420|GENSCAN_predicted_CDS_2|1419_bp atggcggagttggtaccttttgcggttcccatcgagagtgacaaaaccttgctagtgtgg gagctgagctccggacccacggccgaggctttgcatcattctctgttcacagcattttct cagtttggccttctgtattcagtccgggtcttcccaaatgctgcagtggcccatcctggt ttctatgccgtcattaagttttattctgcaagggctgcccacagagcccaaaaggcatgc gaccggaagcagctttttcagaaatctccagtcaaggttcgtcttggcaccagacataag gcagttcaacatcaagcccttgccctgaacagttccaaatgccaagaactggcgaattac tactttggtttcaatgggtgttccaaaaggatcatcaagcttcaggagctttctgacctt gaagaaagggaaaatgaagatagcatggtgccacttccgaagcaaagcctgaagttcttc tgtgctttagaagtggtgttgccatcctgtgattgcaggagtcctggcattggcttggtg gaggagcctatggataaggtggaggaagaaagtggtaaaatagctgtggagtacagaccc agtgaagacatcgtaggtgtcagatgcgaagaagaactacacggtttaattcaagtatgt gaagataaaaactcagggcagtttcagcatttgaaagaccagcaagaaatgattattcag cagctaaatacccctgaaaatgatgaacttcctccagtccctcaagagcccacaactcag tcaccagctcagactttagctccctcaggaagtggaaccctctctaactcagcaaaactt tccagctcagactccataccccctatggaggcagagccttctccaaaccagcaggaggcc acagttcaggcttcagagccccccaagaatatagaactttcaagccagcagatggtccca gagaatatatttcctccaaccatggagaactcaaatcaacttccagaaccacctacggag gttgtagctcaacttccacctcgttatgaggtgacaattccaacacaaggtcaggatcaa gctcagctttcaacactggccagtgtcacacttcaacctttggacctggggtttatcatc actccagaatccactacagaaattgaactttctccaaccatgcaggagaccccaactcag cctcctaaggaatttgtaccccaacctccagtatatcaagagagtcatcggaagagactg gacctttaccagttcaacaggagacttcagctgaatctccagaacctactaaagatgaga acccctctccaatacagtaggaggctgcagggtatacagtctcagcagtacgacatatca gggacgctaacttctcccagccaagaagcaagcaaatga >gi568815581r:35877569_36081420|GENSCAN_predicted_peptide_3|95_aa MGIVGSGGILSASRRVAGLNSATSILPCGEKRHGSCLTLSNSSTHMTQFLTVEQRKSAER EVKQMHQEKLKPAMDSVLTISSPGSMLFLRPSCSP >gi568815581r:35877569_36081420|GENSCAN_predicted_CDS_3|288_bp atgggcatagtgggctccggcggcatcctgtcagccagtagaagagtggccggcctgaac agtgcaacctccattctaccctgcggagaaaaaagacatggttcttgcctcacactcagt aactcttccacacacatgacccagttcttgactgtggagcagagaaagtctgcggagaga gaagtgaagcaaatgcaccaagagaagctgaaacctgccatggacagtgtgctgacaatt tccagccctggttccatgctcttcctgaggcccagctgcagcccttga >gi568815581r:35877569_36081420|GENSCAN_predicted_peptide_4|175_aa MRKNQCKNAENSKNQNASSPPNDRNTAPARAQNWMENEIDKLTEVGFRRMTKALLIYLVS SFLALNQASLISRCDLAQVLQLEDLDGFEGYSLSDWLCLAFVESKFNISKINENADGSFD YGLFQINSHYWCNDYKSYSENLCHVDCQDLLNPNLLAGIHCAKRIVSGARGMNNW >gi568815581r:35877569_36081420|GENSCAN_predicted_CDS_4|528_bp atgaggaaaaaccaatgcaaaaacgctgaaaattccaaaaaccagaatgcctcttctcct ccaaatgatcgcaacaccgctccagcaagggcacaaaactggatggagaatgagattgac aaattgacagaagtaggcttcagaaggatgacaaaggcgctactcatctatttggtcagc agctttcttgccctaaatcaggccagcctcatcagtcgctgtgacttggcccaggtgctg cagctggaggacttggatgggtttgagggttactccctgagtgactggctgtgcctggct tttgtggaaagcaagttcaacatatcaaagataaatgaaaatgcagacggaagctttgac tatggcctcttccagatcaacagccactactggtgcaacgattataagagttactcggaa aacctttgccacgtagactgtcaagatctgctgaatcccaaccttcttgcaggcatccac tgcgcaaaaaggattgtgtccggagcacgggggatgaacaactggtga >gi568815581r:35877569_36081420|GENSCAN_predicted_peptide_5|62_aa MKNQIEKQYDIDETQKRTDRNFQENKILYQKLAVVTAALPLNSDSLALTLVLAKDFETSC RI >gi568815581r:35877569_36081420|GENSCAN_predicted_CDS_5|189_bp atgaagaatcaaatcgagaaacaatatgacatagatgaaacacagaagagaactgacagg aacttccaggagaacaagatcctgtaccagaagctggcagttgtcacagcagccctgcct ttgaactctgactccctggctttgaccctggtgctggctaaggactttgaaacttcttgc cgcatctga >gi568815581r:35877569_36081420|GENSCAN_predicted_peptide_6|127_aa MKVSEAALSLLVLILIITSASRSQPTCNVIPSEVPEWVNTPSTCCLKYYEKVLPRRLVVG YRKALNCHLPAIIFVTKRNREVCTNPNDDWVQEYIKDPNLPLLPTRNLSTVKIITAKNGQ PQLLNSQ >gi568815581r:35877569_36081420|GENSCAN_predicted_CDS_6|384_bp atgaaggtctccgaggctgccctgtctctccttgtcctcatccttatcattacttcggct tctcgcagccagccaacttgcaatgtgattccttcagaagttcctgagtgggtgaacacc ccatccacctgctgcctgaagtattatgagaaagtgttgccaaggagactagtggtggga tacagaaaggccctcaactgtcacctgccagcaatcatcttcgtcaccaagaggaaccga gaagtctgcaccaaccccaatgacgactgggtccaagagtacatcaaggatcccaaccta cctttgctgcctaccaggaacttgtccacggttaaaattattacagcaaagaatggtcaa ccccagctcctcaactcccagtga >gi568815581r:35877569_36081420|GENSCAN_predicted_peptide_7|93_aa MKISVAAIPFFLLITIALGTKTESSSRGPYHPSECCFTYTTYKIPRQRIMDYYETNSQCS KPGIVFITKRGHSVCTNPSDKWVQDYIKDMKEN >gi568815581r:35877569_36081420|GENSCAN_predicted_CDS_7|282_bp atgaagatctccgtggctgccattcccttcttcctcctcatcaccatcgccctagggacc aagactgaatcctcctcacggggaccttaccacccctcagagtgctgcttcacctacact acctacaagatcccgcgtcagcggattatggattactatgagaccaacagccagtgctcc aagcccggaattgtcttcatcaccaaaaggggccattccgtctgtaccaaccccagtgac aagtgggtccaggactatatcaaggacatgaaggagaactga >gi568815581r:35877569_36081420|GENSCAN_predicted_peptide_8|113_aa MKVSVAALSCLMLVAVLGSQAQFTNDAETELMMSKLPLENPVVLNSFHFAADCCTSYISQ SIPCSLMKSYFETSSECSKPGVIFLTKKGRQVCAKPSGPGVQDCMKKLKPYSI >gi568815581r:35877569_36081420|GENSCAN_predicted_CDS_8|342_bp atgaaggtctccgtggctgccctctcctgcctcatgcttgttgctgtccttggatcccag gcccagttcacaaatgatgcagagacagagttaatgatgtcaaagcttccactggaaaat ccagtagttctgaacagctttcactttgctgctgactgctgcacctcctacatctcacaa agcatcccgtgttcactcatgaaaagttattttgaaacgagcagcgagtgctccaagcca ggtgtcatattcctcaccaagaaggggcggcaagtctgtgccaaacccagtggtccggga gttcaggattgcatgaaaaagctgaagccctactcaatataa >gi568815581r:35877569_36081420|GENSCAN_predicted_peptide_9|230_aa MKVSVAALSCLMLVTALGSQARVTKDAETEFMMSKLPLENPVLLDMLWRRKIGPQMTLSH AAGFHATSADCCISYTPRSIPCSLLESYFETNSECSKPGVIFLTKKGRRFCANPSDKQVQ GLNLKAGIEFQTKKEGEHTDGKVQELGQALLGSGPMVASKGPCLTSWKNQVIVQEDLNDG ECGDFNELWMWLLWDEWGAGEWMEWEDDLPLEFSHPAADLLFNCPQPNFS >gi568815581r:35877569_36081420|GENSCAN_predicted_CDS_9|693_bp atgaaggtctccgtggctgccctctcctgcctcatgcttgttactgcccttggatcccag gcccgggtcacaaaagatgcagagacagagttcatgatgtcaaagcttccattggaaaat ccagtacttctggacatgctctggaggagaaagattggtcctcagatgaccctttctcat gctgcaggattccatgctactagtgctgactgctgcatctcctacaccccacgaagcatc ccgtgttcactcctggagagttactttgaaacgaacagcgagtgctccaagccgggtgtc atcttcctcaccaagaaggggcgacgtttctgtgccaaccccagtgataagcaagttcag ggcctcaacctgaaggctggaattgagtttcagacaaaaaaggagggggaacacacagat gggaaggtgcaggagctggggcaagcacttttgggctctggccccatggtagcatccaaa ggtccttgtctgacatcctggaagaatcaggtcatcgttcaagaagacttgaacgatggt gaatgtggagattttaatgagttatggatgtggctcttatgggatgaatggggagctgga gaatggatggagtgggaagatgatcttcccttggagttcagccatcccgcggctgatctc ctcttcaactgtccccagccgaacttctcctga >gi568815581r:35877569_36081420|GENSCAN_predicted_peptide_10|470_aa MKENLCKKAENSKNQNASSPPKDHNSSPAREQNWMENEFDESTEVGFRRWVLTNSSKLKE HVVTQCKEAKNLDKRLQELLTRIASLEKSINDLIELKNTARELREAYTSINSRIDQAEER ISETEDHLNEIKHEDKIREKRMKRNEQSLQEIWDYVKRPNLHLICVPEKLRIKKLTQNCT TTWKLNNLLLNDYWVNNKVKAEINKFFETNGNKDTTYQNLWDTAKVVFRGKFIALNVHIR KWERSKIDTLTSQLKELEKQEQTNSKANRRQEITKIRAELKEIETQKTLQKINESRCWFF EKINKIDRLLARLIKKKREKNQIDTIKNDKGDITTDPTEIQTAIREYDKHLYANKPENLE EMDKFLDTYTLPRLNQEEVESLNRPITSSEIEAVIDSLPTKKSTGPDGFTAEFYQRYKEE LVPFLLKLFQSTEKEGLLPNSFYEASIILIPKPGKDKTTTTTKEISGQYP >gi568815581r:35877569_36081420|GENSCAN_predicted_CDS_10|1413_bp atgaaggaaaacctttgcaaaaaggctgaaaattccaaaaaccagaatgcctcttctcct ccaaaggatcacaactcctcgccagcaagggaacaaaactggatggagaatgagtttgat gaatcgacagaagtaggcttcagaaggtgggtactaacaaactcctccaagctaaaggag catgttgtaacccaatgcaaggaagctaagaaccttgataaaaggttacaggaactgcta actagaatagccagtttagagaagagcataaatgatctgatagagctgaaaaacacagca cgagaacttcgtgaagcatacacaagtatcaatagccgaattgatcaagcagaagaaagg atatcagagactgaagatcaccttaatgaaataaagcatgaagacaagattagagaaaaa agaatgaaaaggaatgaacaaagcctccaagaaatatgggactatgtgaaaagaccaaac ctacatttgatttgtgtacctgaaaaactcaggattaagaaactcactcaaaactgcaca actacatggaaactgaacaacctgctcctgaatgactactgggtaaacaacaaagttaag gcagaaataaataagttctttgaaaccaatgggaacaaagacacaacgtaccagaatctc tgggacacggcaaaagtagtgtttagagggaaatttatagcactaaatgtccacattaga aagtgggaaagatccaaaattgacaccctaacatcacaattaaaagaactagagaagcaa gagcaaacaaattcgaaagctaacagaagacaagaaataactaagatcagagcagaactg aaggagatagagacacaaaaaaccctgcaaaaaatcaatgaatccaggtgctggtttttt gaaaagattaacaaaatagatagactgctagccagactaataaaaaagaaaagagagaag aatcaaatagacacaataaaaaatgataaaggggatatcaccactgatcccacagaaata caaactgccatcagagaatacgataaacacctctatgcaaataaaccagaaaatctagaa gaaatggataaattcctagacacatacaccctcccaagactaaaccaggaagaagttgaa tccctgaatagaccaataacgagttctgaaattgaggcagtaattgatagcctaccaacc aaaaaaagcacaggaccagatggattcacagccgaattctaccagaggtacaaagaggag ctggtaccattccttctgaaattattccaatcaacagaaaaagagggactcctccctaac tcattttatgaggccagcatcatcctgataccaaaacctggcaaagacaaaacaacaaca acaacaaaggaaatttctggccaatatccctga >gi568815581r:35877569_36081420|GENSCAN_predicted_peptide_11|275_aa MFAKLTSEEKGDKGMFRYTNTHHCVTTAYSILYSSMLHRPCRGLIPVADVFLDPHLAGVG IDLSSLLCGGSNERRSKLPVNGHDPNGKLREDSQDLSENPWEEAGAQERKAARVWQTLTY EELGVPWKRSYLKQQMPYLLFTHYGPLPCQGTSALTLHYQIPCKHTPELTLNLAPTGDQW VPGELIMKGLAAALLVLVCTMALCSCAQVGTNKELCCLVYTSWQIPQKFIVDYSETSPQC PKPGVILLTKRGRQICADPNKKWVQKYISDLKLNA >gi568815581r:35877569_36081420|GENSCAN_predicted_CDS_11|828_bp atgtttgctaagttgacaagtgaggaaaagggagacaaaggtatgtttaggtacacaaat actcaccactgtgttacaactgcctacagtattctgtacagtagcatgctgcacagacct tgccggggtttaattccagttgctgatgtattcctggacccacaccttgctggagttggc atagacctttccagtctcctctgtggaggaagcaatgaaagaagatcgaagttacctgtt aatggtcatgatccaaatggaaaactgagggaagacagccaggacctgtcagagaaccca tgggaagaagctggggcacaagaaaggaaagcagcaagagtctggcagacattgacctat gaggaacttggggtcccatggaaaaggtcctacctgaaacagcagatgccatacttgttg ttcacccattatgggccactgccctgccagggaacctcggcccttacgttgcattaccag atcccctgcaaacatactccagaactcactctgaatttggcacccacaggggatcagtgg gtccctggagagctcatcatgaagggccttgcagctgccctccttgtcctcgtctgcacc atggccctctgctcctgtgcacaagttggtaccaacaaagagctctgctgcctcgtctat acctcctggcagattccacaaaagttcatagttgactattctgaaaccagcccccagtgc cccaagccaggtgtcatcctcctaaccaagagaggccggcagatctgtgctgaccccaat aagaagtgggtccagaaatacatcagcgacctgaagctgaatgcctga