GENSCAN 1.0 Date run: 6-Nov-116 Time: 13:52:51 Sequence gi568815581f:35964343_36171038 : 206696 bp : 44.03% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1572 1608 37 2 1 86 63 44 0.216 1.88 1.02 Intr + 8698 8743 46 2 1 115 86 15 0.358 1.47 1.03 Term + 9167 9239 73 2 1 111 41 92 0.931 4.18 1.04 PlyA + 12996 13001 6 1.05 2.04 PlyA - 13103 13098 6 1.05 2.03 Term - 13389 13224 166 2 1 95 45 220 0.995 15.79 2.02 Intr - 13942 13801 142 2 1 49 77 73 0.586 1.71 2.01 Init - 17078 17003 76 2 1 67 89 118 0.911 9.07 2.00 Prom - 17347 17308 40 -5.46 3.04 PlyA - 17779 17774 6 1.05 3.03 Term - 19546 19459 88 0 1 120 48 138 0.996 10.13 3.02 Intr - 20110 19996 115 2 1 97 25 129 0.912 7.01 3.01 Init - 22307 22229 79 2 1 76 94 127 0.871 13.36 3.00 Prom - 27051 27012 40 -3.86 4.05 PlyA - 27325 27320 6 1.05 4.04 Term - 33518 33425 94 1 1 119 38 65 0.966 1.90 4.03 Intr - 34049 33938 112 0 1 88 103 84 0.973 9.34 4.02 Intr - 34583 34524 60 0 0 94 98 11 0.712 1.41 4.01 Init - 37150 37075 76 1 1 65 101 119 0.970 10.19 4.00 Prom - 37808 37769 40 -4.66 5.08 PlyA - 39128 39123 6 1.05 5.07 Term - 44073 43862 212 1 2 96 44 129 0.972 6.76 5.06 Intr - 44261 44183 79 2 1 79 93 34 0.411 2.12 5.05 Intr - 45289 45248 42 1 0 104 63 31 0.157 0.64 5.04 Intr - 48966 48909 58 2 1 113 41 48 0.248 1.49 5.03 Intr - 49567 49402 166 2 1 102 53 154 0.932 12.32 5.02 Intr - 50051 49992 60 0 0 116 98 30 0.968 5.51 5.01 Init - 53555 53480 76 2 1 65 113 104 0.962 9.94 5.00 Prom - 55037 54998 40 -7.26 6.03 PlyA - 55422 55417 6 1.05 6.02 Term - 56563 55656 908 2 2 -29 45 349 0.544 11.36 6.01 Init - 58318 57814 505 1 1 74 0 254 0.428 10.35 6.00 Prom - 60128 60089 40 -5.16 7.00 Prom + 65365 65404 40 -6.16 7.01 Init + 65606 65645 40 1 1 38 116 24 0.077 0.55 7.02 Intr + 72522 72597 76 1 1 84 72 29 0.040 -0.53 7.03 Intr + 76522 76622 101 1 2 44 95 50 0.089 1.05 7.04 Intr + 82659 82827 169 1 1 24 86 114 0.170 3.80 7.05 Intr + 83007 83174 168 0 0 54 68 112 0.135 4.86 7.06 Intr + 99997 100067 71 1 2 87 113 59 0.000 7.13 7.07 Intr + 106105 106216 112 2 1 108 103 76 0.986 10.54 7.08 Term + 106609 106699 91 1 1 128 55 146 0.999 12.49 7.09 PlyA + 107667 107672 6 1.05 8.05 PlyA - 109906 109901 6 1.05 8.04 Term - 124420 124330 91 0 1 128 55 104 0.999 8.29 8.03 Intr - 124955 124841 115 0 1 135 103 127 0.999 18.41 8.02 Intr - 125837 125644 194 1 2 75 94 104 0.793 8.84 8.01 Init - 134133 133889 245 0 2 86 55 123 0.588 6.01 8.00 Prom - 138442 138403 40 -0.16 9.00 Prom + 138999 139038 40 -5.46 9.01 Init + 139564 139639 76 0 1 68 108 85 0.987 7.77 9.02 Intr + 140883 140942 60 1 0 110 45 46 0.041 1.21 9.03 Intr + 147935 148023 89 0 2 69 39 46 0.037 -2.51 9.04 Intr + 149012 149167 156 1 0 75 77 60 0.142 3.81 9.05 Intr + 153383 153436 54 1 0 108 72 30 0.233 2.58 9.06 Intr + 154650 154725 76 2 1 22 113 55 0.371 0.49 9.07 Intr + 154946 155003 58 0 1 109 101 42 0.399 5.54 9.08 Term + 156839 156953 115 2 1 53 44 78 0.248 -2.06 9.09 PlyA + 158344 158349 6 1.05 10.00 Prom + 159241 159280 40 -1.56 10.01 Init + 165142 165253 112 0 1 68 72 69 0.537 3.67 10.02 Intr + 183238 183353 116 2 2 68 109 20 0.083 2.27 10.03 Intr + 185238 185369 132 2 0 91 72 21 0.021 1.64 10.04 Intr + 192609 192659 51 2 0 62 79 51 0.049 0.70 10.05 Term + 197271 197402 132 2 0 105 43 104 0.247 5.69 10.06 PlyA + 198341 198346 6 1.05 11.08 PlyA - 198617 198612 6 1.05 11.07 Term - 200293 200136 158 2 2 95 39 129 0.728 6.80 11.06 Intr - 202221 202009 213 1 0 101 49 71 0.634 3.19 11.05 Intr - 203354 203202 153 0 0 85 80 82 0.818 7.14 11.04 Intr - 203824 203725 100 1 1 91 89 78 0.700 7.88 11.03 Intr - 205233 205181 53 1 2 100 38 77 0.764 2.53 11.02 Intr - 205669 205549 121 1 1 92 110 146 0.994 17.27 11.01 Intr - 206261 206213 49 1 1 97 117 62 0.980 8.68 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 100001 100067 67 1 1 86 113 127 0.973 14.26 S.002 Intr + 140186 140300 115 0 1 82 59 48 0.873 0.81 S.003 Term + 140883 140970 88 0 1 110 49 89 0.876 4.33 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:35964343_36171038|GENSCAN_predicted_peptide_1|51_aa MQRAYNGGSLPTIGSPMYPATCILQGKRPLNSDSLALTLVLAKDFETSCRI >gi568815581f:35964343_36171038|GENSCAN_predicted_CDS_1|156_bp atgcagagagcttacaacggcggctctttaccaaccattggttcccccatgtacccagcc acctgtattctacaaggcaaaaggcctttgaactctgactccctggctttgaccctggtg ctggctaaggactttgaaacttcttgccgcatctga >gi568815581f:35964343_36171038|GENSCAN_predicted_peptide_2|127_aa MKVSEAALSLLVLILIITSASRSQPTCNVIPSEVPEWVNTPSTCCLKYYEKVLPRRLVVG YRKALNCHLPAIIFVTKRNREVCTNPNDDWVQEYIKDPNLPLLPTRNLSTVKIITAKNGQ PQLLNSQ >gi568815581f:35964343_36171038|GENSCAN_predicted_CDS_2|384_bp atgaaggtctccgaggctgccctgtctctccttgtcctcatccttatcattacttcggct tctcgcagccagccaacttgcaatgtgattccttcagaagttcctgagtgggtgaacacc ccatccacctgctgcctgaagtattatgagaaagtgttgccaaggagactagtggtggga tacagaaaggccctcaactgtcacctgccagcaatcatcttcgtcaccaagaggaaccga gaagtctgcaccaaccccaatgacgactgggtccaagagtacatcaaggatcccaaccta cctttgctgcctaccaggaacttgtccacggttaaaattattacagcaaagaatggtcaa ccccagctcctcaactcccagtga >gi568815581f:35964343_36171038|GENSCAN_predicted_peptide_3|93_aa MKISVAAIPFFLLITIALGTKTESSSRGPYHPSECCFTYTTYKIPRQRIMDYYETNSQCS KPGIVFITKRGHSVCTNPSDKWVQDYIKDMKEN >gi568815581f:35964343_36171038|GENSCAN_predicted_CDS_3|282_bp atgaagatctccgtggctgccattcccttcttcctcctcatcaccatcgccctagggacc aagactgaatcctcctcacggggaccttaccacccctcagagtgctgcttcacctacact acctacaagatcccgcgtcagcggattatggattactatgagaccaacagccagtgctcc aagcccggaattgtcttcatcaccaaaaggggccattccgtctgtaccaaccccagtgac aagtgggtccaggactatatcaaggacatgaaggagaactga >gi568815581f:35964343_36171038|GENSCAN_predicted_peptide_4|113_aa MKVSVAALSCLMLVAVLGSQAQFTNDAETELMMSKLPLENPVVLNSFHFAADCCTSYISQ SIPCSLMKSYFETSSECSKPGVIFLTKKGRQVCAKPSGPGVQDCMKKLKPYSI >gi568815581f:35964343_36171038|GENSCAN_predicted_CDS_4|342_bp atgaaggtctccgtggctgccctctcctgcctcatgcttgttgctgtccttggatcccag gcccagttcacaaatgatgcagagacagagttaatgatgtcaaagcttccactggaaaat ccagtagttctgaacagctttcactttgctgctgactgctgcacctcctacatctcacaa agcatcccgtgttcactcatgaaaagttattttgaaacgagcagcgagtgctccaagcca ggtgtcatattcctcaccaagaaggggcggcaagtctgtgccaaacccagtggtccggga gttcaggattgcatgaaaaagctgaagccctactcaatataa >gi568815581f:35964343_36171038|GENSCAN_predicted_peptide_5|230_aa MKVSVAALSCLMLVTALGSQARVTKDAETEFMMSKLPLENPVLLDMLWRRKIGPQMTLSH AAGFHATSADCCISYTPRSIPCSLLESYFETNSECSKPGVIFLTKKGRRFCANPSDKQVQ GLNLKAGIEFQTKKEGEHTDGKVQELGQALLGSGPMVASKGPCLTSWKNQVIVQEDLNDG ECGDFNELWMWLLWDEWGAGEWMEWEDDLPLEFSHPAADLLFNCPQPNFS >gi568815581f:35964343_36171038|GENSCAN_predicted_CDS_5|693_bp atgaaggtctccgtggctgccctctcctgcctcatgcttgttactgcccttggatcccag gcccgggtcacaaaagatgcagagacagagttcatgatgtcaaagcttccattggaaaat ccagtacttctggacatgctctggaggagaaagattggtcctcagatgaccctttctcat gctgcaggattccatgctactagtgctgactgctgcatctcctacaccccacgaagcatc ccgtgttcactcctggagagttactttgaaacgaacagcgagtgctccaagccgggtgtc atcttcctcaccaagaaggggcgacgtttctgtgccaaccccagtgataagcaagttcag ggcctcaacctgaaggctggaattgagtttcagacaaaaaaggagggggaacacacagat gggaaggtgcaggagctggggcaagcacttttgggctctggccccatggtagcatccaaa ggtccttgtctgacatcctggaagaatcaggtcatcgttcaagaagacttgaacgatggt gaatgtggagattttaatgagttatggatgtggctcttatgggatgaatggggagctgga gaatggatggagtgggaagatgatcttcccttggagttcagccatcccgcggctgatctc ctcttcaactgtccccagccgaacttctcctga >gi568815581f:35964343_36171038|GENSCAN_predicted_peptide_6|470_aa MKENLCKKAENSKNQNASSPPKDHNSSPAREQNWMENEFDESTEVGFRRWVLTNSSKLKE HVVTQCKEAKNLDKRLQELLTRIASLEKSINDLIELKNTARELREAYTSINSRIDQAEER ISETEDHLNEIKHEDKIREKRMKRNEQSLQEIWDYVKRPNLHLICVPEKLRIKKLTQNCT TTWKLNNLLLNDYWVNNKVKAEINKFFETNGNKDTTYQNLWDTAKVVFRGKFIALNVHIR KWERSKIDTLTSQLKELEKQEQTNSKANRRQEITKIRAELKEIETQKTLQKINESRCWFF EKINKIDRLLARLIKKKREKNQIDTIKNDKGDITTDPTEIQTAIREYDKHLYANKPENLE EMDKFLDTYTLPRLNQEEVESLNRPITSSEIEAVIDSLPTKKSTGPDGFTAEFYQRYKEE LVPFLLKLFQSTEKEGLLPNSFYEASIILIPKPGKDKTTTTTKEISGQYP >gi568815581f:35964343_36171038|GENSCAN_predicted_CDS_6|1413_bp atgaaggaaaacctttgcaaaaaggctgaaaattccaaaaaccagaatgcctcttctcct ccaaaggatcacaactcctcgccagcaagggaacaaaactggatggagaatgagtttgat gaatcgacagaagtaggcttcagaaggtgggtactaacaaactcctccaagctaaaggag catgttgtaacccaatgcaaggaagctaagaaccttgataaaaggttacaggaactgcta actagaatagccagtttagagaagagcataaatgatctgatagagctgaaaaacacagca cgagaacttcgtgaagcatacacaagtatcaatagccgaattgatcaagcagaagaaagg atatcagagactgaagatcaccttaatgaaataaagcatgaagacaagattagagaaaaa agaatgaaaaggaatgaacaaagcctccaagaaatatgggactatgtgaaaagaccaaac ctacatttgatttgtgtacctgaaaaactcaggattaagaaactcactcaaaactgcaca actacatggaaactgaacaacctgctcctgaatgactactgggtaaacaacaaagttaag gcagaaataaataagttctttgaaaccaatgggaacaaagacacaacgtaccagaatctc tgggacacggcaaaagtagtgtttagagggaaatttatagcactaaatgtccacattaga aagtgggaaagatccaaaattgacaccctaacatcacaattaaaagaactagagaagcaa gagcaaacaaattcgaaagctaacagaagacaagaaataactaagatcagagcagaactg aaggagatagagacacaaaaaaccctgcaaaaaatcaatgaatccaggtgctggtttttt gaaaagattaacaaaatagatagactgctagccagactaataaaaaagaaaagagagaag aatcaaatagacacaataaaaaatgataaaggggatatcaccactgatcccacagaaata caaactgccatcagagaatacgataaacacctctatgcaaataaaccagaaaatctagaa gaaatggataaattcctagacacatacaccctcccaagactaaaccaggaagaagttgaa tccctgaatagaccaataacgagttctgaaattgaggcagtaattgatagcctaccaacc aaaaaaagcacaggaccagatggattcacagccgaattctaccagaggtacaaagaggag ctggtaccattccttctgaaattattccaatcaacagaaaaagagggactcctccctaac tcattttatgaggccagcatcatcctgataccaaaacctggcaaagacaaaacaacaaca acaacaaaggaaatttctggccaatatccctga >gi568815581f:35964343_36171038|GENSCAN_predicted_peptide_7|275_aa MFAKLTSEEKGDKGMFRYTNTHHCVTTAYSILYSSMLHRPCRGLIPVADVFLDPHLAGVG IDLSSLLCGGSNERRSKLPVNGHDPNGKLREDSQDLSENPWEEAGAQERKAARVWQTLTY EELGVPWKRSYLKQQMPYLLFTHYGPLPCQGTSALTLHYQIPCKHTPELTLNLAPTGDQW VPGELIMKGLAAALLVLVCTMALCSCAQVGTNKELCCLVYTSWQIPQKFIVDYSETSPQC PKPGVILLTKRGRQICADPNKKWVQKYISDLKLNA >gi568815581f:35964343_36171038|GENSCAN_predicted_CDS_7|828_bp atgtttgctaagttgacaagtgaggaaaagggagacaaaggtatgtttaggtacacaaat actcaccactgtgttacaactgcctacagtattctgtacagtagcatgctgcacagacct tgccggggtttaattccagttgctgatgtattcctggacccacaccttgctggagttggc atagacctttccagtctcctctgtggaggaagcaatgaaagaagatcgaagttacctgtt aatggtcatgatccaaatggaaaactgagggaagacagccaggacctgtcagagaaccca tgggaagaagctggggcacaagaaaggaaagcagcaagagtctggcagacattgacctat gaggaacttggggtcccatggaaaaggtcctacctgaaacagcagatgccatacttgttg ttcacccattatgggccactgccctgccagggaacctcggcccttacgttgcattaccag atcccctgcaaacatactccagaactcactctgaatttggcacccacaggggatcagtgg gtccctggagagctcatcatgaagggccttgcagctgccctccttgtcctcgtctgcacc atggccctctgctcctgtgcacaagttggtaccaacaaagagctctgctgcctcgtctat acctcctggcagattccacaaaagttcatagttgactattctgaaaccagcccccagtgc cccaagccaggtgtcatcctcctaaccaagagaggccggcagatctgtgctgaccccaat aagaagtgggtccagaaatacatcagcgacctgaagctgaatgcctga >gi568815581f:35964343_36171038|GENSCAN_predicted_peptide_8|214_aa MVSKSLWRKWEQKLDCELKKQIHNYPECRILQDEICRTMSWLEQLISDCHTLTTELASPT LLTMEMDLRRLPQPALACERQRAAYKEESWFQTSEGHGQQTVVSPFLALLTLEPTFRHLL RIMQVSTAALAVLLCTMALCNQFSASLAADTPTACCFSYTSRQIPQNFIADYFETSSQCS KPGVIFLTKRSRQVCADPSEEWVQKYVSDLELSA >gi568815581f:35964343_36171038|GENSCAN_predicted_CDS_8|645_bp atggtgtccaagtcactttggaggaaatgggagcagaagctggattgtgaattgaagaaa caaatacacaattatccagagtgcagaattcttcaggatgaaatctgtaggacaatgagc tggttggagcagctcatcagtgactgtcacactctgactacggagctggccagccccacg cttcttaccatggaaatggaccttcgtcgcctgccacaacctgctcttgcttgtgaaagg cagagggctgcctataaagaggagagctggtttcagacttcagaaggacacgggcagcag acagtggtcagtcctttcttggctctgctgacactcgagcccacattccgtcacctgctc agaatcatgcaggtctccactgctgcccttgctgtcctcctctgcaccatggctctctgc aaccagttctctgcatcacttgctgctgacacgccgaccgcctgctgcttcagctacacc tcccggcagattccacagaatttcatagctgactactttgagacgagcagccagtgctcc aagcccggtgtcatcttcctaaccaagcgaagccggcaggtctgtgctgaccccagtgag gagtgggtccagaaatatgtcagcgacctggagctgagtgcctga >gi568815581f:35964343_36171038|GENSCAN_predicted_peptide_9|227_aa MKLCVTVLSLLMLVAAFCSPALSAPNSKPKEASKSVLIPVNPGSRKRAEIVEKQTQAFIM QVADLQQKGHAQPHQGYINSPVLCHNLIQRDLEHFLLLQDITLVHYNDDIMMTRSSEQEV ANTLDLLAVSMDDDDCWQNRGVVVKNSSPQAQATDWYWSMTCYEPSHTAEDGTIKLQENK LRAPTDSTLCTMYNSKDMEPTQIAINDRRDKENVAYKHPGILRSHKK >gi568815581f:35964343_36171038|GENSCAN_predicted_CDS_9|684_bp atgaagctctgcgtgactgtcctgtctctcctcatgctagtagctgccttctgctctcca gcgctctcagcaccaaattccaaaccaaaagaagcaagcaagtctgtgctgatcccagtg aatcctgggtccaggaaaagagctgaaattgtagaaaaacagacacaagcttttatcatg caagtggctgacctgcaacaaaaggggcatgcacagcctcaccaggggtatatcaactct ccggttttgtgtcataatcttattcagagagaccttgaacactttttgcttctgcaagat atcacactggtccattacaatgatgacattatgatgactagatccagtgaacaagaagta gcaaacacactggacttattggcagtgagcatggatgatgatgattgctggcaaaacagg ggagtggtagtgaagaacagcagtccccaggcccaggccacagattggtactggtccatg acctgttacgaaccgagccacacagcagaagatgggaccatcaagttgcaggaaaacaag ctcagggctcccactgattctacattatgcactatgtacaatagcaaagacatggaacca acccaaattgccatcaacgatagaagggataaagaaaatgtggcatataaacaccctgga atactacgcagccataaaaaatga >gi568815581f:35964343_36171038|GENSCAN_predicted_peptide_10|180_aa MEEVRVSWIEDQTSHNIPLSQNLIKALPVFNSVKAERSICVHQCRKDGDYKEGKCCIRKG CWHIFVPPPQNLIPKQTHAFLLSAKSELEKTNSVPVGFPIFGWLAGLPPQPLSRLLSRSW ASVCISEEKTGTVEGKTAVCEMFHVRGKQHIQIPKLSTSSVTRHLHHFRLMQDSQPLDLS >gi568815581f:35964343_36171038|GENSCAN_predicted_CDS_10|543_bp atggaggaagttcgagtgtcctggatagaagatcaaaccagccacaacattcccttaagc caaaacctaatcaaagccctacctgtcttcaactctgtcaaggctgagagaagcatctgt gtgcaccaatgcagaaaagacggggactacaaggaaggaaaatgttgcatcaggaagggc tgctggcacatctttgttccccctccccaaaacctcatccccaagcagacccatgcgttc ctgctctctgccaagtccgaactggagaaaacaaattctgtcccagtggggtttcccatc tttggctggctggctggacttcctccacaaccactctctcgccttctaagcaggagctgg gcttctgtgtgcatcagtgaagagaagacggggactgtggaagggaaaacagcagtctgt gagatgtttcatgtccgaggcaaacagcacattcagatccccaagctctccacctccagt gtgaccaggcacctgcaccacttcaggctcatgcaggactcacagcctttggacctcagc taa >gi568815581f:35964343_36171038|GENSCAN_predicted_peptide_11|282_aa XQRELLHILLAYEEYNPEVGYCRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQAAR APAAIGAHEWADQAQISLGLTLRLWDVYLVEGEQALMPITRIAFKVQQKRLTKTSRCGPW ARFCNRFVDTWARDEDTVLKHLRASMKKLTRKQGDLPPPAKPEQGSSASRPVPASRGRKT LCKGDRQAPPGPPARFPRPIWSASPPRAPRSSTPCPGGAVREDTYPVGTQACRKAGVNAI VNARRRNLTVRPGFSRVARLLGDGCDPEDRAQASVMPGWNEL >gi568815581f:35964343_36171038|GENSCAN_predicted_CDS_11|849_bp nngcagcgggaactactccacatcctcctggcatatgaggagtataacccggaggtgggc tactgcagggacctgagccacatcgccgccttgttcctcctctatcttcctgaggaggat gcattctgggcactggtgcagctgctggccagtgagaggcactccctgcaggctgcccgg gctcctgctgccatcggtgcccacgaatgggccgaccaagcccagatctctctcgggctc accctgcgcctgtgggacgtgtatctggtagaaggcgaacaggcgttgatgccgataaca agaatcgcctttaaggttcagcagaagcgcctcacgaagacgtccaggtgtggcccgtgg gcacgtttttgcaaccggttcgttgatacctgggccagggatgaggacactgtgctcaag catcttagggcctctatgaagaaactaacaagaaagcagggggacctgccacccccagcc aaacccgagcaagggtcgtcggcatccaggcctgtgccggcttcacgtggcaggaagacc ctctgcaagggggacaggcaggcccctccaggcccaccagcccggttcccgcggcccatt tggtcagcttccccgccacgggcacctcgttcttccacaccctgtcctggtggggctgtc cgggaagacacctaccctgtgggcactcaggcgtgccgcaaagcaggcgtcaacgccatt gttaatgcacggaggaggaacctgactgttagacctgggttttccagggttgcacggctt ctgggagacggatgtgaccctgaggacagggcacaggccagtgtaatgccaggatggaat gagctgtga