GENSCAN 1.0 Date run: 4-Nov-116 Time: 23:22:55 Sequence gi568815588r:88835226_89048930 : 213705 bp : 39.02% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 1012 1051 40 -3.55 1.01 Init + 5941 6075 135 0 0 74 51 88 0.554 3.89 1.02 Intr + 6228 6365 138 2 0 12 106 79 0.570 1.84 1.03 Term + 16417 16644 228 0 0 65 35 172 0.774 5.25 1.04 PlyA + 16963 16968 6 1.05 2.04 PlyA - 17117 17112 6 1.05 2.03 Term - 20593 20565 29 2 2 105 49 35 0.063 -1.44 2.02 Intr - 30216 30143 74 2 2 43 101 91 0.313 4.03 2.01 Init - 32978 32869 110 2 2 32 103 99 0.289 5.55 2.00 Prom - 36824 36785 40 -6.45 3.00 Prom + 40647 40686 40 -7.35 3.01 Init + 41935 42224 290 0 2 70 73 253 0.232 18.63 3.02 Intr + 44887 45413 527 1 2 14 75 419 0.196 24.86 3.03 Intr + 47237 47397 161 0 2 104 68 71 0.283 5.49 3.04 Intr + 57159 57259 101 2 2 47 53 51 0.052 -4.51 3.05 Intr + 63139 63241 103 1 1 31 93 115 0.471 5.56 3.06 Intr + 70218 70435 218 2 2 74 111 172 0.411 14.48 3.07 Intr + 73477 73552 76 1 1 93 67 70 0.997 4.00 3.08 Intr + 75691 75786 96 0 0 25 106 81 0.880 2.89 3.09 Intr + 77876 78233 358 1 1 65 89 384 0.999 30.10 3.10 Intr + 79309 79433 125 2 2 104 64 59 0.884 4.48 3.11 Intr + 81455 81592 138 1 0 91 65 113 0.994 9.04 3.12 Term + 97606 97986 381 0 0 51 51 191 0.027 5.55 3.13 PlyA + 99022 99027 6 1.05 4.10 PlyA - 99875 99870 6 1.05 4.09 Term - 100141 99998 144 1 0 83 41 134 0.932 5.13 4.08 Intr - 103017 102836 182 1 2 100 79 220 0.647 20.97 4.07 Intr - 104473 104282 192 2 0 58 74 293 0.921 23.44 4.06 Intr - 106165 106004 162 2 0 91 81 224 0.992 21.03 4.05 Intr - 106644 106560 85 0 1 106 82 63 0.943 5.97 4.04 Intr - 108682 108572 111 1 0 57 108 115 0.991 10.06 4.03 Intr - 112161 112033 129 0 0 76 86 143 0.973 12.87 4.02 Intr - 113728 113577 152 2 2 87 94 151 0.428 14.56 4.01 Init - 120358 120223 136 1 1 42 81 89 0.040 3.95 4.00 Prom - 146216 146177 40 -3.95 5.00 Prom + 149805 149844 40 -2.55 5.01 Init + 164597 164639 43 1 1 52 115 7 0.700 0.43 5.02 Intr + 167600 167730 131 0 2 58 55 137 0.764 6.89 5.03 Intr + 167804 167969 166 1 1 108 91 127 0.999 13.61 5.04 Intr + 172475 172612 138 0 0 113 95 132 0.999 15.91 5.05 Intr + 173664 173772 109 1 1 63 108 37 0.914 1.62 5.06 Intr + 175314 175375 62 0 2 106 86 85 0.943 7.56 5.07 Intr + 176774 176872 99 0 0 100 37 61 0.466 1.36 5.08 Term + 178894 179225 332 2 2 89 40 352 0.993 24.33 5.09 PlyA + 180041 180046 6 -0.45 6.02 PlyA - 180841 180836 6 1.05 6.01 Sngl - 183725 183312 414 2 0 76 37 265 0.349 14.28 6.00 Prom - 193711 193672 40 -4.55 7.00 Prom + 194613 194652 40 -6.85 7.01 Init + 194931 194935 5 2 2 107 74 0 0.032 0.02 7.02 Intr + 204174 204201 28 0 1 95 74 16 0.073 -1.90 7.03 Term + 206867 207337 471 1 0 82 43 223 0.575 11.14 7.04 PlyA + 211048 211053 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:88835226_89048930|GENSCAN_predicted_peptide_1|166_aa MRTRGKQKILHLQRGVRVEEEWVLAARKWQIRSPDIPLLPQTTTEFQPTKERATNSELSK IGFHKPVQADSSQLQPAPIHQQQELAYISVKLLESSGGISDEYQNPSWLRGKSLEVQAGG KLQECFSSKHLSVLFQQLRQQHTFQRAQPNETKALVSCVTTQACII >gi568815588r:88835226_89048930|GENSCAN_predicted_CDS_1|501_bp atgaggaccaggggaaagcagaagatccttcacctccagcgaggagtaagggtggaggaa gagtgggttctggctgctagaaaatggcaaatcagaagccctgatattcccctactacct cagaccaccacagagtttcaacctacaaaagagagagcaacgaactcagagttgagcaaa attggctttcacaaaccagttcaagctgactcaagccagctccagcctgctccaatacac cagcaacaggaactggcatatatttccgttaaactccttgaatcttctggaggtatttct gatgaatatcaaaatccttcctggctgagaggaaagtccctagaagtccaagctggtggc aagctccaggagtgcttttctagcaaacacctgtcagtgcttttccagcagctcaggcag cagcacaccttccagagagcccagcccaatgaaacaaaggcactggtgtcatgtgtaacg acccaagcctgtatcatctaa >gi568815588r:88835226_89048930|GENSCAN_predicted_peptide_2|70_aa MLSALEHRTPSSSVLGLGLALLSPLLADILLWELVIMWDRATTTWDSKKVERLRMGILLE DDPLSVQQPE >gi568815588r:88835226_89048930|GENSCAN_predicted_CDS_2|213_bp atgctttctgccctcgaacatcggactccaagttcttcagttttgggactcggactggct ctcctttctcctctgcttgcagacatcctgttgtgggaacttgtgatcatgtgggaccga gccactactacctgggacagcaagaaggtggaaagactgaggatgggaattctcttagag gatgatccactctctgtacaacagccagagtga >gi568815588r:88835226_89048930|GENSCAN_predicted_peptide_3|857_aa MNPESIKFTNLNMIIHIQSNMLATVMEILKGVAEGNVSRFVERREFSEELLAKVRKKGKV ALVFVARFHEMYGKLHIVVQVTTYTLDVLLCDIRLKMGPASFHLRAFSAECGKPPPPPQR RGRRQRRRRHPGAGAGEAGTIAVTIYFPAAAAAGTVAAGSGSAGLRVGLRGRCSSPALGC PTARRRTTHGARATGTSKAPSVGRGEGGGGTAEANERTPRRRVPVSPRCNAFQQKPPRPG LQQPDGCQGHTAATGAAVAVAVPHGLRTPRVARPFPAVRIYSYNPRNEIAFYKKRKLMIL AQNMRSRRNVRENNVDSVQEVLGAFLHTPPPPSHCNAGKLSRNHSQREHGIQIRGMSALG LPDRIVGNLDHILRATELENIPVYVDLAIVYNATKKLAAMPDHTDVSLSPEERVRALSKL GCNITISEDITPRRYFRSGVEMERMASVYLEEGNLENAFVLYNKFITLFVEKLPNHRDYQ QCAVPEKQDIMKKLKEIAFPRTDELKNDLLKKYNVEYQEYLQSKNKYKAEILKKLEHQRL IEAERKRIAQMRQQQLESEQFLFFEDQLKKQELARGQMRSQQTSGLSEQIDGSALSCFST HQNNSLLNVFADQPNKSDATNYASHSPPVNRALTPAATLSAVQNLVVEGLRCVVLPEDLC HKFLQLAESNTVRGIETCGILCGKLTHNEFTITHVIVPKQSAGPDYCDMENVEELFNVQD QHDLLTLGWIHELMPPGGKYEVQQCNRPVLLQGNPVEMTAMVGDFCLNYESDLVASICTL ILKFKSSGYEVPLHGKHEFAAGRFPSIIFQWFLNFDVHKNHQGNLIEVQVSSFSSRDSSS AGVRCDPGLYPFKMSFR >gi568815588r:88835226_89048930|GENSCAN_predicted_CDS_3|2574_bp atgaatccagaaagcataaagtttactaatttaaacatgatcatccatatccagtcaaat atgttagcaactgtgatggagatactaaagggcgttgcagaaggaaatgtatcaagattc gttgaaagacgtgagttctcagaggaactgctggccaaagtgaggaaaaaagggaaagtt gcgcttgtctttgtggccagatttcatgagatgtatggaaaactgcacatcgttgtccag gtcaccacttacactttggatgttctgctctgcgacatccgtttaaaaatgggccctgct tccttccacctgcgagctttttctgcagaatgcgggaagccgccgccgccgccacagagg agggggcggaggcagaggcggaggcggcacccaggggccggggcaggggaggccgggacc atcgcagtgacaatttattttcctgcagcagcggcagcagggacggttgctgcaggttcg gggtcggccggcctgcgcgtgggcttgcgaggacgctgttcgtcccctgcgctggggtgt ccgacagcgaggaggagaacgacgcacggagcccgcgcgactggaaccagcaaagctcca tctgtcggcagaggagaagggggaggaggcacggccgaggcaaacgagcggacgcctcgt cgccgggtgccggtatcaccccgctgcaacgccttccagcaaaagccaccgcggcccggg ttgcagcagccggacggatgccaaggccacacggcagccacgggggcagccgtcgcagtc gccgtcccacacgggctgcggacaccaagggttgctagacccttcccagctgtaagaatt tacagctacaatccaagaaatgagattgcattttacaagaaaagaaaactgatgattctt gcccaaaacatgagaagtagaaggaatgtgagagaaaataatgtagattccgtccaggaa gtgctaggtgctttcctgcacacccctcctccaccaagtcattgtaatgcaggcaagtta tccaggaaccactcccagagggaacatggcatccaaatcagggggatgtcagctctagga ctccctgacaggatcgtggggaacctggaccacattttgagagccactgaactagaaaac atacctgtttatgttgatcttgcaattgtttataatgcaactaaaaagttagctgctatg cctgaccatacagatgtttccctaagcccagaagagcgagtccgtgccctaagcaagctt ggttgtaatatcaccatcagtgaagacatcactccacgacgttactttaggtctggagta gagatggagaggatggcgtctgtgtatttggaagaaggaaatttggaaaatgcctttgtt ctttataataaatttataaccttatttgtagaaaagcttcctaaccatcgagattaccag caatgtgcagtacctgaaaagcaggatattatgaagaaactgaaggagattgcattccca aggacagatgaattgaaaaacgaccttttaaagaaatataacgtagaataccaagaatat ttgcaaagcaaaaacaaatataaagctgaaattctcaaaaaattggagcatcagagattg atagaggcagaaaggaagcggattgctcagatgcgccagcagcagctagaatcggagcag tttctgtttttcgaagatcaactcaagaagcaagagttagcccgaggtcaaatgcgaagt cagcaaacctcagggctgtcagagcagattgatgggagcgctttgtcctgcttttccaca caccagaacaattccttgctgaatgtatttgcagatcaacctaataaaagtgatgcaacc aattatgctagccactctcctcctgtaaacagggccttaacgccagctgctactctaagt gctgttcagaatttagtggttgaaggactgcgatgtgtagttttgccagaagatctttgc cacaaatttctgcaactggcagaatctaatacagtgagaggaatagaaacctgtggaata ctctgtggaaaactgacacataatgaatttactattacccatgtaattgtgccaaagcag tctgcgggaccagactattgtgacatggagaatgtagaggaattattcaatgttcaggat caacatgatctcctcactctaggatggatccatgagctcatgccgcctggtggtaaatat gaagtacagcagtgcaacagaccagttttactccaaggaaaccctgtagagatgacagca atggttggtgatttctgcctcaattatgaaagtgatctggtggcaagtatctgcacacta atccttaagtttaaatctagtggctatgaggtgcccctgcatggcaagcatgaatttgca gctggaagatttccctcgatcatattccagtggttcttaaattttgatgtacataagaat caccagggaaatctgattgaagttcaggtttccagcttcagctccagagattctagctca gcgggtgttagatgtgacccagggctctacccatttaaaatgagcttccgatga >gi568815588r:88835226_89048930|GENSCAN_predicted_peptide_4|430_aa MRNRAEPGKSLQMWQDGHCASVDSMPLSSWAILGELFVTLLGNSKESCEAAPAMCEEEDS TALVCDNGSGLCKAGFAGDDAPRAVFPSIVGRPRHQGVMVGMGQKDSYVGDEAQSKRGIL TLKYPIEHGIITNWDDMEKIWHHSFYNELRVAPEEHPTLLTEAPLNPKANREKMTQIMFE TFNVPAMYVAIQAVLSLYASGRTTGIVLDSGDGVTHNVPIYEGYALPHAIMRLDLAGRDL TDYLMKILTERGYSFVTTAEREIVRDIKEKLCYVALDFENEMATAASSSSLEKSYELPDG QVITIGNERFRCPETLFQPSFIGMESAGIHETTYNSIMKCDIDIRKDLYANNVLSGGTTM YPGIADRMQKEITALAPSTMKIKIIAPPERKYSVWIGGSILASLSTFQQMWISKQEYDEA GPSIVHRKCF >gi568815588r:88835226_89048930|GENSCAN_predicted_CDS_4|1293_bp atgaggaacagagcagagcctggcaaatccctgcagatgtggcaggatggccattgtgcc agcgtagactccatgcctttgtcttcatgggctatcttaggagaactgtttgtgactctc ctggggaattccaaagaatcctgtgaagcagctccagctatgtgtgaagaagaggacagc actgccttggtgtgtgacaatggctctgggctctgtaaggccggctttgctggggacgat gctcccagggctgttttcccatccattgtgggacgtcccagacatcagggggtgatggtg ggaatgggacaaaaagacagctacgtgggtgacgaagcacagagcaaaagaggaatcctg accctgaagtacccgatagaacatggcatcatcaccaactgggacgacatggaaaagatc tggcaccactctttctacaatgagcttcgtgttgcccctgaagagcatcccaccctgctc acggaggcacccctgaaccccaaggccaaccgggagaaaatgactcaaattatgtttgag actttcaatgtcccagccatgtatgtggctatccaggcggtgctgtctctctatgcctct ggacgcacaactggcatcgtgctggactctggagatggtgtcacccacaatgtccccatc tatgagggctatgccttgccccatgccatcatgcgtctggatctggctggccgagatctc actgactacctcatgaagatcctgactgagcgtggctattccttcgttactactgctgag cgtgagattgtccgggacatcaaggagaaactgtgttatgtagctctggactttgaaaat gagatggccactgccgcatcctcatcctcccttgagaagagttacgagttgcctgatggg caagtgatcaccatcggaaatgaacgtttccgctgcccagagaccctgttccagccatcc ttcatcgggatggagtctgctggcatccatgaaaccacctacaacagcatcatgaagtgt gatattgacatcaggaaggacctctatgctaacaatgtcctatcagggggcaccactatg taccctggcattgccgaccgaatgcagaaggagatcacggccctagcacccagcaccatg aagatcaagatcattgcccctccggagcgcaaatactctgtctggatcggtggctccatc ctggcctctctgtccaccttccagcagatgtggatcagcaaacaggaatacgatgaagcc gggccttccattgtccaccgcaaatgcttctaa >gi568815588r:88835226_89048930|GENSCAN_predicted_peptide_5|359_aa MGIYQGVVNKVHFHVWFEEPEIQTAIQVTCCFLGERNLKDSGALTLSLPVHSRYCQFWVL TSVARLSSKSVNAQVTDINSKGLELRKTVTTVETQNLEGLHHDGQFCHKPCPPGERKARD CTVNGDEPDCVPCQEGKEYTDKAHFSSKCRRCRLCDEGHGLEVEINCTRTQNTKCRCKPN FFCNSTVCEHCDPCTKCEHGIIKECTLTSNTKCKEEVKRKEVQKTCRKHRKENQGSHESP TLNPVGIEIDVDLSKYITTIAGVMTLSQVKGFVRKNGVNEAKIDEIKNDNVQDTAEQKVQ LLRNWHQLHGKKEAYDTLIKDLKKANLCTLAEKIQTIILKDITSDSENSNFRNEIQSLV >gi568815588r:88835226_89048930|GENSCAN_predicted_CDS_5|1080_bp atgggaatctatcagggtgtggttaataaagtacatttccatgtgtggtttgaagaacct gagatccaaactgctatacaagtgacctgctgctttcttggagagagaaatctgaaagac agtggagccctcacattgtctttgcctgtgcacagcagatactgccaattttgggttctt acgtctgttgctagattatcgtccaaaagtgttaatgcccaagtgactgacatcaactcc aagggattggaattgaggaagactgttactacagttgagactcagaacttggaaggcctg catcatgatggccaattctgccataagccctgtcctccaggtgaaaggaaagctagggac tgcacagtcaatggggatgaaccagactgcgtgccctgccaagaagggaaggagtacaca gacaaagcccatttttcttccaaatgcagaagatgtagattgtgtgatgaaggacatggc ttagaagtggaaataaactgcacccggacccagaataccaagtgcagatgtaaaccaaac tttttttgtaactctactgtatgtgaacactgtgacccttgcaccaaatgtgaacatgga atcatcaaggaatgcacactcaccagcaacaccaagtgcaaagaggaagtgaagagaaag gaagtacagaaaacatgcagaaagcacagaaaggaaaaccaaggttctcatgaatctcca actttaaatcctgtaggtattgaaatagatgttgacttgagtaaatatatcaccactatt gctggagtcatgacactaagtcaagttaaaggctttgttcgaaagaatggtgtcaatgaa gccaaaatagatgagatcaagaatgacaatgtccaagacacagcagaacagaaagttcaa ctgcttcgtaattggcatcaacttcatggaaagaaagaagcgtatgacacattgattaaa gatctcaaaaaagccaatctttgtactcttgcagagaaaattcagactatcatcctcaag gacattactagtgactcagaaaattcaaacttcagaaatgaaatccaaagcttggtctag >gi568815588r:88835226_89048930|GENSCAN_predicted_peptide_6|137_aa MGGNRGCTWCLWASMSSRWAWAQQTLHWERPASPAGPGAVRGLAPGPAAAVLDFSPGLSC LPMGQGSGPAAHHAQASPTITAPCSKAPSRIDHPRAEECGSMARDWQAAPPAAPVQDPRG EASWASESGRDLGNLYV >gi568815588r:88835226_89048930|GENSCAN_predicted_CDS_6|414_bp atgggcgggaaccggggctgcacatggtgcttgtgggccagcatgagttccaggtgggcg tgggctcagcagaccctgcactgggagcggccagccagccccgccggccctggggcagtg aggggcttagcacctgggccagcagctgctgtgcttgacttctcaccaggccttagctgc ctccccatggggcagggctcgggacctgcagcccaccatgcccaagcctccccaacgatc accgccccctgctctaaggctcccagtcgcattgaccatccaagggctgaggagtgcggg agcatggcgcgggactggcaggcagctccacctgcagccccggtgcaggatccacggggt gaagccagctgggcttctgagtctggtagggacttagggaacctttatgtctag >gi568815588r:88835226_89048930|GENSCAN_predicted_peptide_7|167_aa MGFWIPHGSILAHGLYNQQVEDQPGLCLSLQDSEFLPALRGSGDTFKSQGLETGTLGIYL ALYSTLAELAPKLQDNALPTLPCHFLKQDSPWPPPLRPMASIAWHRQCSLKAQGLFSHYL VNAARPGTLFREVGSPLTQGSCRNAIQEPKPKVGDLRSPLGALPHCG >gi568815588r:88835226_89048930|GENSCAN_predicted_CDS_7|504_bp atggggttttggattcctcatggttcaattttggcccatgggctctacaatcagcaggtg gaagaccagccaggcttgtgtctttcccttcaggacagtgagttcctcccagccctgcgt ggatctggagatactttcaagagccaaggcctggagacaggaactttaggaatctacctg gcactctattctactctggctgagctggcacctaagctgcaagacaatgccctccccact cttccctgccatttcctcaaacaggactccccatggccaccaccactcaggcccatggca agtattgcctggcaccgccaatgttcactcaaggcccaaggactcttcagtcattatttg gtgaatgctgccaggcctgggactctctttagggaagtgggctcccctctcacccagggc agttgcagaaatgccatccaagaaccaaagccaaaagttggggacctcaggagtccactt ggtgctctaccccactgtggctaa