GENSCAN 1.0 Date run: 5-Nov-116 Time: 16:58:40 Sequence gi568815596r:189957261_190162596 : 205336 bp : 37.54% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 154 149 6 1.05 1.02 Term - 7419 7148 272 1 2 47 48 223 0.805 8.86 1.01 Init - 7461 7443 19 0 1 50 63 12 0.321 -4.87 1.00 Prom - 14211 14172 40 -3.45 2.02 PlyA - 14492 14487 6 1.05 2.01 Sngl - 15983 14727 1257 2 0 34 32 354 0.938 20.00 2.00 Prom - 33785 33746 40 -5.25 3.02 PlyA - 34720 34715 6 1.05 3.01 Sngl - 41038 40712 327 1 0 87 48 160 0.612 7.66 3.00 Prom - 48690 48651 40 -5.85 4.02 PlyA - 50998 50993 6 1.05 4.01 Sngl - 58497 57757 741 0 0 -1 48 451 0.960 28.15 4.00 Prom - 60502 60463 40 -6.25 5.04 PlyA - 60764 60759 6 1.05 5.03 Term - 61775 61539 237 2 0 68 49 165 0.822 5.78 5.02 Intr - 64376 64267 110 0 2 28 115 40 0.136 -0.22 5.01 Init - 75887 75872 16 2 1 83 101 5 0.285 1.87 5.00 Prom - 76345 76306 40 -5.75 6.02 PlyA - 76720 76715 6 1.05 6.01 Sngl - 81884 81600 285 2 0 85 46 577 0.983 46.49 6.00 Prom - 83392 83353 40 -1.35 7.05 PlyA - 84007 84002 6 1.05 7.04 Term - 87381 87227 155 1 2 65 43 168 0.920 7.10 7.03 Intr - 87616 87507 110 0 2 65 84 72 0.916 3.51 7.02 Intr - 88515 88409 107 0 2 31 62 92 0.933 -1.01 7.01 Init - 89424 89299 126 0 0 69 101 92 0.648 8.81 7.00 Prom - 95658 95619 40 -6.05 8.04 PlyA - 95710 95705 6 1.05 8.03 Term - 100378 99998 381 1 0 101 50 211 0.963 12.45 8.02 Intr - 103175 102802 374 0 2 93 119 192 0.966 16.66 8.01 Init - 105336 104964 373 0 1 52 84 399 0.940 33.07 8.00 Prom - 105502 105463 40 -2.25 9.00 Prom + 113885 113924 40 -6.25 9.01 Init + 123044 123113 70 1 1 84 103 20 0.683 4.36 9.02 Term + 128543 128715 173 0 2 71 32 125 0.654 2.21 9.03 PlyA + 130537 130542 6 1.05 10.03 PlyA - 131507 131502 6 1.05 10.02 Term - 135673 135657 17 2 2 90 42 31 0.335 -3.88 10.01 Init - 136717 136657 61 1 1 78 105 42 0.523 6.36 10.00 Prom - 137572 137533 40 -1.05 11.00 Prom + 144439 144478 40 -4.85 11.01 Sngl + 160862 161905 1044 1 0 57 37 282 0.941 16.88 11.02 PlyA + 164349 164354 6 1.05 12.00 Prom + 179928 179967 40 -7.25 12.01 Init + 182780 182821 42 1 0 94 52 44 0.397 1.87 12.02 Intr + 182867 182897 31 1 1 84 105 24 0.824 0.49 12.03 Term + 182944 183221 278 2 2 87 48 163 0.920 6.84 12.04 PlyA + 185593 185598 6 1.05 13.02 PlyA - 186472 186467 6 1.05 13.01 Term - 190926 190621 306 0 0 1 31 246 0.571 4.33 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:189957261_190162596|GENSCAN_predicted_peptide_1|96_aa MFETNIGSHIHNIKVQGEAASADVEATSYPEDLAKIIDEGGYTKQQIFNINKKYFYWKKM PSRTFKAIEAKSVLGNKAFKDRLTLFLRANAAGDLS >gi568815596r:189957261_190162596|GENSCAN_predicted_CDS_1|291_bp atgtttgagactaacataggaagccatatccataacataaaagtacaaggtgaagcagct agtgctgatgtagaagctacaagttatccagaagatctagctaagatcattgatgaaggt ggctacactaaacaacagattttcaatatcaacaaaaaatacttctattggaagaagatg ccatctaggacttttaaagctatagaagcgaagtcagtgcttggaaacaaggctttcaag gataggctgactctctttttaagggctaatgcagctggtgatttaagttga >gi568815596r:189957261_190162596|GENSCAN_predicted_peptide_2|418_aa MIISIDAEKAFDKIQQPFMLKTLNKLGIDGMYLKIIRAIYDKPTANIILNGQKLEAFPLK TSTRQGCPLSPLLFNIVLEVLARAIRQEKEIKDIQLGKEEVKLSLFADDMIVYLEKPIIS APNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQITSELPFTIASKRIKYLGIQLT RDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGRISIMKMAILPKVIYRFNAIPIKLPM TFFAELEKTTSKFIWNQKRARIAKTILSQKNRAGCIMLPDFKLYYKTTAIKTAWYWYQNR DIDQWNRTEPSEIIPHIYNHLIFDKPDKNKKRGKDSLFNKWCWENWLAICRKLKLDPFFT PYTKINSRWIKDLNVRHKTIKTLEENLGNTIQDIGMGKDFMSKHQKQWQQKPKLTNGI >gi568815596r:189957261_190162596|GENSCAN_predicted_CDS_2|1257_bp atgattatctcaatagatgcagaaaaggcctttgacaaaattcaacagccctttatgcta aaaactctcaataaattaggtattgatgggatgtatctcaaaataataagagctatttat gacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaa actagcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagtt ctggccagggcaatcaggcaagagaaagaaataaaggatattcaattaggaaaagaggaa gtcaaattgtccctgtttgcagatgacatgattgtatatttagaaaagcccatcatctca gccccaaatctccttaagctgataagcaactttagcaaggtctcaggatacaaaatcaat gtgcaaaaatcacaagcattcttatacaccaataacagacaaacagagagccaaatcacg agtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttaca agggatgtgaaggacctcttcaaggagaactacaaaccactgctcaacgaaataaaagag gacacaaacaaatggaagaatattccatgctcatggataggaagaatcagtatcatgaaa atggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatg actttctttgcagaattggaaaaaactacttcaaagttcatatggaaccaaaaaagagcc cgcattgccaagacaatcctaagccaaaagaacagagctggatgcatcatgctacctgac ttcaaactatactacaagactacagcaatcaaaacagcatggtactggtaccaaaacaga gatatagatcaatggaaccgaacagaaccctcagaaataataccacacatctacaaccat ctgatctttgacaaacctgacaaaaacaagaaaaggggaaaggattccctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttctttaca ccttatacaaaaattaattcaagatggattaaagacttaaatgttagacataaaaccata aaaaccctagaagaaaacctaggcaatactattcaggacataggaatgggcaaggacttc atgtcaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaa >gi568815596r:189957261_190162596|GENSCAN_predicted_peptide_3|108_aa MQLPFGWTEQHVETDTMDFCSKNYHRNIPGKLREFTDSLKEVHTTGNSVRQTKNCDFPKY ERGKSYLQKHILTGELEKSRLWEDLSFPRAETDLAQNIKVKAAVGRGL >gi568815596r:189957261_190162596|GENSCAN_predicted_CDS_3|327_bp atgcagctcccatttggatggacagaacagcatgtggagactgacaccatggacttttgc tccaagaactaccacaggaacataccaggaaaactgagagaattcacagattctttgaaa gaagtgcacaccactggaaattctgtaagacagacaaaaaactgtgactttccaaagtat gaaagggggaaatcttacctccaaaaacacatactcactggggaactggaaaaatccaga ttatgggaggacttaagctttcctagagctgaaacagatttagcacaaaatataaaagta aaagcagcagtgggaagaggcttgtag >gi568815596r:189957261_190162596|GENSCAN_predicted_peptide_4|246_aa MHSQRDGLKLELIHKREAECKSLKNLQPDHAVEKKNPFSGKEFKLAAEICMSNEEPSVNS QDDGENVSRESQRLSQQPLLSQAQRPTGKQCFHGLSPGLCCCSVQPRDLAPCIPTMAKRG QCTAEAVVSEGISPKPWWLTCGVGPMSAQKPRIEVWEPPYRFQRMYGNAWMSRQKFAAGV ETSRRTSAKAVQKGNVGLEPPQRVLTGALPSGAVRRGPPSSRPQDDRSTDSLHVHVEKLQ TLNASL >gi568815596r:189957261_190162596|GENSCAN_predicted_CDS_4|741_bp atgcattcacaaagagatggtttgaaactggaacttatacataaaagggaagcagagtgt aaaagtttgaaaaatttgcagcctgaccatgcagtagaaaagaaaaacccattttccgga aaagaattcaagctggctgcagaaatttgcatgagtaatgaggagccaagtgttaatagc caagatgatggagaaaatgtctccagggaatctcagaggctttcacagcagcctctccta tcacaggcccagaggcctacagggaaacaatgttttcatgggctgagcccagggctttgc tgctgctctgtgcagcctcgggacttggcaccctgcatcccaaccatggctaaaaggggc caatgtacagctgaggctgttgtttcagagggtataagccccaaaccttggtggcttaca tgtggtgttgggcccatgagtgcacagaagccaagaattgaggtttgggaacctccatat agatttcaaaggatgtatggaaatgcctggatgtccaggcagaagtttgctgcaggggtg gaaacttcacggagaacctctgctaaggcagtgcagaagggaaatgtgggtttggagccc ccacagagagtcctcactggggccctgcctagtggagctgtaagaagagggccaccatcc tccagaccccaggatgacagatcaaccgacagcttgcatgtgcatgtggaaaagctacag acactcaatgccagcctgtga >gi568815596r:189957261_190162596|GENSCAN_predicted_peptide_5|120_aa MAGDKFIGSGFPRKAWAQLKSLDSEGTYSWRLSANCTPCNQMNANSGTTWPENTPYDPSQ SETIVVSSHQLCPIAESSRQPHLNSEQRQWPSHLENPKASSACPGFLSAGPSRITGYMKQ >gi568815596r:189957261_190162596|GENSCAN_predicted_CDS_5|363_bp atggctggggataaattcattggctctggcttcccaagaaaagcatgggctcagctcaaa agcttagattctgaaggcacttacagctggaggctatcagctaactgcactccttgcaat caaatgaatgcaaacagcggtaccacctggccagaaaatacaccctatgacccatcccaa tcagagacaattgtggtgtccagccatcagctctgcccgattgcagaatccagccggcag ccccacctaaactcagagcaaagacagtggcccagccatctagagaacccaaaagcaagc tctgcctgcccagggttcttatcagctggcccatccagaatcacaggctacatgaaacag tga >gi568815596r:189957261_190162596|GENSCAN_predicted_peptide_6|94_aa MEITPLHSSLGDRARLCLKKQTKKEEEEEEEEEEEEEEEEEEEEEEEEEKKERKKKKKEE EEEEEEEEEEEEEEEEEEEEERKKERKKEGGLYK >gi568815596r:189957261_190162596|GENSCAN_predicted_CDS_6|285_bp atggagatcacaccactgcactccagcctgggggacagagcaagactctgtctcaaaaaa caaacaaaaaaagaagaggaagaagaggaagaagaggaagaagaggaagaagaggaagaa gaggaagaggaagaggaagaagaagagaagaaagaaaggaagaagaagaagaaagaagaa gaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaa gaaagaaagaaagaaagaaagaaagagggaggactctacaaataa >gi568815596r:189957261_190162596|GENSCAN_predicted_peptide_7|165_aa MVTDIMNNPLISSLYLEWKDAGASDALSIRLLEWKDVKFQLNERGSIEAAEQRQPAGPTS MAPHKIRPTGLEIPPSHWCLWAGNRPVPPWDKAPRKRDKAAIFAVSQPSLVISPGRSSRS GPPVTPHQSYRASSSLATPWTEPLRATKSLFATASAVELPLTPLD >gi568815596r:189957261_190162596|GENSCAN_predicted_CDS_7|498_bp atggtcacagatattatgaataatcctctcatttcaagtctatatttagaatggaaagat gctggtgcctctgacgcactgagcatcagacttctagaatggaaagatgtcaaatttcag ttaaatgaaagaggctcaattgaggctgctgagcagcgacagcctgcaggccccacttcc atggcacctcacaagataagacccactggcttggaaattccacccagccactggtgcctt tgggccggcaacagacctgtacctccctgggacaaagctcccagaaaaagggacaaagct gccatctttgctgtttcacagccttccctggtaatatctccaggtaggtcctccaggtct gggcctccagtcaccccccaccagagctatcgagccagtagcagcttggcaactccctgg acagagcctctcagggcaactaaaagcctctttgctactgcctctgcagtagaactgccc ttgacacccttggactaa >gi568815596r:189957261_190162596|GENSCAN_predicted_peptide_8|375_aa MQKLQLCVYIYLFMLIVAGPVDLNENSEQKENVEKEGLCNACTWRQNTKSSRIEAIKIQI LSKLRLETAPNISKDVIRQLLPKAPPLRELIDQYDVQRDDSSDGSLEDDDYHATTETIIT MPTESDFLMQVDGKPKCCFFKFSSKIQYNKVVKAQLWIYLRPVETPTTVFVQILRLIKPM KDGTRYTGIRSLKLDMNPGTGIWQSIDVKTVLQNWLKQPESNLGIEIKALDENGHDLAVT FPGPGEDGLNPFLEVKVTDTPKRSRRDFGLDCDEHSTESRCCRYPLTVDFEAFGWDWIIA PKRYKANYCSGECEFVFLQKYPHTHLVHQANPRGSAGPCCTPTKMSPINMLYFNGKEQII YGKIPAMVVDRCGCS >gi568815596r:189957261_190162596|GENSCAN_predicted_CDS_8|1128_bp atgcaaaaactgcaactctgtgtttatatttacctgtttatgctgattgttgctggtcca gtggatctaaatgagaacagtgagcaaaaagaaaatgtggaaaaagaggggctgtgtaat gcatgtacttggagacaaaacactaaatcttcaagaatagaagccattaagatacaaatc ctcagtaaacttcgtctggaaacagctcctaacatcagcaaagatgttataagacaactt ttacccaaagctcctccactccgggaactgattgatcagtatgatgtccagagggatgac agcagcgatggctctttggaagatgacgattatcacgctacaacggaaacaatcattacc atgcctacagagtctgattttctaatgcaagtggatggaaaacccaaatgttgcttcttt aaatttagctctaaaatacaatacaataaagtagtaaaggcccaactatggatatatttg agacccgtcgagactcctacaacagtgtttgtgcaaatcctgagactcatcaaacctatg aaagacggtacaaggtatactggaatccgatctctgaaacttgacatgaacccaggcact ggtatttggcagagcattgatgtgaagacagtgttgcaaaattggctcaaacaacctgaa tccaacttaggcattgaaataaaagctttagatgagaatggtcatgatcttgctgtaacc ttcccaggaccaggagaagatgggctgaatccgtttttagaggtcaaggtaacagacaca ccaaaaagatccagaagggattttggtcttgactgtgatgagcactcaacagaatcacga tgctgtcgttaccctctaactgtggattttgaagcttttggatgggattggattatcgct cctaaaagatataaggccaattactgctctggagagtgtgaatttgtatttttacaaaaa tatcctcatactcatctggtacaccaagcaaaccccagaggttcagcaggcccttgctgt actcccacaaagatgtctccaattaatatgctatattttaatggcaaagaacaaataata tatgggaaaattccagcgatggtagtagaccgctgtgggtgctcatga >gi568815596r:189957261_190162596|GENSCAN_predicted_peptide_9|80_aa MPLMTSYSSGGDRLVNRYVNIEGGASQQQGLRDEVTGAIKYYSGNYEHEGHGKGSACTRN YLKDCLVSCTSGAEIAKKQT >gi568815596r:189957261_190162596|GENSCAN_predicted_CDS_9|243_bp atgcccctgatgacctcatattccagtgggggagacagacttgtaaacagatatgttaac atagaaggtggtgcctctcagcagcaaggcttaagagatgaagtaactggtgcaataaaa tattatagtggaaattatgagcacgaaggccatgggaaaggttctgcttgcactagaaac tacctaaaagactgtcttgtttcttgtaccagtggggctgaaatagccaaaaaacaaacc taa >gi568815596r:189957261_190162596|GENSCAN_predicted_peptide_10|25_aa MCLVTRQMRTEGRPFLAQVGAIDSL >gi568815596r:189957261_190162596|GENSCAN_predicted_CDS_10|78_bp atgtgcttggtcacaagacagatgaggacagagggtaggcccttcttggcccaagttgga gcaatcgactcgctgtaa >gi568815596r:189957261_190162596|GENSCAN_predicted_peptide_11|347_aa MDKFLDTYTLPRLNQEKVESLTRPITGSEIEAIINSLPTKKSPRPDRFTAEFYQRYKEEL VPFLLKLFQSIEKEGILPNSFNEASIILIAKPGRDTTKKENFRPISLMNIDAKILNKILA NQIQQHIKKLIHHDQVGFIPGMQGWFNICKSINVIQHINRTKDKNHMIISIDAEKACDKI QQPFMLKTLNKLGIDGTYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPFSPFLF NILLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDRIVYLENSIVSARNLLKLIGNFSKV SGYKINVQKSQAFLYSNNRQTESQIMSELPFTIASKRTPGFLVFMPV >gi568815596r:189957261_190162596|GENSCAN_predicted_CDS_11|1044_bp atggataaattcctggacacatacaccctcccaagactaaaccaggaaaaagttgaatct cttactagaccaataacaggctctgaaattgaggcaataattaatagcttaccaaccaaa aaaagtccaagaccagacagattcacagccgaattctaccagaggtacaaggaggagctg gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactca tttaatgaggccagcatcatcctgatagcaaagcctggcagagacacaacaaaaaaagag aattttagaccaatatccctgatgaacatcgatgcgaaaatcctcaataaaatactggca aaccaaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccct gggatgcaaggctggttcaacatatgcaaatcaataaacgtaatccagcatataaacaga accaaagacaaaaaccacatgattatctcaatagatgcagaaaaggcctgtgacaaaatt caacagcccttcatgctaaaaactctcaataaattaggtattgatgggacttatctcaaa ataatcagagctatttatgacaaacccacagccaatatcatactgaatgggcaaaaactg gaagcattccctttgaaaacaggcacaagacagggatgccctttctcaccattcctattc aacatactgttggaagttctggccagggcaatcaggcaggagaaagaaataaagggtatt caattaggaaaagaggaagtcaaattgtccctgtttgcagatgacaggattgtatatcta gaaaactccattgtctcagcccgaaatctccttaagctgataggcaacttcagcaaagtc tcaggatacaaaatcaatgtgcaaaaatcacaagcattcttatacagcaataacagacaa acagagagccaaatcatgagtgaactcccattcacgattgcttcaaagagaacaccaggg ttcttggtcttcatgccagtttag >gi568815596r:189957261_190162596|GENSCAN_predicted_peptide_12|116_aa MKGLQTPCKSEIRWVSHPGHTDIRGYTPLLAAFMGCCWVSVAFPDTRCKLSVDLPFWGLE DSGPLLTAPLGSAPVGSQCGGSHTTFPFCTALGEVLHKGSIPAAHYCLDIQASSEI >gi568815596r:189957261_190162596|GENSCAN_predicted_CDS_12|351_bp atgaaggggctacagaccccatgcaagtctgaaatccgatgggtctcacatccaggtcac actgatataagaggttatactcccctcctggctgctttcatgggctgctgttgggtgtct gtggcttttccagatacacggtgcaagctctctgtggatctaccattctggggtctagag gacagtggtcctcttctcacagctccactaggcagtgccccagtgggatctcagtgtggg ggctcccacaccacatttcccttctgcactgccctaggagaggttctccacaagggctcc attcctgcagcacactactgcttggacatccaggcatcttctgaaatctag >gi568815596r:189957261_190162596|GENSCAN_predicted_peptide_13|101_aa QTRDILSVIKAIYDKATANIILNGEKLKALPLRTGRRQGCPPSPLVFNTVLEVLARAIRQ EKEIKGIHVGKEEVKLSLFADDMIVYLENPKDSSRKLLEQK >gi568815596r:189957261_190162596|GENSCAN_predicted_CDS_13|306_bp caaacaagggacatcctcagtgtaataaaagccatctatgacaaagccacagccaacata atactgaatggggaaaagttgaaagcattacctctgagaactggaagaagacaaggatgc ccaccctcaccactcgtcttcaacacagtactggaagtcctagccagagcaatcagacaa gagaaagaaataaagggcatccacgttggtaaagaggaagtcaaactgtcactgtttgct gatgatatgatcgtttaccttgaaaaccctaaggactcctccagaaagctcctagaacaa aaataa