GENSCAN 1.0    Date run: 5-Nov-116    Time: 01:30:59

Sequence gi568815590r:100820696_101048889 : 228194 bp : 43.91% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   9311   9385   75  1  0   80  105    49 0.734   5.51
 1.02 Term +  15263  15361   99  1  0  106   55    47 0.615   1.33
 1.03 PlyA +  16283  16288    6                               1.05

 2.00 Prom +  21564  21603   40                              -2.56
 2.01 Init +  31020  31071   52  2  1  100  105    -1 0.350   4.17
 2.02 Term +  32824  33509  686  2  2  -56   43   291 0.157   4.00
 2.03 PlyA +  33529  33534    6                               1.05

 3.00 Prom +  34063  34102   40                              -4.96
 3.01 Sngl +  35014  36396 1383  0  0   70   38   484 0.972  37.70
 3.02 PlyA +  36523  36528    6                               1.05

 4.00 Prom +  37287  37326   40                              -2.46
 4.01 Init +  37411  37543  133  0  1   78   47    64 0.225   1.60
 4.02 Intr +  39626  39890  265  0  1   34  115    50 0.026  -1.13
 4.03 Intr +  41850  41977  128  0  2   98  109   -21 0.026   1.32
 4.04 Term +  53487  53515   29  1  2  142   48    22 0.191   1.84
 4.05 PlyA +  54324  54329    6                               1.05

 5.00 Prom +  56748  56787   40                              -2.86
 5.01 Init +  75076  75419  344  0  2   75   56   381 0.126  30.31
 5.02 Intr +  84016  84075   60  1  0  111   97    65 0.705   7.55
 5.03 Intr +  86702  86820  119  2  2   35   53    59 0.218  -2.79
 5.04 Intr +  86898  87043  146  1  2   45  101    59 0.505   2.90
 5.05 Intr +  87088  87166   79  0  1   77   84    27 0.632   0.32
 5.06 Intr +  87995  88169  175  0  1   63   84    90 0.614   5.10
 5.07 Term +  89333  89480  148  2  1   14   43   119 0.477  -2.63
 5.08 PlyA +  90550  90555    6                               1.05

 6.11 PlyA -  90713  90708    6                               1.05
 6.10 Term - 100057  99998   60  1  0   77   41    56 0.557  -2.40
 6.09 Intr - 103355 103260   96  2  0   62   98    88 0.942   7.41
 6.08 Intr - 103603 103440  164  2  2   70   45   112 0.993   4.79
 6.07 Intr - 104344 104221  124  1  1   76   91   142 0.916  13.46
 6.06 Intr - 128205 127901  305  1  2  116   85   251 0.628  23.91
 6.05 Intr - 130143 130066   78  1  0   91   49    59 0.265   1.82
 6.04 Intr - 138642 138544   99  1  0   77   53    62 0.274   1.68
 6.03 Intr - 161496 161378  119  2  2   80   80     6 0.068  -0.89
 6.02 Intr - 195665 195528  138  1  0   72   91    83 0.336   6.58
 6.01 Init - 197855 197797   59  2  2   46   80    53 0.490   1.08
 6.00 Prom - 199420 199381   40                              -2.46

 7.02 PlyA - 203490 203485    6                               1.05
 7.01 Sngl - 206507 206259  249  2  0   42   39   219 0.719   7.59

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590r:100820696_101048889|GENSCAN_predicted_peptide_1|57_aa
IGVPNPLAKDRYCLWPVRKQATQQKGNNSLSPHSGNIWVLDHEAGEGHMAAETIVII
>gi568815590r:100820696_101048889|GENSCAN_predicted_CDS_1|174_bp
atcggggtgcccaatcccctggccaaggaccggtactgtctgtggcctgttaggaagcag
gccacacagcagaaggggaacaatagtctcagtcctcattctggcaacatttgggtactg
gaccatgaggcaggtgagggacacatggcagcggagaccatcgtcattatttga
>gi568815590r:100820696_101048889|GENSCAN_predicted_peptide_2|245_aa
MDEGLLSRRASVYKMLGGGSSTNGKEVKNFEKKLDEWITRITNAEKSLKDLIELKTKARE
LRDKCRRLSSQCDQLEERVSVMEDEMNEMKQEEKFREKRIKRNEQSLQEIWDYVKRPNLH
LIGVSESDGENGTKLENTLQDIIQENFLNLARQANIQIQEIQRTPQRYFSRRATPRHIII
RFTKVEMKEKMLRAAREKGWVTHKGKPIRLTADLSAETLQARREWGPIFNILKRIFNSEF
HIQPN
>gi568815590r:100820696_101048889|GENSCAN_predicted_CDS_2|738_bp
atggatgaggggcttctgagcaggagggccagtgtgtacaaaatgctgggaggaggaagt
tcgaccaatggcaaagaagttaaaaactttgaaaaaaaattagacgaatggataactaga
ataactaatgcagagaagtccttaaaggacctgatagagctgaaaaccaaggcacgagaa
ctacgtgacaaatgcagaaggctcagtagccaatgtgatcaactggaagaaagggtatca
gtgatggaagatgaaatgaatgaaatgaagcaagaagagaagtttagagaaaaaagaata
aaaagaaacgaacaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacat
ctcattggtgtatctgaaagtgacggggagaatggaaccaagttggaaaacactctgcag
gatattatccaggagaacttcctcaatctagcaaggcaggccaacattcagattcaggaa
atacagagaacgccacaaagatacttctcgagaagagcaactccaagacacataattatc
agattcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaagggtgg
gttacccacaaagggaagcccatcagactaacagctgatctctcagcagaaactctacaa
gccagaagagagtgggggccaatattcaacattcttaaaagaattttcaactcagaattt
catatccagccaaactaa
>gi568815590r:100820696_101048889|GENSCAN_predicted_peptide_3|460_aa
MDKFLDTYTLPRLNQEEVESLNRPITGSEIEVIINSLPTKKSPGPDGFTAEFYQRYKEEL
VPFLLKLFQSIEKEGILPNSFYEANIILIPKPGRDTTKKENFRPISLMNIDAKILSKILA
NRIQQHIKKLIHHDQVGFIPGMQGWFNICKSINVIQHINRTNDKNHMIVSIDAEKAFDKI
QQPFMLKTLNKLGIDGTYLKIRAIYDKPTANIILNGQNLEAFPLKTGTRQGCPLSPLLFN
MVLEFLARAIRQEKEIKGIQLGKEEVKLPLFADDMTVYLENPIVSAQNLLKLISNFSKVS
GYKINVQKSQAFLYTNNRQTESQIMSELQFTIASKRIKYLGIQLTRHVKDLFKESYKPLL
KEIKENTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIW
NQKRAHIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWY
>gi568815590r:100820696_101048889|GENSCAN_predicted_CDS_3|1383_bp
atggataaattcctcgacacatacactctcccaagactaaaccaggaagaagttgaatct
ctgaatagaccaataacaggctctgaaattgaggtaataattaatagcttaccaaccaaa
aaaagtccaggaccagatggattcacagctgaattctaccagaggtacaaggaggagctg
gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactca
ttttatgaggccaacatcatcctgataccaaagcctggcagagacacaacaaaaaaagag
aattttagaccaatatccctgatgaacatcgatgcaaaaatcctcagtaaaatactggca
aaccgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccct
gggatgcaaggctggttcaacatatgcaaatcaataaatgtaatccagcatataaacaga
accaacgacaaaaaccacatgattgtctcaatagatgcagaaaaggcctttgacaaaatt
caacagcccttcatgctaaaaactctcaataaattaggtattgatgggacgtatctcaaa
ataagagctatttatgacaaacccacagccaatatcatactgaatgggcaaaacctggaa
gcattccctttgaaaactggcacaagacagggatgccctctctcaccactcctattcaac
atggtgttggaatttctggccagggcaatcaggcaggagaaagaaataaagggtattcaa
ttaggaaaagaggaagtcaaattgcccctgtttgcagatgacatgactgtatatctagaa
aaccccattgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtctca
ggatacaaaatcaatgtgcaaaaatcacaagcattcttatacaccaataacagacaaaca
gagagccaaatcatgagtgaactccaattcacaattgcttcaaagagaataaaataccta
ggaatccaacttacaaggcatgtgaaggacctcttcaaggagagctacaaaccactgctc
aaggaaataaaagagaatacaaacaaatggaagaacattccatgctcatgggtaggaaga
atcaatatcgtgaaaatggccatactgcccaaggtaatttatagattcaatgccatcccc
atcaagctaccaatgactttcttcacagaattggaaaaaactactttaaagttcatatgg
aaccaaaaaagagcccacattgccaagtcaatcctaagccaaaagaacaaagctggaggc
atcacgctacctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtac
tag
>gi568815590r:100820696_101048889|GENSCAN_predicted_peptide_4|184_aa
MEYYAAIKNDEFMSFVGTWMKLETIILSKLSQRQKTKHRMFSLIAGPGRTQGLNLLHCGA
GDPPESGRQFPEDRQEAGLQAGQPSCGRALHGLSWDSSDICCTIPLMSKKTTYIPLSLWG
NYLSALSTVWWDRLDWGICSGSRTGEDGGVDSVEVPFTLVYFLSKCEGYKCNSGAGEFCT
IQST
>gi568815590r:100820696_101048889|GENSCAN_predicted_CDS_4|555_bp
atggaatactatgcagccataaaaaatgatgagttcatgtcctttgtagggacatggatg
aagctggaaaccatcattctcagcaaactatcgcaaagacaaaaaaccaaacaccgcatg
ttctcgcttatagcagggccaggcaggacccagggtctgaacctcctccactgtggagcc
ggagaccccccggagtcaggccgtcagttcccagaagacagacaggaggccgggctgcag
gcggggcagcccagctgtggccgcgccctccacggcctgagctgggactcaagtgacatc
tgttgtactatcccgttgatgtctaaaaaaacaacttacatcccgctttcactctggggg
aattacctgtctgcactgagcacagtctggtgggacaggctggactggggaatatgctca
ggttcaaggacaggtgaagatggtggcgtagactcagtagaagttccttttactttggtt
tacttcctaagtaagtgtgaaggatacaaatgcaattctggtgcaggggaattctgcaca
atccagtccacatga
>gi568815590r:100820696_101048889|GENSCAN_predicted_peptide_5|356_aa
MTKKRRNNGRAKKGRGHVQPIRCTNCARCVPKDKAIKKFVIRNIVEAAAVRDISEVSVFD
AYVLPKLYVKLHYCVSCAIHSKVVRNRSREARKDRTPPPRFRPAGAAPRPPPKPMLRVFV
VTDTGFRGNRDDMNKGKPGAEQQGMCEQAQGLATVHSQVYGLPGVWAVLERTVPGTGQRG
GTEKLGDNRNCRAPKRVSQTWLGELLGLGSLKGSSSLLVTCNMLFQSRHSAGPSSCRKSR
KNEVYGKLEAPWSLEPQPRLPGCSCCIHSSRFRRAATAISFSWPPGSDEAWDFLPCTPQN
KKPPYCSLLATFQELHSPTWRLAAAVLDSTVLQLLICDLFCALVLKLDCTGIIRGA
>gi568815590r:100820696_101048889|GENSCAN_predicted_CDS_5|1071_bp
atgacaaagaaaagaaggaacaatggtcgtgccaaaaagggccgcggccacgtgcagcct
attcgctgcactaactgtgcccgatgcgtgcccaaggacaaggccattaagaaattcgtc
attcgaaacatagtggaggccgcagcagtcagggacatttctgaagtgagcgtcttcgat
gcctatgtgcttcccaagctgtatgtgaagctacattactgtgtgagttgtgcaattcac
agcaaagtagtcaggaatcgatctcgtgaagcccgcaaggaccgaacacccccaccccga
tttagacctgcgggtgctgccccacgtcccccaccaaagcccatgctaagagtctttgta
gtgacagacactggattccgaggaaatagagatgacatgaataagggcaagccaggtgct
gagcagcaagggatgtgtgagcaagcacagggtctggccactgtgcacagccaggtgtat
gggctgccaggtgtatgggctgtgctggagaggacagttccaggcactgggcaacgtggt
ggcaccgagaagcttggagacaacaggaactgcagagccccaaagagggtgtcacagacc
tggctcggggagctcctaggtctgggctccctaaagggtagcagttctctcctagtcacc
tgcaacatgctctttcagtcccgccattcagcgggtccgagttcttgtcgcaagtccagg
aagaatgaggtatatggaaaactggaggctccgtggagcctggagccccagccgcgcctc
cctggctgcagctgctgtattcacagcagccgcttcaggcgggccgccacggcgatcagt
ttttcatggcctccaggttctgatgaagcgtgggattttctgccctgtacccctcagaac
aagaaaccaccttactgcagtcttctggccacatttcaagagctccatagccccacgtgg
cggctagcagctgctgtattggacagtacagtcctacagctcctcatctgtgacctgttt
tgtgctctggttctcaaactcgactgcactggaatcatcagaggagcttaa
>gi568815590r:100820696_101048889|GENSCAN_predicted_peptide_6|413_aa
MSFNIEKGQGTSGKIIYVDSMHLDFPSLGLLICEMEMIKEQLQKVVVKMKMEIKNAKHLA
QEPGVWITCCERSHVVRTFKQAWKAPRGKERRPLASSLHPFSSPVDNWTIGSGEDLVMAS
AQQAINVNVSWIPEFGQETVGPLETLYTVTGADRWRGRRWVGEREHPVMDKNELVQKAKL
AEQAERYDDMAACMKSVTEQGAELSNEERNLLSVAYKNVVGARRSSWRVVSSIEQKTEGA
EKKQQMAREYREKIETELRDICNDVLSLLEKFLIPNASQAESKVFYLKMKGDYYRYLAEV
AAGDDKKGIVDQSQQAYQEAFEISKKEMQPTHPIRLGLALNFSVFYYEILNSPEKACSLA
KTAFDEAIAELDTLSEESYKDSTLIMQLLRDNLTLWTSDTQGDEAEAGEGGEN
>gi568815590r:100820696_101048889|GENSCAN_predicted_CDS_6|1242_bp
atgagcttcaatattgaaaagggccagggaacatctggtaaaatcatttatgtggacagc
atgcacttagacttcccaagtctcggtctcctcatctgtgaaatggagatgataaaggag
caacttcagaaggttgttgtgaagatgaaaatggagataaagaatgccaagcacttagca
caggagcctggcgtgtggatcacctgctgtgagaggagccatgtggtaaggacattcaag
caggcttggaaagctccacgtggcaaagaacggaggcctttggccagcagcctgcatcca
ttctccagccctgtggacaactggacaattgggtcaggggaagatcttgtcatggcatcc
gctcaacaggcgataaatgtaaatgtcagctggattccagaatttgggcaagaaactgtc
ggtcccttagagacgctgtacacggtcaccggcgctgatcggtggcggggaaggcgctgg
gtgggagagcgggaacatccagtcatggataaaaatgagctggttcagaaggccaaactg
gccgagcaggctgagcgatatgatgacatggcagcctgcatgaagtctgtaactgagcaa
ggagctgaattatccaatgaggagaggaatcttctctcagttgcttataaaaatgttgta
ggagcccgtaggtcatcttggagggtcgtctcaagtattgaacaaaagacggaaggtgct
gagaaaaaacagcagatggctcgagaatacagagagaaaattgagacggagctaagagat
atctgcaatgatgtactgtctcttttggaaaagttcttgatccccaatgcttcacaagca
gagagcaaagtcttctatttgaaaatgaaaggagattactaccgttacttggctgaggtt
gccgctggtgatgacaagaaagggattgtcgatcagtcacaacaagcataccaagaagct
tttgaaatcagcaaaaaggaaatgcaaccaacacatcctatcagactgggtctggccctt
aacttctctgtgttctattatgagattctgaactccccagagaaagcctgctctcttgca
aagacagcttttgatgaagccattgctgaacttgatacattaagtgaagagtcatacaaa
gacagcacgctaataatgcaattactgagagacaacttgacattgtggacatcggatacc
caaggagacgaagctgaagcaggagaaggaggggaaaattaa
>gi568815590r:100820696_101048889|GENSCAN_predicted_peptide_7|82_aa
MERDPIPRGETVQVHWLTEALAAGSRRSVHVASVGDMKKPRIAELHTELCFKPVEPEGQQ
HSVQKLSFTMEYTQPHSEGDAA
>gi568815590r:100820696_101048889|GENSCAN_predicted_CDS_7|249_bp
atggagagggaccctatcccaagaggagagacagtacaggtgcactggctgaccgaggca
ttggctgcagggtcaaggagatctgtgcatgtagcaagcgttggtgacatgaagaagccc
cggatagcagagctacatacagagctgtgttttaagcccgtggagcccgaaggtcagcag
cacagcgtgcagaagctgagtttcaccatggagtacactcagccccactcagaaggcgat
gctgcctaa