GENSCAN 1.0 Date run: 4-Nov-116 Time: 05:34:42 Sequence gi568815595f:136754461_136955696 : 201236 bp : 41.16% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 9154 9286 133 0 1 68 55 68 0.105 1.85 1.02 Intr + 17719 17935 217 2 1 62 72 123 0.354 4.74 1.03 Intr + 25694 25847 154 2 1 10 98 155 0.270 7.85 1.04 Intr + 33896 34002 107 1 2 77 31 97 0.164 0.99 1.05 Intr + 35479 35550 72 1 0 106 84 10 0.213 0.00 1.06 Intr + 37890 38002 113 0 2 73 71 69 0.301 2.80 1.07 Term + 51390 51628 239 1 2 44 49 177 0.552 4.75 1.08 PlyA + 53222 53227 6 1.05 2.08 PlyA - 54144 54139 6 1.05 2.07 Term - 54574 54470 105 1 0 86 49 79 0.152 1.23 2.06 Intr - 63868 63738 131 2 2 69 30 74 0.039 -0.81 2.05 Intr - 64808 64511 298 2 1 34 59 378 0.527 25.02 2.04 Intr - 65176 65034 143 2 2 29 -61 160 0.245 -5.65 2.03 Intr - 65736 65624 113 2 2 57 62 83 0.712 1.70 2.02 Intr - 66087 65922 166 1 1 97 56 101 0.606 5.90 2.01 Init - 68641 68566 76 1 1 70 77 29 0.721 1.30 2.00 Prom - 73138 73099 40 -5.55 3.00 Prom + 83227 83266 40 -4.85 3.01 Sngl + 100001 101239 1239 1 0 70 42 537 0.688 42.86 3.02 PlyA + 102173 102178 6 1.05 4.02 PlyA - 102197 102192 6 1.05 4.01 Sngl - 108378 107965 414 0 0 69 42 292 0.777 16.75 4.00 Prom - 110483 110444 40 -6.75 5.00 Prom + 117190 117229 40 -6.65 5.01 Sngl + 118373 118873 501 1 0 30 44 282 0.807 13.79 5.02 PlyA + 119678 119683 6 1.05 6.00 Prom + 121148 121187 40 -6.15 6.01 Sngl + 121241 121981 741 1 0 44 42 253 0.965 12.25 6.02 PlyA + 121986 121991 6 1.05 7.02 PlyA - 122202 122197 6 1.05 7.01 Sngl - 135653 135423 231 2 0 97 36 157 0.537 4.92 7.00 Prom - 138675 138636 40 -4.65 8.02 PlyA - 139658 139653 6 1.05 8.01 Sngl - 145616 144972 645 2 0 103 42 528 0.799 45.62 8.00 Prom - 159633 159594 40 -4.35 9.00 Prom + 165209 165248 40 -7.15 9.01 Init + 173542 173767 226 0 1 71 82 322 0.999 28.58 9.02 Intr + 191123 191835 713 0 2 73 123 622 0.969 54.43 9.03 Term + 193799 193993 195 1 0 104 43 126 0.996 6.13 9.04 PlyA + 194641 194646 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 112889 112575 315 2 0 65 50 164 0.944 6.00 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:136754461_136955696|GENSCAN_predicted_peptide_1|344_aa MHHVPGKAADTQRHPVKAARREAVPCKATGAELPKTMGTHLLHQALWEANKAEELSPKSE LYKVGGTGPQKECNFTREYSKTPTEFQAMAINLGPLCSLCPGIAGKKKSHHSGIRERAGG NLFEEAPEPVPTGRMVHRCTGASTDDCSYGSRAMAGQLSSQEPSYSGTVCIILSTMWSSD RPMRGAQGGQCQQTQKQHQLQAKRQLDVLPLQSKVKVFLTLLTSPGPSLPKDFLQVQLEA PMRVTMLKANGTNRASSTRSAPCCLVIGRESPTRLSSGLWFITGSGEVLVFLSCEQRSLW VQKPLIVILERQGEKPAKNETNNEEVELRNREKLGLDLVILGAK >gi568815595f:136754461_136955696|GENSCAN_predicted_CDS_1|1035_bp atgcaccatgtgcctggaaaagctgcagacactcaacgccatcctgtgaaagcagccagg agggaggctgtaccctgcaaagccacaggggcagagctgcccaagaccatgggaacccac ctcttgcatcaagctctttgggaggctaataaggcagaagaattgtctcccaagtctgag ctctacaaagttggaggtacaggtccccagaaggagtgcaattttaccagagaatatagc aagactcccactgagttccaagctatggctatcaacctgggccctttgtgttctttgtgt ccagggatagcaggcaaaaagaagagtcaccattctggcatcagggaaagagcaggaggc aacttatttgaggaagcaccagagccagtgcccacaggaagaatggtccacaggtgcact ggggccagcacagatgactgcagctatgggtcacgagcaatggcaggacaactcagcagt caagaacccagctacagtggaacagtatgtatcatactcagcacaatgtggagctcagac aggcccatgcgaggagcccaagggggtcagtgccagcagactcagaaacagcaccagctc caggctaagaggcagttggatgtgctccctctgcagtcaaaggtcaaagtttttcttacc ttactcacctctccagggcccagcctcccaaaggacttcttacaagttcagttagaggca cccatgagagtaaccatgctcaaagctaatggtacaaacagagcctcctcaacaagatct gccccttgctgcctggtgattggaagagaaagtccaaccagactcagctctggactttgg ttcatcactgggagtggagaagtgctcgtcttcctttcatgtgaacaaagaagcctgtgg gtccagaaacctttgatagtcatcttggaacgacagggagagaagcctgccaagaatgaa accaacaatgaagaagtagaattgagaaacagagagaaacttggtcttgatctcgtcatt ttaggagctaaatga >gi568815595f:136754461_136955696|GENSCAN_predicted_peptide_2|343_aa MAFIQKPGNNKSWQGCGERELSHIVGNKICYAYAYLYLQGKPHLLSGYVQETSLNLSEKS FSVGPAPLWPPAMEMGYFWQSTLDVGGYYLQVLDQHSDPSFPPRSSRSRVKENLARDAVT HPLLGLATGREESNSPWPSSWTLQEGPDTRQRKETGNRAVADAGPPAGSRSQSAAADRLP GPGYLRVVARKVPRRRLTRACPRLGKSRRALPGVLRRERGAAARTLRLRSPPSAAPTPQP DWRRSWRLRGTLRVVPELKDFLYPRAGHPVSSGYVLHLKAITPILWPSPAVFSRFWLPLT TLAPSGLEWKNDCRITFMAWNAKDVAVSYYFYQLAKMHPEPGT >gi568815595f:136754461_136955696|GENSCAN_predicted_CDS_2|1032_bp atggcttttatccaaaagccaggcaataacaaaagctggcaaggatgtggagaaagggaa ctctcacacattgttggtaataaaatctgttatgcatatgcttatctgtacctccaaggc aagccacatttgctttcgggttatgttcaggaaaccagcctcaacttgtctgaaaagagt ttcagtgttgggcctgctccactctggccaccagccatggagatgggctatttttggcaa agcactctggacgtggggggttactatctccaggtcctcgatcaacactcggacccgagc tttccgcctcgctcctcgcggagccgggtgaaagaaaacctggcccgagatgcagtgact caccccttgcttgggttggctactgggcgagaggaaagcaacagcccctggccttcatca tggacgctgcaggagggcccagacacgcgacagcggaaagaaacgggaaaccgggccgta gccgatgctggccctcccgcaggatcaaggtcccagtccgccgccgccgaccggctcccg gggcccggttacctgcgtgtcgtggcgaggaaggtgccccggaggcgactgacgcgggcg tgccctcgcctgggaaaatcgaggcgggcgctgcccggagtactgcggagggagaggggc gccgccgcgcggactctgcgcctgcgctctccgccctccgccgcccccaccccacaacct gactggcgccggagttggagactgcgggggacgctcagggttgtgcctgaactgaaggac ttcctctacccgcgagcgggccaccctgttagcagtggctatgttcttcatctgaaagcc attactcccatcctgtggccctctcctgcagttttctccaggttctggttaccactaact acccttgccccttcaggcctggagtggaagaatgactgcaggataaccttcatggcatgg aatgccaaagatgtggctgtgtcttattatttctaccagttggcaaaaatgcacccagag cctggaacctag >gi568815595f:136754461_136955696|GENSCAN_predicted_peptide_3|412_aa MDTSPSRKYPVKKRVKIHPNTVMVKYTSHYPQPGDDGYEEINEGYGNFMEENPKKGLLSE MKKKGRAFFGTMDTLPPPTEDPMINEIGQFQSFAEKNIFQSRKMWIVLFGSALAHGCVAL ITRLVSDRSKVPSLELIFIRSVFQVLSVLVVCYYQEAPFGPSGYRLRLFFYGVCNVISIT CAYTSFSIVPPSNGTTMWRATTTVFSAILAFLLVDEKMAYVDMATVVCSILGVCLVMIPN IVDEDNSLLNAWKEAFGYTMTVMAGLTTALSMIVYRSIKEKISMWTALFTFGWTGTIWGI STMFILQEPIIPLDGETWSYLIAICVCSTAAFLGVYYALDKFHPALVSTVQHLEIVVAMV LQLLVLHIFPSIYDVFGGVIIMISVFVLAGYKLYWRNLRKQDYQEILDSPIK >gi568815595f:136754461_136955696|GENSCAN_predicted_CDS_3|1239_bp atggatacttctccctccagaaaatatccagttaaaaaacgggtgaaaatacatcccaac acagtgatggtgaaatatacttctcattatccccagcctggcgatgatggatatgaagaa atcaatgaaggctatggaaattttatggaggaaaatccaaagaaaggtctgctgagtgaa atgaaaaaaaaagggagagctttctttggaaccatggataccctacctccaccaacagaa gacccaatgatcaatgagattggacaattccagagctttgcagaaaaaaacatttttcaa tcccgaaaaatgtggatagtgctgtttggatctgctttggctcatggatgtgtagctctt atcactaggcttgtttctgatcggtctaaagttccatctctagaactgatttttatccgt tctgtttttcaggtcttatctgtgttagttgtgtgttactatcaggaggccccctttgga cccagtggatacagattacgactcttcttttatggtgtatgcaatgtcatttctatcact tgtgcttatacatcattttcaatagttcctcccagcaatgggaccactatgtggagagcc acaactacagtcttcagtgccattttggcttttttactcgtagatgagaaaatggcttat gttgacatggctacagttgtttgcagcatcttaggtgtttgtcttgtcatgatcccaaac attgttgatgaagacaattctttgttaaatgcctggaaagaagcctttgggtacaccatg actgtgatggctggactgaccactgctctctcaatgatagtatacagatccatcaaggag aagatcagcatgtggactgcactgtttacttttggttggactgggacaatttggggaata tctactatgtttattcttcaagaacccatcatcccattagatggagaaacctggagttat ctcattgctatatgtgtctgttctactgcagcattcttaggagtttattatgccttggac aaattccatccagctttggttagcacagtacaacatttggagattgtggtagctatggtc ttgcagcttctcgtgctgcacatatttcctagcatctatgatgtttttggaggggtaatc attatgattagtgtttttgtccttgctggctataaactttactggaggaatttaagaaag caggactaccaggaaatactagactctcccattaaatga >gi568815595f:136754461_136955696|GENSCAN_predicted_peptide_4|137_aa MSTRSTAQADASPPLGLEVRAPFLLPRPYLPGLGLRHSWQKDFFMRKQTSQPARRTLDLP GSHGRPPGLSARHSAPGADKEAAGSRSLGQRPEGLAESRCRSLDPPSSLRAPSGSAPSLQ TAPPGLTQREKRKRRSE >gi568815595f:136754461_136955696|GENSCAN_predicted_CDS_4|414_bp atgtccacccgcagcaccgcgcaggctgacgcctcccctcctctcgggctggaggtgaga gcgcccttcctcctaccacgtccctaccttcccggcctcggattacgccacagctggcaa aaggactttttcatgagaaaacagacaagtcaaccagcgcgccgcacactcgacctccca ggctcgcacgggcggccgcccgggctctcagcccgacacagcgctccaggggctgacaag gaggccgcggggtcacgctccctggggcaacggcccgagggtctcgcggaatcccggtgc agaagtctggaccctccgagcagcctccgcgccccctccggctcggctccgagcctgcag acggcgcccccggggctaactcagcgcgagaagaggaagcgacgcagcgagtaa >gi568815595f:136754461_136955696|GENSCAN_predicted_peptide_5|166_aa MLQCGVDAASAQKSRIGVWEPPPAFQKMYGNAWMPRQKFAAGVEHSWRTSARAVQKGNVG SEPPHRVPTRVPPSGAVRRGPPSSSLQNDRSTDSLHHAPGKAADTQCKPVKAARREAVLC KTTESEVPKTMEIHFLDQHDLDVRHGVKGDHFGALRFDCPAVFWIC >gi568815595f:136754461_136955696|GENSCAN_predicted_CDS_5|501_bp atgctgcaatgtggtgttgatgctgcaagtgcacagaagtcaagaattggggtttgggaa cctccacctgcatttcagaagatgtatggaaatgcttggatgcccaggcagaagtttgca gcaggggtagagcattcgtggagaacctctgctagggcagtgcagaagggaaatgtgggg tcagagcccccacacagagttcctactcgggtaccgcctagtggagctgtgagaagaggg ccaccatcctccagcctccagaatgatagatccactgacagcttgcaccatgcgcctgga aaagctgcagacactcaatgcaagcctgtgaaagcagccaggagagaggctgtactctgc aaaaccacagagtcagaggtgcccaagaccatggaaatccacttcttggatcagcatgac ctggatgtgagacatggagtcaaaggagatcattttggagctttaagatttgactgccct gctgtattttggatttgctag >gi568815595f:136754461_136955696|GENSCAN_predicted_peptide_6|246_aa MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQVDLIDIYRTLHPKSTEYTFFSAPHHTYS KIDHIVGSKALLSKYKRTEIITNYLSGHSAIKLELRIKNLTQNRSTTWKLNNLLLNDYWV HNEMKAEVKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAQKRKQERSKIDTLTSQLK ELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIK KKRRIK >gi568815595f:136754461_136955696|GENSCAN_predicted_CDS_6|741_bp atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaagtcaac aaggatacccaggaattgaactcagctctgcaccaagtggacctaatagacatctacaga actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc aaaattgaccacatagttggaagtaaagctctcctcagcaaatataaaagaacagaaatt ataacaaactatctctcaggccacagtgcaatcaaactagaactcaggattaagaatctc actcaaaaccgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataacgaaatgaaggcagaagtaaagatgttctttgaaaccaacgagaacaaagacaca acataccagaatctctgggacgcattcaaagcagtgtgtagagggaaatttatagcacta aatgcccaaaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa atcagagcagaactgaaagaaatagagacacaaaaaacccttcaaaaaatcaatgaatcc aggagctggttttttgaaaggatcaacaaaattgatagaccgctagcaagactaataaag aaaaagagaagaatcaaatag >gi568815595f:136754461_136955696|GENSCAN_predicted_peptide_7|76_aa MPEPPRPMGCCAARASPTRTTLCSRAPNPIDHPRAQECGHTAQDWQAAPPATPVRDPLGE ASWAPESGGALENLYV >gi568815595f:136754461_136955696|GENSCAN_predicted_CDS_7|231_bp atgcctgagcctccccgccccatgggctgctgtgccgcccgagcctccccaacaaggacc accctctgctccagggcgcccaatcccattgaccacccaagggctcaggagtgcgggcac acagcgcaggactggcaggcagctccacctgcaacccctgtgcgggatccactgggtgaa gccagctgggctcctgagtctggtggggccttggagaatctttatgtctag >gi568815595f:136754461_136955696|GENSCAN_predicted_peptide_8|214_aa MVQPVRCKKPINYSQFGDSDSDDDFVSATVPLNKQSRTSKELKQDKPKPNLNNLQKEEIP LEEKTPKKKRMALDDKLYQRDLEVALALSVKELSTVTTNVQKSQDKRVEKHGNSRTETVS KSPRISNCSVASDYLDLDKITKKDNGGIQGKRKAASKAAVQQRKIFLEGSDGNSANNTKP DLATGEDSEDDSDFGESEDNDKDSSMRKSKVKEI >gi568815595f:136754461_136955696|GENSCAN_predicted_CDS_8|645_bp atggtgcagcctgtgagatgtaagaaaccaatcaattactcacagtttggcgactctgac agtgatgatgattttgtttctgcaactgtacctttaaacaagcaatccagaacatcaaag gagttaaaacaagataaaccaaaacctaatttgaacaatctccagaaagaagaaatccca ctagaagagaaaacccctaaaaaaaaaaggatggctttagatgataagctctaccagaga gacttagaagttgcactagctttatcagtgaaggaactttcaacagtcaccactaatgtg cagaagtctcaagataaaagagttgaaaaacatggcaatagtagaacagaaacagtgagt aagtctcctcgtatctctaattgcagtgtagccagtgattatttagatttggataagatt actaagaaagacaatggtggtattcaagggaaaagaaaagcagcatctaaagctgcggta caacagaggaaaatttttctggaaggcagtgatggcaatagtgctaataacaccaaacca gacttggcaactggtgaagattctgaggatgattctgattttggtgagagtgaggataat gacaaagactcctctatgagaaaaagtaaagttaaagaaatttaa >gi568815595f:136754461_136955696|GENSCAN_predicted_peptide_9|377_aa MAEEVVVVAKFDYVAQQEQELDIKKNERLWLLDDSKSWWRVRNSMNKTGFVPSNYVERKN SARKASIVKNLKDTLGIGKVKRKPSVPDSASPADDSFVDPGERLYDLNMPAYVKFNYMAE REDELSLIKGTKVIVMEKCSDGWWRGSYNGQVGWFPSNYVTEEGDSPLGDHVGSLSEKLA AVVNNLNTGQVLHVVQALYPFSSSNDEELNFEKGDVMDVIEKPENDPEWWKCRKINGMVG LVPKNYVTVMQNNPLTSGLEPSPPQCDYIRPSLTGKFAGNPWYYGKVTRHQAEMALNERG HEGDFLIRDSESSPNDFSVSLKAQGKNKHFKVQLKETVYCIGQRKFSTMEELVEHYKKAP IFTSEQGEKLYLVKHLS >gi568815595f:136754461_136955696|GENSCAN_predicted_CDS_9|1134_bp atggcagaagaagtggtggtagtagccaaatttgattatgtggcccaacaagaacaagag ttggacatcaagaagaatgagagattatggcttctggatgattctaagtcctggtggcga gttcgaaattccatgaataaaacaggttttgtgccttctaactatgtggaaaggaaaaac agtgctcggaaagcatctattgtgaaaaacctaaaggataccttaggcattggaaaagtg aaaagaaaacctagtgtgccagattctgcatctcctgctgatgatagttttgttgaccca ggggaacgtctctatgacctcaacatgcccgcttatgtgaaatttaactacatggctgag agagaggatgaattatcattgataaaggggacaaaggtgatcgtcatggagaaatgcagt gatgggtggtggcgtggtagctacaatggacaagttggatggttcccttcaaactatgta actgaagaaggtgacagtcctttgggtgaccatgtgggttctctgtcagagaaattagca gcagtcgtcaataacctaaatactgggcaagtgttgcatgtggtacaggctctttaccca ttcagctcatctaatgatgaagaacttaatttcgagaaaggagatgtaatggatgttatt gaaaaacctgaaaatgacccagagtggtggaaatgcaggaagatcaatggtatggttggt ctagtaccaaaaaactatgttaccgttatgcagaataatccattaacttcaggtttggaa ccatcacctccacagtgtgattacattaggccttcactcactggaaagtttgctggcaat ccttggtattatggcaaagtcaccaggcatcaagcagaaatggcattaaatgaaagagga catgaaggggatttcctcattcgtgatagtgaatcttcgccaaatgatttctcagtatca ctaaaagcacaagggaaaaacaagcattttaaagtccaactaaaagagactgtctactgc attgggcagcgtaaattcagcaccatggaagaacttgtagaacattacaaaaaggcacca atttttacaagtgaacaaggagaaaaattatatcttgtcaagcatttatcatga