GENSCAN 1.0 Date run: 5-Nov-116 Time: 12:49:04 Sequence gi568815582r:49179407_49381465 : 202059 bp : 43.86% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4686 4826 141 2 0 53 31 91 0.003 0.45 1.02 Intr + 20841 20928 88 2 1 98 81 65 0.887 6.34 1.03 Intr + 26556 26647 92 1 2 73 115 -15 0.120 -0.59 1.04 Intr + 33783 33942 160 2 1 46 97 57 0.614 1.96 1.05 Intr + 34749 34957 209 1 2 23 96 98 0.542 2.90 1.06 Term + 37412 37522 111 1 0 81 37 99 0.777 2.66 1.07 PlyA + 39017 39022 6 1.05 2.00 Prom + 41238 41277 40 -4.06 2.01 Init + 48745 48764 20 0 2 84 60 38 0.537 0.10 2.02 Intr + 51480 52234 755 0 2 54 72 242 0.371 10.31 2.03 Term + 56607 56764 158 1 2 48 38 102 0.465 -0.70 2.04 PlyA + 56823 56828 6 1.05 3.00 Prom + 72100 72139 40 -4.26 3.01 Init + 84806 84881 76 1 1 50 77 74 0.426 4.00 3.02 Intr + 92102 92224 123 0 0 70 108 45 0.638 5.26 3.03 Intr + 92955 93131 177 1 0 56 41 94 0.501 1.39 3.04 Term + 99188 99282 95 0 2 61 33 148 0.765 4.59 3.05 PlyA + 99676 99681 6 1.05 4.11 PlyA - 99773 99768 6 1.05 4.10 Term - 100195 99998 198 1 0 98 42 239 0.991 17.70 4.09 Intr - 101636 101517 120 2 0 122 76 169 0.897 19.79 4.08 Intr - 101993 101796 198 2 0 31 109 344 0.001 30.35 4.07 Intr - 102411 102195 217 2 1 81 34 249 0.001 17.31 4.06 Intr - 103198 103082 117 0 0 71 101 88 0.991 7.98 4.05 Intr - 104175 104135 41 0 2 93 78 35 0.389 0.02 4.04 Intr - 115944 115805 140 1 2 121 60 57 0.429 6.38 4.03 Intr - 133423 133309 115 1 1 0 94 62 0.001 -1.98 4.02 Intr - 146180 146104 77 0 2 86 84 40 0.248 2.63 4.01 Init - 148432 148309 124 1 1 58 81 65 0.272 3.13 4.00 Prom - 158304 158265 40 -2.46 5.00 Prom + 164830 164869 40 -6.66 5.01 Init + 165114 165216 103 2 1 93 92 91 0.929 8.68 5.02 Intr + 167849 167949 101 0 2 89 63 6 0.662 -1.97 5.03 Term + 168634 168732 99 0 0 125 36 77 0.670 4.33 5.04 PlyA + 169017 169022 6 1.05 6.00 Prom + 174927 174966 40 -1.66 6.01 Init + 179327 179549 223 1 1 97 27 128 0.484 6.32 6.02 Term + 185906 186033 128 0 2 118 38 53 0.487 1.74 6.03 PlyA + 187235 187240 6 1.05 7.00 Prom + 189649 189688 40 -3.86 7.01 Init + 194534 194683 150 1 0 59 90 122 0.898 9.54 7.02 Intr + 198325 198444 120 0 0 105 95 128 0.998 15.89 7.03 Term + 199064 199246 183 1 0 80 43 178 0.603 10.04 7.04 PlyA + 199845 199850 6 1.05 8.02 PlyA - 200971 200966 6 1.05 8.01 Sngl - 202011 201214 798 0 0 60 31 312 0.862 18.56 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 5022 4891 132 0 0 125 39 63 0.921 3.19 S.002 Init - 102059 101796 264 2 0 88 109 596 0.990 56.71 S.003 Term - 102411 102189 223 2 1 81 36 260 0.997 16.49 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582r:49179407_49381465|GENSCAN_predicted_peptide_1|266_aa DLQAPEEEADSIYPGLGCAQVVAAGGSESEQQRPLLVYTFTTPGDSELGNDQAIAGRLKG VLCQLAQHADDNTSLGGNREWGKGEKQRGCKIENLKGLLRNGSENSMSVALGSREALQAQ SKATGIGKQDEKKDQVRIQIKPNTLGIFLEEMGCGLPAGEGRENRKRKRPKAEESLDYLR NSNNSNENWCGWIRVGEKIMSRSSEEPVSTALKIMVYIMHDENMSKYFGQEPEEKEGERL REAEIKEPGREPFKRRKKKGKSQHEF >gi568815582r:49179407_49381465|GENSCAN_predicted_CDS_1|801_bp gatctgcaagctccagaggaggaagcagacagtatctacccaggccttgggtgtgcccag gtggttgctgctgggggtagcgagagtgagcagcagcggccccttttggtgtatacattt actacacctggtgattctgagctgggaaatgaccaagctattgcagggagactaaaggga gtgctttgccagctggctcagcacgccgatgacaacacctctcttggcgggaacagagag tggggaaaaggagaaaaacagagggggtgtaagatagaaaatttaaaaggattgctacga aatggaagcgaaaacagcatgtctgtagcactggggagtagagaagccctccaagcacag agtaaagccactgggattgggaaacaggatgagaagaaagatcaagtcagaattcaaatc aagcccaacacactgggaatctttctggaggaaatgggatgtggccttcctgcaggagaa ggcagagaaaatagaaagcgtaaacgccctaaggcagaagaaagcctggattatttgagg aatagcaacaacagcaatgaaaactggtgtggatggatcagagttggggagaagattatg tcaagaagcagtgaggaacccgtaagtacggccttaaagatcatggtttatatcatgcat gatgagaacatgtcaaagtattttgggcaggagcctgaagagaaggagggagaaagactg agagaagcagagatcaaagaaccaggaagagagccatttaaaaggaggaagaagaaaggc aagagccagcatgagttctga >gi568815582r:49179407_49381465|GENSCAN_predicted_peptide_2|310_aa MSDTEDSAIKLELRVKKLTQNFSATWKLNNLLLNDYWVHNETKVEIKVFFETNENKDTTY QNLWDTFKAVCRGKFIALNTHKRKQQRSKIDTLTSQLKELEKQEQTYSKASRREEITKIR AELKEIETQKTLQKINESRSWFFEKINKINRPLARLIKKKREKNQIDTIKNDKGDITTNP TQIQTTIREYYKHLYTNKLENLEEMDEFLHTYALPRLNQEEFESLNRPITGSEIEAIINS LPTKKVQDQMDSQLNSIRGLLNNISKAIKPSFQEDASGSAVSGWESQPNYFQLVVNCGKV DFTSLNFSSH >gi568815582r:49179407_49381465|GENSCAN_predicted_CDS_2|933_bp atgagcgacacagaagacagcgcaatcaaactagaactcagggttaagaagctcactcaa aacttctcagctacatggaaactgaacaacctgctcctgaatgactactgggtacataat gaaacgaaggtagaaataaaggtgttctttgaaaccaatgagaacaaagacacaacatat cagaatctctgggacacatttaaagcagtgtgtagagggaaatttatagcactaaatacc cacaagagaaagcagcaaagatctaaaattgacaccctaacatcacaattaaaagaacta gagaagcaggagcaaacatattcaaaagctagcagaagggaagaaattactaagatcaga gcagaactgaaggagatagagacacaaaaaacccttcaaaaaatcaatgaatccaggagc tggttttttgaaaagatcaacaaaattaatagaccgctagcaaggctaataaagaagaaa agagagaagaatcaaatagacacaataaaaaatgataaaggggatatcaccaccaatccc acacaaatacaaactaccatcagagaatattataaacacctctacacaaataaactagaa aatctagaagaaatggatgaattcctgcacacatacgccctcccaagactaaaccaggaa gaatttgaatccctgaatagaccaataacaggctctgaaattgaggcaataattaatagc ctaccaaccaaaaaagtccaggaccagatggattcacagctgaattctatcagaggacta cttaacaacatcagcaaagcaattaagcccagttttcaggaagatgccagcggctctgca gtgtctggatgggagtctcagcccaattatttccagctagttgtgaactgtggaaaagtg gactttacttcattaaacttcagttcccactga >gi568815582r:49179407_49381465|GENSCAN_predicted_peptide_3|156_aa MPLKPMLLISIYTALSLFVHGTYGLETGTYLGNQGKQLQTSQPQRAPLSLTLMQEVIEAY MLVNLPREPKAMMGFQSSFCFQPEQKPSDLMAIHEQGITSPLIGQDPVWLLFTSGQCAST MGKPAEEEEASGRSSLAFRGVTEHDGIDKHQNIKQP >gi568815582r:49179407_49381465|GENSCAN_predicted_CDS_3|471_bp atgcctctaaagcccatgctcctgatcagtatttatactgccttgtccctcttcgttcat gggacctatgggctagagacagggacatatttaggcaatcaaggaaaacagctgcaaact tcacagcctcaaagagcaccactgtcactcactttgatgcaggaggttattgaggcctac atgctggtgaacctgccaagagaacccaaagctatgatgggatttcagagttcattctgc ttccagccagaacaaaaacccagtgaccttatggccatccatgaacaaggaattaccagc cccctcattggccaagaccctgtatggctactttttacatctggccagtgtgccagtacc atgggaaagcctgctgaggaagaggaggcgtccggaagaagctccttggcgtttcgtgga gtcacagagcacgacggcattgacaagcatcagaacatcaagcagccttag >gi568815582r:49179407_49381465|GENSCAN_predicted_peptide_4|448_aa MKWIGAFVKAHPRSEIIDHSMLTTAPALFLLNHKWATLQKEGGDEREKAHAYDKSLWFAF PRCFYYSQADPTPCSLTPLCRHRHRVPRCVCKLLFSRQIFCWGLTGWGGALEHKEELKNI PVTYKAEKRETSGDKTNHPFDIWLALPITALGRPWPAVLNGGGGRRAVAGEAGESERPSS TIAAAGWGRGTRRHSAPRLRTESPRAGCINNSERRRRRQRRVRAGGGGSSGTEQQRLCIQ VRLGSRGTPEAREGLREHSAEGTLVAEGTLVPECEEENETEPIVLEGKCLVVCDSNPTSD PTGTALGISVRSGSAKVAFSAIRSTNHEPSEMSNRTMIIYFDQVLVNIGNNFDSERSTFI APRKGIYSFNFHVVKVYNRQTIQVSLMLNGWPVISAFAGDQDVTREAASNGVLIQMEKGD RAYLKLERGNLMGGWKYSTFSGFLVFPL >gi568815582r:49179407_49381465|GENSCAN_predicted_CDS_4|1347_bp atgaagtggataggagcatttgtcaaggcccacccacgctcagaaatcatcgatcacagc atgctgaccactgctcctgcactctttctcctgaatcacaaatgggccactctgcagaaa gaaggcggagatgagagggaaaaagctcatgcctatgataaaagcctatggtttgctttt ccacgatgtttctattacagtcaagctgacccaacgccctgctcgctgactcctttatgc cggcaccggcaccgagtgccacgctgcgtttgcaagctattgttcagcaggcagattttc tgctgggggttaacaggttggggaggtgctttagaacataaggaggaactaaagaatatt cctgttacatacaaggcagagaaaagggaaactagtggtgacaagacaaatcatcccttt gacatttggctggctctacctattacagctctggggcgaccctggccggccgtgctcaac gggggcgggggtcggagagcagtcgctggagaggcgggagagagcgagaggccaagctcc accatcgcggccgccggctggggcagagggacacggagacactcggcgccccggctccgc accgagtccccgcgcgccggctgcatcaataattcagagcggcggcggcggcggcagcgg cgggtgcgggcaggaggcggcggcagcagcgggaccgagcagcagcggctatgcatccaa gtgcggctgggcagccgcggcacccctgaggcccgggaggggctccgggaacacagcgcg gaggggacgctagtcgcggaggggacgctagtcccggagtgcgaggaagagaatgagacg gagcccatcgtgctggagggcaagtgcctggtggtgtgcgactccaaccccacgtccgac cccacgggcactgccctgggcatctctgtgcgctctggcagcgccaaggtggctttctct gccatcaggagcaccaaccacgagccgtccgagatgagtaatcgcaccatgatcatctac ttcgaccaggtactagtgaacattgggaacaactttgattcagaacgcagcactttcatc gccccgcgcaaagggatctacagttttaacttccacgtggtaaaagtctacaacagacaa accatacaggtgagcctcatgctaaacgggtggccggtgatttcagccttcgctggtgac caggacgtgacccgggaggccgccagcaacggagtcctaatccaaatggagaaaggcgac cgagcatacctcaagctggagcggggaaacttgatggggggctggaagtactcgaccttc tccggattcctggtgtttcctctctga >gi568815582r:49179407_49381465|GENSCAN_predicted_peptide_5|100_aa MRKGLAIPCASAGRPAQVPTPLREQTQLSPNLNAGGSLHGSKWPPEAPDTSFYQQPQENF SFSVKSHKEKIYNFHQITQGDFILKRLKTTNINFDSYGHL >gi568815582r:49179407_49381465|GENSCAN_predicted_CDS_5|303_bp atgaggaaggggcttgccatcccctgtgccagtgcgggaagaccagcccaagtgcccacc ccactgcgggagcagactcagctgtccccaaacctgaatgcaggtggctcccttcatggg tcaaagtggccaccagaagctccagatacatctttttatcaacagccccaggagaacttc tctttctcagtaaaatcccacaaggaaaaaatctacaactttcatcaaattactcaggga gactttattctaaagagactgaaaacgaccaacatcaattttgactcctatgggcatctg taa >gi568815582r:49179407_49381465|GENSCAN_predicted_peptide_6|116_aa MEYQGELNTWAHTLQRETRVMLCVPCTEKMNACAGSLRNSSGTAKSQIRNWRRVGAKKLS SSRETQTQGVEGLGGSIGGGFEPRKNAIFGVIKSKAHTTHSTIANKSRDKVLGQGK >gi568815582r:49179407_49381465|GENSCAN_predicted_CDS_6|351_bp atggaataccagggggagctgaacacgtgggctcacacgctacagagggagacccgtgtg atgctatgtgtcccatgcactgaaaagatgaatgcttgtgctgggtcactgagaaactcc tctggcacagcaaaatcgcagattcggaattggaggcgagtgggtgctaagaaactgagc agcagcagggaaactcagactcagggagtggaggggcttggggggagtattgggggagga tttgaaccaagaaaaaatgcaatttttggtgtcatcaagtccaaagctcataccactcac agcacaatagccaataagtcaagagacaaggtcttggggcaaggaaagtga >gi568815582r:49179407_49381465|GENSCAN_predicted_peptide_7|150_aa MSEQQMDLKDLMPTKRKYMWKTAEDRRMSDLTCVLEWLERRQGKKKQAPEKQKPKVVTVL KRNKKKEEKKGKGLMTARGGNRRDTETSQQALGKRFRKDAASYRSLYGVEQKGKHLSMVP GSYIKDGPKKSGKGKTLAPATRILLHNLLL >gi568815582r:49179407_49381465|GENSCAN_predicted_CDS_7|453_bp atgtcagagcaacaaatggacctgaaggatttaatgcccacaaagaggaaatacatgtgg aagactgctgaagataggcgcatgtctgacctcacctgtgtgctggagtggctggagcgg aggcaggggaagaagaaacaagctcccgagaagcaaaagcccaaagtggtgacagtcctt aaacgaaataagaagaaggaagagaagaaaggcaaaggcctcatgacagcacggggaggg aaccgcagggacacggagacttcccagcaggccttaggaaagagattcaggaaggacgcc gcctcctaccgaagcctctatggagtggagcaaaaggggaaacacctcagcatggtccct ggcagctacatcaaggatggccccaagaaatctggtaagggaaaaaccttggccccggcc accagaatcctgctccataatctcctcctctag >gi568815582r:49179407_49381465|GENSCAN_predicted_peptide_8|265_aa MSELPFTIASKRIKYLGIQLTKDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIV KMAILPKVNYRFNAIPIKLPMPFFTQLEKTTLKFIWNQKTARIAKSILSQKNKAGGITLP DFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKSEKNKQWGKDSLFN KWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKD FMSKTPKAMATEAKIDKWDLIKLKM >gi568815582r:49179407_49381465|GENSCAN_predicted_CDS_8|798_bp atgagtgaactcccattcacaattgcttcaaagagaataaaatatctaggaatccaactt acaaaggatgtgaaggacctcttcaaggagaactacaaaccactgctcaacgaaataaaa gaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatattgtg aaaatggccatactgcccaaggtaaattacagattcaatgccatccccatcaagctacca atgcctttcttcacacaattggaaaaaactactttaaagttcatatggaaccaaaaaaca gctcgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacct gacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaac agagatatagatcaatggaacagaacagagccctcagaaataatgccgcatatctacaac tatctgatctttgacaaatctgagaaaaacaagcaatggggaaaggattccctatttaat aaatggtgctgggaaaactggcttgccatatgtagaaagctgaaactggatcccttcctt acaccttatacaaaaatcaattcaagatggattaaagacttaaacgttagacctaaaacc ataaaaaccctagaagaaaacttaggcattaccattcaggacataggcatgggcaaggac ttcatgtctaaaacaccaaaagcaatggcaacagaagccaaaattgacaaatgggatcta attaaactaaagatgtaa