GENSCAN 1.0    Date run: 5-Nov-116    Time: 05:22:06

Sequence gi568815578f:44380000_44589538 : 209539 bp : 45.51% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  10833  10877   45  2  0   41  100    59 0.250   3.08
 1.02 Term +  15000  15128  129  2  0   48   45   136 0.381   3.48
 1.03 PlyA +  17024  17029    6                               1.05

 2.00 Prom +  17371  17410   40                              -4.56
 2.01 Init +  21374  21488  115  1  1   51   78   178 0.482  13.47
 2.02 Intr +  26059  26233  175  2  1  100   94   324 0.998  33.20
 2.03 Intr +  27382  27476   95  1  2   23  105   186 0.711  13.41
 2.04 Intr +  33695  33801  107  0  2  122   86   161 0.526  19.23
 2.05 Intr +  34508  34663  156  1  0   44   94   314 0.999  27.81
 2.06 Intr +  38264  38317   54  1  0  131   80   -22 0.564   0.48
 2.07 Intr +  38426  38513   88  1  1  108   66   132 0.646  12.54
 2.08 Intr +  39722  39877  156  0  0   95   92   324 0.998  33.48
 2.09 Intr +  44019  44255  237  1  0  110   65   534 0.998  50.89
 2.10 Intr +  48336  48458  123  1  0  127   56   122 0.593  13.46
 2.11 Term +  49524  49666  143  1  2   99   43   123 0.396   6.99
 2.12 PlyA +  52822  52827    6                               1.05

 3.02 PlyA -  55512  55507    6                               1.05
 3.01 Sngl -  86945  86694  252  2  0   86   49   188 0.309   9.82
 3.00 Prom -  90081  90042   40                              -7.06

 4.00 Prom +  94830  94869   40                              -4.26
 4.01 Init + 100001 100445  445  1  1    6   70   653 0.974  51.48
 4.02 Intr + 104338 104531  194  2  2   55  110   116 0.977   9.71
 4.03 Intr + 106597 106707  111  0  0  117   89    63 0.999   9.88
 4.04 Term + 109264 109542  279  0  0   69   41   291 0.950  17.95
 4.05 PlyA + 113287 113292    6                               1.05

 5.12 PlyA - 114508 114503    6                               1.05
 5.11 Term - 120435 120297  139  2  1   94   46    80 0.974   1.84
 5.10 Intr - 121301 121074  228  1  0   57   91   201 0.648  14.28
 5.09 Intr - 123996 123816  181  1  1   72   97    34 0.990   1.63
 5.08 Intr - 124892 124802   91  2  1   97   95    52 0.914   6.37
 5.07 Intr - 126997 126828  170  2  2   75   80    41 0.977   1.67
 5.06 Intr - 130029 129892  138  1  0   43   82   118 0.839   7.14
 5.05 Intr - 131369 131290   80  1  2   61   91    80 0.998   4.79
 5.04 Intr - 132995 132802  194  2  2   69   71   155 0.996  10.19
 5.03 Intr - 133985 133880  106  1  1   63  116    41 0.161   4.62
 5.02 Intr - 147522 147483   40  1  1   68   92    44 0.088   0.08
 5.01 Init - 152201 151688  514  2  1   53   51   284 0.800  16.27
 5.00 Prom - 157424 157385   40                              -2.06

 6.00 Prom + 169028 169067   40                              -2.16
 6.01 Init + 190167 190182   16  2  1   83   79     5 0.447  -0.53
 6.02 Term + 196917 196978   62  1  2  104   49    88 0.856   4.47
 6.03 PlyA + 198692 198697    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  27529  27599   71  2  2   85   39    38 0.802  -3.30
S.002 Term +  45321  45466  146  1  2   42   43   199 0.979   8.87

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578f:44380000_44589538|GENSCAN_predicted_peptide_1|57_aa
MTFQSEKENPDDLNKPQAPSSSSRKYSCDKPVATHHRVNKPVATHHRVNKHNATHGL
>gi568815578f:44380000_44589538|GENSCAN_predicted_CDS_1|174_bp
atgacgttccaatcagagaaggagaatccagacgacctcaacaagccccaagctccctca
tcttcctctcggaaatactcctgtgacaagccagtggccacccatcatcgggtgaacaag
ccagtggccacccatcatcgggtgaacaaacacaatgccactcatggtctgtga
>gi568815578f:44380000_44589538|GENSCAN_predicted_peptide_2|482_aa
MRLSKTLVDMDMADYSAALDPAYTTLEFENVQVLTMGNDTSPSEGTNLNAPNSLGVSALC
AICGDRATGKHYGASSCDGCKGFFRRSVRKNHMYSCRFSRQCVVDKDKRNQCRYCRLKKC
FRAGMKKEAVQNERDRISTRRSSYEDSSLPSINALLQAEVLSRQITSPVSGINGDIRAKK
IASIADVCESMKEQLLVLVEWAKYIPAFCELPLDDQVSSFMGSSFMMPISQFRQVALLRA
HAGEHLLLGATKRSMVFKDVLLLGNDYIVPRHCPELAEMSRVSIRILDELVLPFQELQID
DNEYAYLKAIIFFDPDAKGLSDPGKIKRLRSQVQVSLEDYINDRQYDSRGRFGELLLLLP
TLQSITWQMIEQIQFIKLFGMAKIDNLLQEMLLGGSPSDAPHAHHPLHPHLMQEHMGTNV
IVANTMPTHLSNGQMSTPETPQPSPPGGSGSEPYKLLPGAVATIVKPLSAIPQPTITKQE
VI
>gi568815578f:44380000_44589538|GENSCAN_predicted_CDS_2|1449_bp
atgcgactctccaaaaccctcgtcgacatggacatggccgactacagtgctgcactggac
ccagcctacaccaccctggaatttgagaatgtgcaggtgttgacgatgggcaatgacacg
tccccatcagaaggcaccaacctcaacgcgcccaacagcctgggtgtcagcgccctgtgt
gccatctgcggggaccgggccacgggcaaacactacggtgcctcgagctgtgacggctgc
aagggcttcttccggaggagcgtgcggaagaaccacatgtactcctgcagatttagccgg
cagtgcgtggtggacaaagacaagaggaaccagtgccgctactgcaggctcaagaaatgc
ttccgggctggcatgaagaaggaagccgtccagaatgagcgggaccggatcagcactcga
aggtcaagctatgaggacagcagcctgccctccatcaatgcgctcctgcaggcggaggtc
ctgtcccgacagatcacctcccccgtctccgggatcaacggcgacattcgggcgaagaag
attgccagcatcgcagatgtgtgtgagtccatgaaggagcagctgctggttctcgttgag
tgggccaagtacatcccagctttctgcgagctccccctggacgaccaggtttctagtttt
atgggtagtagttttatgatgcccatttcacagttcaggcaggtggccctgctcagagcc
catgctggcgagcacctgctgctcggagccaccaagagatccatggtgttcaaggacgtg
ctgctcctaggcaatgactacattgtccctcggcactgcccggagctggcggagatgagc
cgggtgtccatacgcatccttgacgagctggtgctgcccttccaggagctgcagatcgat
gacaatgagtatgcctacctcaaagccatcatcttctttgacccagatgccaaggggctg
agcgatccagggaagatcaagcggctgcgttcccaggtgcaggtgagcttggaggactac
atcaacgaccgccagtatgactcgcgtggccgctttggagagctgctgctgctgctgccc
accttgcagagcatcacctggcagatgatcgagcagatccagttcatcaagctcttcggc
atggccaagattgacaacctgttgcaggagatgctgctgggagggtcccccagcgatgca
ccccatgcccaccaccccctgcaccctcacctgatgcaggaacatatgggaaccaacgtc
atcgttgccaacacaatgcccactcacctcagcaacggacagatgtccacccctgagacc
ccacagccctcaccgccaggtggctcagggtctgagccctataagctcctgccgggagcc
gtcgccacaatcgtcaagcccctctctgccatcccccagccgaccatcaccaagcaggaa
gttatctag
>gi568815578f:44380000_44589538|GENSCAN_predicted_peptide_3|83_aa
MTCKMFLESSAVTTSNCFLKAQSNSLGSDLVRGDMAKPTKKVGIVGKYGTRYGASLWKMV
KKVEISQQAKYTCSFCGKTKMKR
>gi568815578f:44380000_44589538|GENSCAN_predicted_CDS_3|252_bp
atgacttgtaagatgtttctagaaagctctgctgtgacaaccagtaactgttttttaaaa
gcccagtccaactctctgggctcggacctagttcgcggtgacatggccaaacctaccaag
aaagtcgggatcgtcggtaaatacgggacccgctatggggcctccctctggaaaatggtg
aagaaagttgaaatcagccagcaggccaagtacacttgctctttctgtggcaaaaccaag
atgaagagatga
>gi568815578f:44380000_44589538|GENSCAN_predicted_peptide_4|342_aa
MSEESDSLRTSPSVASLSENELPPPPEPPGYVCSLTEDLVTKAREELQEKPEWRLRDVQA
LRDMVRKEYPNLSTSLDDAFLLRFLRARKFDYDRALQLLVNYHSCRRSWPEVFNNLKPSA
LKDVLASGFLTVLPHTDPRGCHVVCIRPDRWIPSNYPITENIRAIYLTLEKLIQSEETQV
NGIVILADYKGVSLSKASHFGPFIAKKVIGILQDGFPIRIKAVHVVNEPRIFKGIFAIIK
PFLKEKIANRFFLHGSDLNSLHTNLPRSILPKEYGGTAGELDTATWNAVLLASEDDFVKE
FCQPVPACDSILGQTLLPEGLTSDAQCDDSLRAVKSQLYSCY
>gi568815578f:44380000_44589538|GENSCAN_predicted_CDS_4|1029_bp
atgtccgaagaaagtgactctctgagaaccagcccttctgtggcctcactctctgaaaat
gagctgccaccaccacctgagcctccgggctatgtgtgctcactgacagaagacctggtc
accaaagcccgggaagagctgcaggaaaagccggaatggagacttcgagatgtgcaggcc
cttcgtgacatggtgcggaaggagtaccccaacctgagcacatccctcgacgatgccttc
ctgctgcgcttcctccgagcccgcaagtttgattacgaccgggccctgcagctcctcgtc
aactaccacagctgtagaagaagctggcccgaagtcttcaataacttgaagccatcagcc
ttaaaagatgtccttgcttccgggttcctcaccgtgctgccccacactgaccccaggggc
tgccatgtcgtctgcatccgcccagacagatggataccaagcaactatccaattactgaa
aacatccgagccatatacttgaccttagaaaaactcattcagtctgaagaaacccaggtg
aatggaattgtaattcttgcagactacaaaggagtgagtttatcaaaagcatctcacttt
ggcccttttatagccaaaaaggtgattggcatcctccaggatggtttccccattcggata
aaagcagtccatgtggtgaatgaacctcgaatatttaaaggcatttttgccatcataaaa
ccatttctaaaggagaaaatagcaaacagattcttcctccatgggtctgacttgaactct
ctccacacaaaccttccaagaagcatcctccccaaggagtatgggggcacggctggggag
ctggacactgccacctggaacgcggtactgctggcttcagaagacgattttgtgaaagag
ttctgccaacctgttcctgcctgtgacagcatcctgggccagacgctgctgcccgagggc
ctgacctcagatgcacagtgtgacgactccttgcgagctgtgaagtcacagctgtactcc
tgctactag
>gi568815578f:44380000_44589538|GENSCAN_predicted_peptide_5|626_aa
MATNIINSRKRTLPERGAGSKGVGGAVPGAAEGQRSGRSLPPRDPRRLSPSPPATCRRQA
RSVGAATGRPAPAHLRSRRLSVPGPPVRSASPGQPSARSPQPAAEPVALATWAAELPRHS
RYGGSDGTRTVRVPFRSRGGGRGGTKWESRAELRRRDPGWNLARGRYGKGDIFIPTSSQN
DPFESKNSTVTRLIYAFILLLSTVVSYIMQRKEMETYLKKIPGFCEGGFKIHEADINADK
DCDVLVGYKAVYRISFAMAIFFFVFSLLMFKVKTSKDLRAAVHNGFWFFKIAALIGIMVG
SFYIPGGYFSSVWFVVGMIGAALFILIQLVLLVDFAHSWNESWVNRMEEGNPRLWYAALL
SFTSAFYILSIICVGLLYTYYTKPDGCTENKFFISINLILCVVASIISIHPKIQEHQPRS
GLLQSSLITLYTMYLTWSAMSNEPDRSCNPNLMSFITRITAPTLAPGNSTAVVPTPTPPS
KSGSLLDSDNFIGLFVFVLCLLYSSIRTSTNSQVDKLTLSGSDSVILGDTTTSGASDEED
GQPRRAVDNEKEGVQYSYSLFHLMLCLASLYIMMTLTSWYSPDAKFQSMTSKWPAVWVKI
SSSWVCLLLYVWTLVAPLVLTSRDFS
>gi568815578f:44380000_44589538|GENSCAN_predicted_CDS_5|1881_bp
atggctacaaatataataaacagccgaaaacggactctccctgaacgcggggcggggtca
aagggcgtcgggggcgccgtccccggcgcggctgagggacaaagatcgggccgcagcctc
cctccccgggatccccggcggctcagcccctcgccccctgcgacgtgtcgacgccaggcc
cggagcgttggggccgcaaccggccgcccggctcctgctcacctgcggtctcgccgcctc
tccgtgcctgggccgccggtccgcagcgcctccccggggcagcctagcgcccgcagcccg
caacccgcagcggagcccgttgccttggcgacctgggctgccgaactcccgcggcactcg
cgctacggcggctcggatgggaccaggacggttcgcgtccccttccgcagccgcggaggg
ggcagaggagggacgaagtgggagtcgagggctgagctgcgaaggagggatccgggttgg
aacttggcccggggaagatacggaaagggggacatctttattccaaccagcagccagaat
gatccttttgaaagtaagaattccacggtgactcgcctcatttatgctttcattctcctc
ctgagcactgtcgtatcctatatcatgcagagaaaagagatggaaacttacttgaagaag
attcctggattttgtgaagggggatttaaaatccatgaggctgatataaatgcagataaa
gattgtgatgtgctggttggttataaagctgtgtatcggatcagctttgccatggccatc
tttttctttgtcttttctctgctcatgttcaaagtaaaaacaagtaaagatctccgagcg
gcagtacacaatgggttttggttcttcaaaattgctgcccttattggaatcatggttggc
tctttctacatccctgggggctatttcagctcagtctggtttgttgttggcatgataggg
gccgccctcttcatcctcattcagctggtgctgctggtagattttgctcattcttggaat
gaatcatgggtaaatcgaatggaagaaggaaacccaaggttgtggtatgctgctttactg
tctttcacaagcgccttttatatcctgtcaatcatctgtgtcgggctgctctatacatat
tacaccaaaccagatggctgcacagaaaacaagttcttcatcagtattaacctgatcctt
tgcgttgtggcttctattatatcgatccacccaaaaattcaggaacaccagcctcgctcc
ggcctcttgcagtcctccctcatcaccctctacactatgtacctcacctggtcagccatg
tccaatgaacctgatcgttcctgcaatcccaacctgatgagctttattacacgcataact
gcaccaaccctggctcctggaaattcaactgctgtggtccctacccctactccaccatca
aagagtgggtctttactggattcagataattttattggactgtttgtctttgttctctgc
ctcttgtattctagcatccgcacttccactaatagccaagtagacaagctgaccctgtca
gggagtgacagcgtcatccttggtgatacaactaccagtggtgccagtgatgaagaagat
ggacagcctcggcgggctgtggacaacgagaaagagggagtgcagtatagctactcctta
ttccacctcatgctctgcttggcttccttgtacatcatgatgaccctgaccagctggtac
agccctgatgcaaagtttcagagcatgaccagcaagtggccagctgtgtgggtcaagatc
agctccagctgggtctgcctcctgctttacgtctggacccttgtggctccacttgtcctc
accagtcgggacttcagctga
>gi568815578f:44380000_44589538|GENSCAN_predicted_peptide_6|25_aa
MDCEPDTPNQQSKGDHKVDPQDIEP
>gi568815578f:44380000_44589538|GENSCAN_predicted_CDS_6|78_bp
atggactgtgagccagacacaccaaatcagcagtccaaaggggaccacaaagtggatcct
caggacatcgaaccctga