GENSCAN 1.0 Date run: 19-Feb-121 Time: 20:36:30 Sequence gi568815597r:116419231_116670972 : 251742 bp : 44.51% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 9605 9644 40 -2.16 1.01 Init + 25309 25371 63 0 0 76 82 72 0.148 6.55 1.02 Term + 26353 26418 66 0 0 80 41 51 0.069 -2.46 1.03 PlyA + 28422 28427 6 1.05 2.03 PlyA - 29348 29343 6 1.05 2.02 Term - 38986 38885 102 1 0 31 48 162 0.487 4.78 2.01 Init - 45764 45741 24 2 0 69 119 7 0.398 1.50 2.00 Prom - 45944 45905 40 -4.46 3.03 PlyA - 46293 46288 6 1.05 3.02 Term - 47836 47698 139 0 1 75 42 166 0.620 8.14 3.01 Init - 50055 50027 29 0 2 84 73 29 0.599 0.07 3.00 Prom - 52472 52433 40 -5.36 4.07 PlyA - 53982 53977 6 1.05 4.06 Term - 54191 54040 152 0 2 80 47 62 0.087 -0.63 4.05 Intr - 58942 58822 121 1 1 117 119 13 0.104 7.37 4.04 Intr - 77126 77049 78 2 0 125 31 32 0.321 0.95 4.03 Intr - 78503 78477 27 2 0 124 79 1 0.475 1.11 4.02 Intr - 79673 79572 102 2 0 46 72 74 0.377 1.97 4.01 Init - 91802 91749 54 2 0 86 81 63 0.456 4.73 4.00 Prom - 106741 106702 40 -2.26 5.04 PlyA - 107284 107279 6 1.05 5.03 Term - 113918 113706 213 2 0 -8 49 328 0.542 16.53 5.02 Intr - 114237 113926 312 0 0 -29 -53 296 0.178 0.18 5.01 Init - 115018 114284 735 1 0 28 6 563 0.496 36.67 5.00 Prom - 115796 115757 40 -2.16 6.03 PlyA - 118500 118495 6 1.05 6.02 Term - 130939 130833 107 2 2 61 49 103 0.725 2.27 6.01 Init - 151742 151673 70 2 1 83 97 152 0.999 14.81 6.00 Prom - 155651 155612 40 -7.36 7.12 PlyA - 155743 155738 6 -0.45 7.11 Term - 158332 158082 251 2 2 108 47 276 0.896 21.37 7.10 Intr - 160647 160162 486 1 0 103 89 745 0.989 69.09 7.09 Intr - 165822 165415 408 1 0 118 94 445 0.996 42.34 7.08 Intr - 169874 169464 411 0 0 109 110 588 0.999 57.26 7.07 Intr - 176092 175955 138 2 0 38 110 91 0.130 6.74 7.06 Intr - 181115 180711 405 0 0 107 109 561 0.252 54.42 7.05 Intr - 184795 184394 402 2 0 98 97 630 0.995 59.30 7.04 Intr - 189101 188712 390 0 0 81 89 593 0.772 53.40 7.03 Intr - 194945 194535 411 0 0 94 65 538 0.743 46.26 7.02 Intr - 197227 196850 378 2 0 102 82 392 0.463 34.94 7.01 Init - 247096 247054 43 1 1 107 106 103 0.855 12.58 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 21215 21072 144 2 0 34 54 169 0.835 2.01 S.002 Term - 151342 151143 200 2 2 68 48 60 0.894 -2.44 S.003 Intr + 234564 234825 262 2 1 96 86 78 0.923 5.46 S.004 Term + 235699 235814 116 2 2 100 49 47 0.940 0.73 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:116419231_116670972|GENSCAN_predicted_peptide_1|42_aa MEPEDATHVDETHLANAQPENTNSTHEKIKPVSKLKAAPLTA >gi568815597r:116419231_116670972|GENSCAN_predicted_CDS_1|129_bp atggagccagaggatgcaactcatgtggatgagacccatttggccaatgcacagccggag aataccaactcaacacatgagaagattaagccagtatccaagctcaaggctgcaccccta acagcctga >gi568815597r:116419231_116670972|GENSCAN_predicted_peptide_2|41_aa MTGTRKKLRVGIPLALPQLHPECRQLKYTSALTELAGDVQL >gi568815597r:116419231_116670972|GENSCAN_predicted_CDS_2|126_bp atgactgggacaagaaagaagctgagggttgggattccactggccctgccccagctgcac cctgaatgccgtcagctcaagtacaccagcgcgctgacggagctggcaggtgacgtgcag ctgtga >gi568815597r:116419231_116670972|GENSCAN_predicted_peptide_3|55_aa MDNINAVIKGLIPIISTLPLSSQPLSIVTVITIIISTTIITVITITTIITTILPS >gi568815597r:116419231_116670972|GENSCAN_predicted_CDS_3|168_bp atggataacatcaatgcagtaattaaaggtcttatccccatcatctcaacactacctcta tcatcacaaccactgtcaattgtcaccgtcatcaccattatcatcagcactactatcatt actgtcatcaccatcaccaccatcattaccaccatcttaccatcatga >gi568815597r:116419231_116670972|GENSCAN_predicted_peptide_4|177_aa MALASGLCSWSLAAAQHLTVAPRNLLLPYLMGLNRRYIPRFTIATFRERLHQIQSYKQRQ QDHADSVGPGPGSTSQASEHVNLRTEQVLTSLSCFPLLQYGFDILIITATAKIPLCFINP EENTMSADPGTHSLDSKFHEDRVHVSFISEVPGPSKGPGTMQMLKKYCWMLNEWMNE >gi568815597r:116419231_116670972|GENSCAN_predicted_CDS_4|534_bp atggcactggcttctgggctatgcagctggagcctagctgcagcccagcacctgaccgtg gcaccaaggaacctgctgctaccctacttgatggggctgaacaggagatatattccacgc ttcacaattgccacctttcgggagcggcttcaccagattcagtcatataagcagaggcag caggaccatgctgattctgtgggaccaggcccgggctctaccagccaggcctctgaacat gtcaacttgagaactgaacaggttctcacatcgctttcctgtttccctctcctgcaatat ggatttgacatattaatcattacagcaactgctaaaattccactgtgcttcattaatccg gaggaaaataccatgtcagctgaccctggcactcactcattagattccaagttccatgag gacagggtccatgtctccttcatcagtgaggtcccagggcctagcaaagggcctggcacg atgcagatgctcaagaaatattgctggatgttgaatgaatggatgaatgaatga >gi568815597r:116419231_116670972|GENSCAN_predicted_peptide_5|419_aa MGKKVLGIASQLLGAVRCEPDGRRVLPEDWPECLSLVTHGTWFTRQGDKNIQMADNSFSD GVPSDFVEAAKNASKTEKLTDQVMQNPGVLAALQERLDNAPHTPSSYIETLPKAVKRRIN ALKQIQVRCAHIEAKFYEEVHDLERKYAALYQPLFDERREFITGDVEPTSDMESEWHREN EEEEKLAGDRKNKVVITEKAAATAEEPNPKGIPEFWFTIFRNVDMLSELVQEYDEPILKY LQDIKFHFQPNDCFANSVLTKTCKMKSEPDKADPFSFAGPEIMDCDGCTIDWKKGKNVTV KTIKKKQKRKGPGTVRTITKQVSNESFFNFFNPLKASGDGESLDKDSEFPLILKLDTFSV SGPQAVLYFAGEALKDDDNFDEGEEEEDELEGDEEGEDEDEVEINPNKEPSQPAECKQQ >gi568815597r:116419231_116670972|GENSCAN_predicted_CDS_5|1260_bp atggggaagaaagtccttggcattgcatctcagctgttgggagctgtgcggtgcgagcca gatgggcggcgtgtgctccccgaggactggcctgagtgccttagcctagttactcacggg acttggtttaccagacaaggggataaaaacattcagatggcagataacagtttttcagat ggggttccttcagatttcgtggaagctgctaaaaatgcaagtaagacggaaaagctcacg gatcaggtgatgcagaatcctggagttttggcagctttacaggagcgacttgacaatgcc cctcacaccccttccagctacatcgaaactttacctaaagcagtaaaaagaagaattaat gcattgaaacaaattcaggtgagatgcgctcatatagaagccaagttctatgaagaagtt catgacttggaaagaaaatatgcagcattataccagcctctctttgacgagagaagagaa tttatcaccggtgatgttgaaccaacatctgatatggaatcagaatggcaccgtgaaaat gaagaggaggagaaattggctggagacaggaaaaataaagtagtcataacagaaaaagca gcagcaacagctgaagagccaaatcccaaaggaattccagagttctggtttaccatcttt agaaatgtagatatgctaagtgaattagtccaggaatatgatgaaccaatcttgaaatac ctgcaggatattaaattccactttcaacccaatgactgctttgccaactcagtcctgaca aaaacctgcaagatgaaatcagaaccagataaggctgatcccttttcatttgcaggtcct gagatcatggactgcgatgggtgtactattgactggaagaaaggaaagaatgttactgtc aaaaccatcaagaaaaagcagaagcgtaagggtccaggcactgttagaacaattaccaaa caagtatccaatgagtcatttttcaacttcttcaatccactgaaagcatctggggacgga gaatcactggataaagattctgaattccctctgattttgaaattggacactttttccgtg agcggaccacaggctgtgctgtacttcgctggggaggccttaaaggatgatgacaatttt gacgaaggtgaagaagaagaggacgaattagaaggtgatgaggagggagaagatgaggat gaggtggaaattaatcccaacaaggaacccagccagccggcggaatgcaagcagcagtag >gi568815597r:116419231_116670972|GENSCAN_predicted_peptide_6|58_aa MVAGSDAGRALGVLSVVCLLHCFEIATAIAAFSNQHPDQSAAINIEARPFTSKNIRTC >gi568815597r:116419231_116670972|GENSCAN_predicted_CDS_6|177_bp atggttgctgggagcgacgcggggcgggccctgggggtcctcagcgtggtctgcctgctg cactgctttgaaattgccacagccattgcagcctttagcaaccaacaccctgatcagtca gcagccatcaacattgaggcaagacctttcaccagcaaaaatattaggacttgctga >gi568815597r:116419231_116670972|GENSCAN_predicted_peptide_7|1240_aa MKCFFPVLSCLAVLGVVSAQRQVTVQEGPLYRTEGSHITIWCNVSGYQGPSEQNFQWSIY LPSSPEREVQIVSTMDSSFPYAIYTQRVRGGKIFIERVQGNSTLLHITDLQARDAGEYEC HTPSTDKQYFGSYSAKMNLVVIPDSLQTTAMPQTLHRVEQDPLELTCEVASETIQHSHLS VAWLRQKVGEKPVEVISLSRDFMLHSSSEYAQRQSLGEVRLDKLGRTTFRLTIFHLQPSD QGEFYCEAAEWIQDPDGSWYAMTRKRSEGAVVNVQPTDKEFTVRLETEKRLHTVGEPVEF RCILEAQNVPDRYFAVSWAFNSSLIATMGPNAVPVLNSEFAHREARGQLKVAKESDSVFV LKIYHLRQEDSGKYNCRVTEREKTVTGEFIDKESKRPKNIPIIVLPLKSSISVEVASNAS VILEGEDLRFSCSVRTAGRPQGRFSVIWQLVDRQNRRSNIMWLDRDGTVQPGSSYWERSS FGGVQMEQVQPNSFSLGIFNSRKEDEGQYECHVTEWVRAVDGEWQIVGERRASTPISITA LEMGFAVTAISRTPGVTYSDSFDLQCIIKPHYPAWVPVSVTWRFQPVGTVEFHDLVTFTR DGGVQWGDRSSSFRTRTAIEKAESSNNVRLSISRASDTEAGKYQCVAELWRKNYNNTWTR LAERTSNLLEIRVLQPDRMGVSARALRGELSPSQIPQACCHGNTGALVIGIKEPGSLPYL QTVTKLQVSKSKRTLTLVENKPIQLNCSVKSQTSQNSHFAVLWYVHKPSDADGKLILKTT HNSAFEYGTYAEEEGLRARLQFERHVSGGLFSLTVQRAEVSDSGSYYCHVEEWLLSPNYA WYKLAEEVSGRTEVTVKQPDSRLRLSQAQGNLSVLETRQVQLECVVLNRTSITSQLMVEW FVWKPNHPERETVARLSRDATFHYGEQAAKNNLKGRLHLESPSPGVYRLFIQNVAVQDSG TYSCHVEEWLPSPSGMWYKRAEDTAGQTALTVMRPDASLQVDTVVPNATVSEKAAFQLDC SIVSRSSQDSRFAVAWYSLRTKAGGKRSSPGLEEQEEEREEEEEEDDDDDDDPTERTALL SVGPDAVFGPEGSPWEGRLRFQRLSPVLYRLTVLQASPQDTGNYSCHVEEWLPSPQKEWY RLTEEESAPIGIRVLDTSPTLQSIICSNDALFYFVFFYPFPIFGILIITILLVRFKSRNS SKNSDGKNGVPLLWIKEPHLNYSPTCLEPPVLSIHPGAID >gi568815597r:116419231_116670972|GENSCAN_predicted_CDS_7|3723_bp atgaagtgctttttcccggtgctgagctgtctggctgtgctgggtgtggtgtcagcacag cggcaggtcaccgttcaggaaggacccttgtaccgcacggagggctcccacatcactatc tggtgcaatgtgagtggctaccagggaccttctgagcagaatttccagtggtccatttac ctgccttcgtcgccagagcgagaggtgcagatcgtcagcaccatggactcttccttcccc tatgccatctacacccagcgcgtccgcggagggaagatcttcatagaaagagtccagggg aactcaaccctattgcacatcacagatcttcaggcccgggatgccggggagtatgaatgc cacacacccagcactgataagcaatactttgggagttacagtgcaaagatgaacctagtg gtgatcccagactccctgcagaccactgccatgccccagactctgcacagagtggagcag gacccgctggagctcacttgtgaggtggcctcagagaccattcagcacagccacctgtct gtggcctggctccggcagaaagttggcgagaagcccgtggaggtcatctccctgagccga gatttcatgcttcactccagcagcgaatatgcccagaggcagagcctgggggaggtgcgg ctggacaagctggggaggaccaccttccgcctcaccatcttccacctgcagccttctgac cagggcgaattctactgcgaggccgccgagtggatccaggatccggatgggtcgtggtat gctatgacccgaaagcgttccgagggagccgtggtcaacgtccagccaactgacaaagaa ttcactgttcggctggagacagagaagcggctgcacacggtgggcgagccggtggagttc agatgcatcctggaggctcagaatgttcccgaccgttactttgctgtctcctgggccttc aacagctcgctcatcgccaccatgggtcctaacgctgtgcctgtcctcaacagcgaattt gctcaccgggaagccaggggacagcttaaggtggccaaagagagcgacagtgtctttgtg ctgaagatctaccacctccgccaggaagatagcgggaaatacaactgccgggtgactgag cgagagaaaaccgtgaccggggaattcattgataaggagagcaagcgtcccaagaacatc cccatcatagtcctccccctcaagagcagcatctccgtggaggtggccagcaatgccagc gtcatccttgagggcgaggacctgcgcttctcctgcagtgtccgcacggcaggcaggccg cagggtcgcttctctgtcatctggcagcttgtggacaggcagaaccgccgcagcaatatc atgtggctagaccgggatggcaccgtgcagccaggctcgtcctactgggagcgcagcagc tttgggggcgtccagatggagcaggtgcagcccaactcgttcagcctgggcatcttcaac agcaggaaggaggacgagggccagtatgaatgccatgtgactgaatgggtgcgggcagtg gatggcgagtggcagattgttggggagcgccgggccagcactcccatctccatcacagct cttgaaatgggcttcgcagtcacagccatctcccggacaccgggggtgacctacagcgac tcctttgacttgcagtgtatcatcaaaccccactaccctgcctgggtccccgtgtcggtg acatggcggttccagccggtgggcacggtggagttccatgacttggtgaccttcacccgg gacggaggggtccagtggggggacaggtcctccagcttccgaacccgaactgccatcgag aaggctgagtccagcaacaacgtccgcctaagcatcagccgagccagtgacacggaagca ggcaagtaccagtgtgtggcagagctgtggcggaagaactacaacaacacctggacgcga ctggcggagaggacctccaacctgctggagatcagggtgctgcagccagatagaatgggt gtcagtgcaagagccctaagaggagagctcagcccctcccagatccctcaggcctgttgt catggtaacaccggggcacttgttattggcatcaaggagccagggtcactgccctacctg cagacagtgacaaagctgcaggtgagcaaatcgaagaggaccctcaccctggtggaaaac aagcccattcagttgaactgctcagtcaagtctcagactagccagaactcccactttgcg gtgctctggtatgtccacaagccctcggatgccgatggcaagcttatcctgaagaccacc cacaactccgcctttgaatacggtacttacgccgaggaggagggcctgagagccaggctc cagtttgagaggcatgtgtcggggggcctgttcagcctcaccgtccagagagccgaggtc agcgacagcggcagctactactgccacgtggaggagtggctgctgagccccaactacgcc tggtacaagctggcagaggaggtttctgggcgcacagaagtcactgtgaaacagccagac agccgcctgaggctcagccaagcccaggggaacctgtcggttctggagacccggcaggta cagctggagtgtgtggttctcaaccgcaccagcataacctcccagctcatggtggaatgg tttgtatggaagcccaaccaccctgagcgggagactgtggcccgcttgagccgtgacgcc accttccactatggagagcaggcagccaagaacaatctgaaggggcggctgcatttggag agtccttcccccggcgtgtaccgtctcttcatccagaacgtggctgtgcaggacagcggg acctacagctgccatgtggaggagtggctgcccagccccagtggcatgtggtataagcgg gcagaggacaccgctgggcagacagctctgacagtcatgcgaccagatgcttccctgcag gtggacacagtggtccccaatgccacggtctctgagaaggcagctttccagctggactgt agcatcgtgtcccgctccagccaggactcccgcttcgctgtggcctggtattccctgagg actaaagctggggggaaaaggagcagccctggcctggaagaacaggaagaggaaagggag gaggaggaggaggaggacgacgacgacgacgacgacccaacagagcggacggccctgctg agcgtgggcccagatgctgtctttggcccagagggcagtccttgggagggcaggcttcgc ttccagaggctctccccggtgctctaccggctcacagtgctgcaggcaagcccccaagat acaggcaattactcctgccatgtggaggagtggctgcccagccctcagaaggaatggtac cggctgacggaggaggagtcagcccccatcggcatccgtgttctagatacaagtcccacc ctccagtccatcatctgctccaacgacgcactcttctacttcgtcttcttctaccctttc cccatctttggcattcttatcatcaccatccttctggtgcgtttcaagagccggaactcc agcaagaactctgatgggaagaatggggtgcctctgctgtggatcaaagagccacacctc aactactcccctacttgcctggagccccctgttctcagtatccatccaggggccatagac taa