GENSCAN 1.0 Date run: 4-Nov-116 Time: 16:52:26 Sequence gi568815590r:10625741_10830433 : 204693 bp : 47.02% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3447 3506 60 2 0 40 55 78 0.258 0.95 1.02 Intr + 5829 6111 283 2 1 -4 47 231 0.684 6.79 1.03 Intr + 7560 7653 94 1 1 98 38 95 0.769 4.52 1.04 Intr + 8555 8667 113 2 2 71 37 93 0.686 2.52 1.05 Intr + 18059 18202 144 0 0 83 74 112 0.391 9.55 1.06 Term + 33185 33390 206 0 2 65 55 122 0.117 4.13 1.07 PlyA + 35621 35626 6 1.05 2.00 Prom + 39463 39502 40 -5.66 2.01 Init + 46926 46973 48 2 0 97 72 3 0.261 0.55 2.02 Intr + 48906 49098 193 2 1 88 86 305 0.946 29.37 2.03 Intr + 51932 51994 63 0 0 98 85 37 0.409 3.09 2.04 Intr + 53104 53235 132 2 0 56 37 111 0.414 3.42 2.05 Intr + 63050 63235 186 0 0 119 76 41 0.267 5.66 2.06 Intr + 65839 66045 207 2 0 46 19 142 0.083 2.15 2.07 Intr + 71859 72265 407 1 2 99 94 629 0.403 58.67 2.08 Term + 74495 74731 237 1 0 129 43 339 0.978 29.67 2.09 PlyA + 77737 77742 6 1.05 3.07 PlyA - 78543 78538 6 1.05 3.06 Term - 82688 82668 21 2 0 103 55 14 0.495 -2.09 3.05 Intr - 83614 83096 519 1 0 27 90 287 0.734 15.95 3.04 Intr - 83932 83807 126 1 0 11 60 231 0.999 13.48 3.03 Intr - 84184 84005 180 1 0 20 62 336 0.991 24.26 3.02 Intr - 84326 84203 124 1 1 75 49 202 0.816 15.59 3.01 Init - 97714 97692 23 1 2 66 106 18 0.156 0.64 3.00 Prom - 98449 98410 40 -6.46 4.04 PlyA - 99609 99604 6 1.05 4.03 Term - 100926 99998 929 1 2 46 43 1240 0.971 107.62 4.02 Intr - 104919 104456 464 2 2 145 92 550 0.788 53.90 4.01 Init - 107321 107245 77 2 2 68 33 139 0.464 4.96 4.00 Prom - 113883 113844 40 -3.66 5.00 Prom + 116876 116915 40 -2.16 5.01 Init + 130836 130873 38 2 2 84 93 -1 0.364 -0.50 5.02 Term + 135310 135460 151 1 1 98 43 187 0.991 12.58 5.03 PlyA + 135472 135477 6 1.05 6.12 PlyA - 137861 137856 6 1.05 6.11 Term - 140176 139661 516 1 0 105 41 611 0.999 52.42 6.10 Intr - 146888 146854 35 0 2 69 70 20 0.009 -3.76 6.09 Intr - 166816 166628 189 2 0 105 94 71 0.174 8.96 6.08 Intr - 179510 179451 60 0 0 69 71 49 0.042 0.01 6.07 Intr - 183891 183870 22 0 1 104 110 -3 0.331 0.62 6.06 Intr - 186952 186806 147 1 0 40 87 131 0.588 8.63 6.05 Intr - 193400 193245 156 2 0 77 41 128 0.420 7.21 6.04 Intr - 194529 194453 77 1 2 60 60 -6 0.174 -6.97 6.03 Intr - 200504 200412 93 0 0 71 116 93 0.940 10.34 6.02 Intr - 201709 201611 99 2 0 49 66 68 0.431 0.78 6.01 Init - 202804 202609 196 1 1 48 66 126 0.321 3.56 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:10625741_10830433|GENSCAN_predicted_peptide_1|299_aa MEETYKQPRNKQIQQLLVLERDGREIPLNKGDIRGLAKFHGLSLSPLLHFRDDHSKAGAG SQTASAGFSLPVPPEPVDKGWLEAEGKQPTPEQTKTKGDGSWGALHGGEDTARQVPPLTK SNPKSEGKKPDKATQKGQTLSSKDKGTREGAAPMLVGRAARGCNTDPRMSPTLQDNRMVI ISICGGAGQSIQFQEHVDSAMEIQGMEATRVLPETDLQDQGNKIGCGLYQAAVLKGILPP QAFLGMGWFSTSCRCRAIILTRGHSPSFLQRACPLSASTTIGSSLLLSSLDLMDSLPRS >gi568815590r:10625741_10830433|GENSCAN_predicted_CDS_1|900_bp atggaggaaacatataaacagccaaggaacaagcaaatccagcagctgctggtgctggag agagatggccgggaaatcccactgaacaaaggagacatccgaggcttggcaaagttccac ggtttgtccctgtcccccctcctccacttccgagacgaccacagcaaggctggagctggc tctcagaccgcaagcgctgggttctctctgccagtgccgcccgagcctgtggacaagggc tggttggaggccgaagggaagcagcccacccctgaacagacgaagaccaagggcgatggg tcctggggggctctccacggcggggaagacactgccaggcaagtgcccccattgaccaaa tccaacccgaaatcagagggcaagaagcctgacaaggcaacccagaaaggccagaccctt agttcaaaggacaaagggacccgagaaggagcagcaccgatgcttgtgggcagagctgca agaggctgcaacacggaccctaggatgagccccaccctccaggacaaccgcatggtcatc atcagcatctgtggtggtgcggggcagagcatccagttccaggaacatgtcgatagtgcc atggaaattcagggaatggaagcaaccagagtgttaccagaaactgatctgcaagaccag ggcaacaaaattggctgtggtctctatcaagcagctgttctcaagggaattctgccaccg caggccttcctagggatgggctggttttccacaagctgccgctgccgggccatcattctt acacgtgggcacagcccctccttccttcaaagggcatgcccactgtcagcttccaccacc attggcagctccttgctgctgtcgtccttggacctcatggactcgctccctcgttcatga >gi568815590r:10625741_10830433|GENSCAN_predicted_peptide_2|490_aa MALLTPQGVKEVFQLQRPQGRERLRRLLNWEEFDEQRDSRRSILLDTLYESIIFAVGKGF PWVEVAQVVKFTEELLRETKGSLVFASIILNIRHLDSNTAIGWCLLSLSQEAGPGGVLVS TCVLTALHRDTRESENKPCSPPDAEVYQVALERGHPEGTTCAAVGGGKINRPRTPGQAGA TEEVSSKGGKLSGLFLTILPVPRPWTAGPESSLLIKNDGLMPAHPLGSLNGQPLLSSMRP PTSLWDGPNLSRLASVPLHQVLNPKEFDNPMYAILGGCSITEAVTILGNKLRDYRGHFNT THLLALCDYFHHTFIRHYKLYQYVLGQDQQVDLTVAHLEVCMPPHPLPLAEGMDRDLWIH EQQVATLTEAEAQKRADVLLLKEALRLERENSLQKAFAAAAPAQPGQVLERQELESLICQ AVHTQMELLQELLQRQIQNTFAILDLKLQKKTLNLNAPTPIPPPITSHAGQEEALKPQRA SKGKKAKARK >gi568815590r:10625741_10830433|GENSCAN_predicted_CDS_2|1473_bp atggcactcttaacaccccagggagtgaaagaagtcttccaacttcagagaccacaaggt cgggagcgcctgcggaggcttctgaactgggaggagtttgacgaacagagagactcccgg aggagcatcctgctggacaccctctacgagagcatcatctttgcagtgggcaaaggcttc ccatgggtggaggtggcccaggtggtcaagttcacagaagagctgctaagggaaaccaaa ggtagccttgtgtttgccagtattattcttaacatacggcatctggattcaaacacagca atcggatggtgcctgctgtccctgagccaggaggctggcccaggtggcgtcctagtcagc acctgtgttcttactgccttgcacagggacactcgtgagagcgagaacaagccctgcagt cccccagatgccgaggtttaccaggtggctttggaaaggggtcaccctgaagggaccact tgtgctgcagtaggaggtgggaagatcaacagaccacggaccccgggacaggcaggagcc acagaagaggtctcaagcaaaggtggcaagctctcagggctcttcctgacaatcctgcct gtccccaggccgtggacagcaggcccagagtccagcttgctgatcaaaaatgatggcctc atgcctgctcaccctctgggatccctcaatgggcagcctctgctgtcctccatgcggcca ccgacttccctctgggatggccccaacctctcacgtctggcctcagtacctctccaccag gttcttaaccccaaggaatttgataatcctatgtacgctatccttggaggctgctccatt actgaggctgtgacgatcctggggaacaagcttagagattaccggggccatttcaacacc acccacctgctggccctctgtgactacttccaccacaccttcatccgccactacaaactc taccagtatgtcctgggccaggaccagcaggtcgacctgaccgttgcccacctggaggtg tgcatgccaccccatcccctcccgctggccgagggcatggacagggacttgtggatccac gagcagcaggtggccacactgacggaggccgaggcacagaagcgcgccgacgtgctgctc ctgaaagaggcgctgcgcctggagcgggagaactcgctgcagaaggcgttcgctgccgcc gcgcctgcgcagcccggccaggtcctggagagacaggagttggagagcctcatctgccag gcagtccacacccagatggagctcctgcaggagctgctgcagcgccagatccagaacaca ttcgccatcttggacctgaagcttcagaagaagactctgaacctcaacgcccccacccct atcccgccccccatcaccagccacgcaggccaggaggaagccctgaagccccaaagagcg agcaaaggaaagaaagcgaaggcaaggaagtag >gi568815590r:10625741_10830433|GENSCAN_predicted_peptide_3|330_aa MRLSEEGKASTNAHEMWKLPTINVTANVTNWVTVNGWTIENDWISVNDQISVINRVSVIE WNCVIDQVCIIDWISVNDKVSVIDWVCMINWKSVINWIHVINWICVIDQINVIDWISVND WINVIDWIRVNDWISAIDQISVINCISKIDRISVIDWISVIDQVSVIDWVSVIDWISVIG QVSVVDWPMTGTKAGQPLACMKWAVPPSGIFGPESELLFHVDSLTLFIGSLLCPVSEKFS KTAYNSLECGILGDEDSRGTQEVEFPMGKAAVFTCLKLALRCDMGDLGGNPDPVLQPALP ITKTVTLDKSSSSRFPFPQPEVAENPTAML >gi568815590r:10625741_10830433|GENSCAN_predicted_CDS_3|993_bp atgaggctgtctgaagaaggaaaggcttccacaaatgctcatgaaatgtggaagctgccc actatcaatgtgacagccaatgtgaccaactgggtcactgtgaatggctggaccattgaa aatgactggatcagtgtgaatgaccagatcagtgtgatcaaccgggtcagtgtgatcgaa tggaactgtgtgattgaccaggtctgcataatcgactggattagtgtgaatgacaaagtc agtgtgatcgactgggtctgcatgatcaactggaagagcgtgatcaactggatccatgtg atcaactggatctgtgtgattgaccagatcaatgtgattgactggatcagtgtgaatgac tggatcaatgtgattgactggatcagggtgaatgactggatcagtgcaattgaccagatc agtgtgatcaactgcatcagtaagatcgaccggatcagtgtgatcgactggattagtgtg atcgaccaggtcagtgtgattgactgggtcagtgtgattgactggatcagtgtgattggc caggtcagtgtggtcgactggcccatgactggcaccaaagcaggccagcctttggcctgc atgaagtgggcagtgccacccagcggcatttttggacccgagtctgagctccttttccac gttgacagcctcaccctgtttatagggtcccttctgtgcccagtttctgaaaagttttca aaaactgcttacaacagtttggagtgtggcattttgggggatgaagattcacgaggaacc caggaggttgagtttcctatgggaaaggccgcagtcttcacatgccttaagcttgcattg aggtgtgacatgggggacctgggtgggaatccagaccctgtgttgcaacctgcactgcca ataaccaagacagtgactttggacaaatcctcttcttcccgttttccttttcctcagcca gaggtggctgagaacccaactgccatgttgtga >gi568815590r:10625741_10830433|GENSCAN_predicted_peptide_4|489_aa MERASPATAVLTISKPRAAPASALGRSERGPGGGPGRREGTRADTEGGAAPPWRVIGGRG RPKLINQGPGRGCGPSWTPRPVRGPGPRLPRQAKRGDPRAAMASLLGAYPWPEGLECPAL DAELSDGQSPPAVPRPPGDKGSESRIRRPMNAFMVWAKDERKRLAVQNPDLHNAELSKML GKSWKALTLSQKRPYVDEAERLRLQHMQDYPNYKYRPRRKKQAKRLCKRVDPGFLLSSLS RDQNALPEKRSGSRGALGEKEDRGEYSPGTALPSLRGCYHEGPAGGGGGGTPSSVDTYPY GLPTPPEMSPLDVLEPEQTFFSSPCQEEHGHPRRIPHLPGHPYSPEYAPSPLHCSHPLGS LALGQSPGVSMMSPVPGCPPSPAYYSPATYHPLHSNLQAHLGQLSPPPEHPGFDALDQLS QVELLGDMDRNEFDQYLNTPGHPDSATGAMALSGHVPVSQVTPTGPTETSLISVLADATA TYYNSYSVS >gi568815590r:10625741_10830433|GENSCAN_predicted_CDS_4|1470_bp atggagcgagcgagcccggctaccgcggtgctgacaatttccaagccccgggcggcgccc gcgtcagcgctgggaaggtcggagcggggacccggcggcggcccgggacgcagggagggg acccgcgccgacaccgagggaggggccgcgccgccttggagagtcattggaggacgcggc cgcccgaagctgataaatcaggggccgggtcgcggctgcgggccaagttggacgccccga cccgtgcgagggccaggtccgcgcctgccccgccaggcgaagcgaggcgacccgcgtgcg gccatggcttcgctgctgggagcctacccttggcccgagggtctcgagtgcccggccctg gacgccgagctgtcggatggacaatcgccgccggccgtcccccggcccccgggggacaag ggctccgagagccgtatccggcggcccatgaacgccttcatggtttgggccaaggacgag aggaaacggctggcagtgcagaacccggacctgcacaacgccgagctcagcaagatgctg ggaaagtcgtggaaggcgctgacgctgtcccagaagaggccgtacgtggacgaggcggag cggctgcgcctgcagcacatgcaggactaccccaactacaagtaccggccgcgcaggaag aagcaggccaagcggctgtgcaagcgcgtggacccgggcttccttctgagctccctctcc cgggaccagaacgccctgccggagaagagaagcggcagccggggggcgctgggggagaag gaggacaggggtgagtactcccccggcactgccctgcccagcctccggggctgctaccac gaggggccggctggtggtggcggcggcggcaccccgagcagtgtggacacgtacccgtac gggctgcccacacctcctgaaatgtctcccctggacgtgctggagccggagcagaccttc ttctcctccccctgccaggaggagcatggccatccccgccgcatcccccacctgccaggg cacccgtactcaccggagtacgccccaagccctctccactgtagccaccccctgggctcc ctggcccttggccagtcccccggcgtctccatgatgtcccctgtacccggctgtccccca tctcctgcctattactccccggccacctaccacccactccactccaacctccaagcccac ctgggccagctttccccgcctcctgagcaccctggcttcgacgccctggatcaactgagc caggtggaactcctgggggacatggatcgcaatgaattcgaccagtatttgaacactcct ggccacccagactccgccacaggggccatggccctcagtgggcatgttccggtctcccag gtgacaccaacgggtcccacagagaccagcctcatctccgtcctggctgatgccacggcc acgtactacaacagctacagtgtgtcatag >gi568815590r:10625741_10830433|GENSCAN_predicted_peptide_5|62_aa MEVTPFCHPFMNCTTGSETQQLQNQRDKQGGDEEVLTPINVQLLPADPRAEITRTDCAES LL >gi568815590r:10625741_10830433|GENSCAN_predicted_CDS_5|189_bp atggaggttacacccttctgccacccctttatgaactgcacaacaggctccgaaacgcag cagctgcagaaccagcgagataaacagggcggtgacgaagaggttcttactccaatcaac gtacagctgctgcctgctgatccaagggctgagatcactcggacagactgtgccgaaagc cttctctga >gi568815590r:10625741_10830433|GENSCAN_predicted_peptide_6|529_aa MWPLRHVCLGFGPIAGASWVCLISVVLCSEFRHHSSSLMEWLWEGNPRIHGKPPVQRLVP HTLAVDQPALINLSGKDLLSGSHFPFRQRRYHSDVFLNDSSDKKEKKSFSLEEKSKISKN RVHYMKFTKGKDLSSRSKTDLDCIFGKRQSKKTPEQMEAAGWERIPGFGACADYAESRDR QWSLGRSRSDIRPCRLLLHKGLLERAMEYVQGIAFRDSGVWKACYAQKQMEEMGVVIRVT VWTKEELALTDEIHRKCPHSIGTASRLSVLDGSSKHALQEVAAEPLLSDGGCQALLTMDC EEEAQSVPSGSAQRHGGGTVNDFDAVWLTVELEARCARGPRGISLSGKFPEAAHCRDLGD ASPSTPEENETTTTSAFTIQEYFAKRMAALKNKPQVPVPGSDISETQVERKRGKKRNKEA TGKDVESYLQPKAKRHTEGKPERAEAQERVAKKKSAPAEEQLRGPCWDQSSKASAQDAGD HVQPPEGRDFTLKPKKRRGKKKLQKPVEIAEDATLEETLVKKKKKKDSK >gi568815590r:10625741_10830433|GENSCAN_predicted_CDS_6|1590_bp atgtggcctctgcgtcatgtctgcctgggcttcggtcccatcgcgggggcttcctgggtg tgccttatcagtgtcgtcctctgtagtgagttcaggcatcacagcagctctcttatggag tggctgtgggaaggaaatccaagaattcatggaaagcctccagtccagcgcctcgtccct catacacttgcagtcgatcaacctgctctgatcaacctttcaggcaaggacctactttct ggctcacattttcccttccgtcaaaggcgttaccactcggatgtcttcctcaatgattcc tcggacaagaaggaaaagaaatcttttagccttgaggaaaagtccaaaatctccaaaaac cgtgttcactatatgaaattcacaaaagggaaggatctgtcatctcggagcaaaacagat cttgactgcatttttgggaaaagacagagtaagaagactcccgagcagatggaagcagca ggctgggaaaggatacctggctttggtgcctgcgcggactatgctgagtcacgggatagg cagtggtcactagggaggagccgctctgacatccggccgtgccggctgctccttcacaag gggctgctggaaagagccatggagtacgttcagggaattgccttcagggacagtggcgtc tggaaagcatgttatgcacagaaacagatggaagaaatgggtgtggttatcagggtgaca gtttggacaaaggaagaactagcactgactgatgaaattcatcgaaagtgccctcacagc ataggaacagccagcaggctgtctgtgctggatggttcctccaagcacgccctgcaggag gtcgcagcagaaccactcctgagtgatggcgggtgccaggcgctcttaaccatggactgt gaagaagaggcacagtctgtcccttcagggagcgcacagcgtcacgggggagggacggtc aatgattttgatgcggtttggttaactgttgagctggaggcaaggtgtgcacgtggcccc cggggcatctccctctcaggcaaattccccgaggctgctcattgccgagacttgggcgat gccagtccctccactccagaggagaacgaaaccacgacaaccagcgccttcaccatccag gagtactttgccaagcggatggcagcactgaagaacaagccccaggttccagttccaggg tctgacatttctgagacgcaggtggaacgtaaaagggggaagaaaagaaataaagaggcc acaggtaaagatgtggaaagttacctccagcctaaggccaagaggcacacggagggaaag cccgagagggccgaggcccaggagcgagtggccaagaagaagagcgcgccagcagaagag cagctcagaggcccctgctgggaccagagttccaaggcctctgctcaggatgcaggggac catgtgcagccgcctgagggccgggacttcaccctgaagcccaaaaagaggagagggaag aaaaagctgcaaaaaccagtagagatagcagaggacgctacactagaagaaacgctagtg aaaaagaagaagaagaaagattccaaatga