GENSCAN 1.0    Date run: 7-Nov-116    Time: 04:09:20

Sequence gi568815586r:10601291_10823069 : 221779 bp : 37.79% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 PlyA -   1125   1120    6                                 1.05
 1.05 Term -   3363   3252  112  2  1   55   42    73 0.151  -3.55
 1.04 Intr -   6646   6564   83  1  2   77  106    56 0.460   3.82
 1.03 Intr -   8651   8541  111  2  0   60   72   134 0.926   8.66
 1.02 Intr -   9390   9332   59  1  2  117  107    29 0.996   5.38
 1.01 Init -  12242  12149   94  2  1   82   93    54 0.975   5.89
 1.00 Prom -  13040  13001   40                               -7.25

 2.10 PlyA -  15373  15368    6                                 1.05
 2.09 Term -  19058  18854  205  1  1  106   49   274 0.970  21.16
 2.08 Intr -  23569  23361  209  1  2   85    9    80 0.187  -3.25
 2.07 Intr -  26434  26351   84  1  0  107   99    82 0.977  10.30
 2.06 Intr -  28384  28203  182  2  2  128   88   131 0.991  15.77
 2.05 Intr -  30018  29755  264  1  0   90   93   214 0.999  18.66
 2.04 Intr -  32834  32700  135  0  0   58   98    97 0.983   7.32
 2.03 Intr -  33396  33277  120  1  0   40  103    77 0.159   3.95
 2.02 Intr -  40767  40695   73  0  1   94   61     5 0.028  -3.54
 2.01 Init -  41036  40959   78  2  0   84  110    14 0.412   4.31
 2.00 Prom -  49970  49931   40                               -4.05

 3.03 PlyA -  49992  49987    6                                 1.05
 3.02 Term -  60261  60058  204  0  0   49   32   159 0.212   2.79
 3.01 Init -  72957  72664  294  0  0   42   39   228 0.569  10.53
 3.00 Prom -  74524  74485   40                               -5.05

 4.03 PlyA -  75873  75868    6                                 1.05
 4.02 Term -  78809  78729   81  2  0   90   34   116 0.375   3.11
 4.01 Init -  94954  94769  186  1  0   65   60   127 0.129   6.71
 4.00 Prom -  98832  98793   40                               -3.95

 5.17 PlyA -  99593  99588    6                                 1.05
 5.16 Term - 100063  99998   66  1  0  104   41    37 0.650  -2.44
 5.15 Intr - 100844 100670  175  1  1   94   84   186 0.927  17.82
 5.14 Intr - 102858 102761   98  0  2  109   77   166 0.578  15.49
 5.13 Intr - 104958 104887   72  0  0   56  102    41 0.174   0.98
 5.12 Intr - 108824 108618  207  2  0   59   98   300 0.924  26.45
 5.11 Intr - 112043 111921  123  2  0  103   63   295 0.999  28.36
 5.10 Intr - 114493 114404   90  1  0   99  100   135 0.998  15.07
 5.09 Intr - 116418 116353   66  0  0   61   55    90 0.691   1.18
 5.08 Intr - 121271 120777  495  2  0   56   43   406 0.669  25.16
 5.07 Intr - 121413 121365   49  2  1   49   99    42 0.390  -0.84
 5.06 Intr - 121661 121560  102  1  0   60  109    94 0.104   7.07
 5.05 Intr - 129869 129728  142  0  1  100   91    34 0.018   3.39
 5.04 Intr - 131782 131681  102  2  0  112   58    64 0.122   5.03
 5.03 Intr - 138980 138941   40  2  1   57   87    23 0.036  -4.02
 5.02 Intr - 140034 139891  144  0  0   73   69    82 0.081   4.36
 5.01 Init - 144037 143558  480  1  0   78   86   147 0.055   8.91
 5.00 Prom - 145685 145646   40                               -6.15

 6.12 PlyA - 147231 147226    6                                 1.05
 6.11 Term - 148873 148697  177  1  0   41   32   167 0.008   3.10
 6.10 Intr - 154416 153891  526  2  1   43   39   280 0.000  10.62
 6.09 Intr - 162988 162847  142  2  1   85   77    53 0.100   2.39
 6.08 Intr - 163429 163141  289  1  1   21   31   215 0.020   5.00
 6.07 Intr - 173117 173004  114  2  0   79   89    89 0.013   7.82
 6.06 Intr - 190605 190075  531  0  0   51   39   218 0.010   5.40
 6.05 Intr - 191159 190997  163  1  1    2   38   198 0.001   5.26
 6.04 Intr - 196947 196833  115  1  1   33   31   101 0.002  -2.51
 6.03 Intr - 208169 207987  183  0  0  108  -11   164 0.020   7.34
 6.02 Intr - 214857 214704  154  0  1   74   24   115 0.583   2.42
 6.01 Intr - 215656 215550  107  2  2   72   81    65 0.248   3.21

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl - 154382 153729  654  2  0   91   36   293 0.876  20.32
S.002 Term + 176028 176123   96  2  0  108   38   113 0.970   5.29
S.003 Sngl - 191131 190838  294  1  0   51   38   265 0.835  13.25
S.004 Term - 208169 207847  323  0  2  108   44   161 0.812   7.70

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:10601291_10823069|GENSCAN_predicted_peptide_1|152_aa
MAVASDFYLRYYVGHKGKFGHEFLEFEFRPDGKLRYANNSNYKNDVMIRKEAYVHKSVME
ELKRIIDDSEITKEDDALWPPPDRVGRQELEIVIGDEHISFTTSKIGSLIDVNQSKCQPQ
KQEQSSFELPVGPQLLVVPQNRLWLLLNKMIH

>gi568815586r:10601291_10823069|GENSCAN_predicted_CDS_1|459_bp
atggctgtggctagcgatttctacctgcgctactacgtagggcacaagggcaagtttggg
cacgagtttctggagttcgaatttcggccggacggaaagcttagatatgccaacaacagc
aattacaaaaatgatgtcatgatcagaaaagaggcttatgtgcacaagagtgtaatggaa
gaactgaagagaattattgatgacagtgaaattacaaaagaagatgatgctttgtggcct
ccccctgatagggttggccgacaggagcttgaaattgtaattggagatgagcacatatct
tttaccacatcaaaaataggttctcttattgatgtaaatcagtcaaaatgtcaacctcag
aaacaggaacaatcgtcttttgaacttccagtaggcccacagttgttggttgttcctcaa
aacaggttgtggctcctgttgaataagatgatccattaa

>gi568815586r:10601291_10823069|GENSCAN_predicted_peptide_2|449_aa
MNDRNEIQMEAKLQSLTIIAQEILCRFFITLRRHARFLLTKLGRQGMARSGITHSCAVCI
LCGPSREGDSPVAMGMTRMLLECSLSDKLCVIQEKQYEVIIVPTLLVTIFLILLGVILWL
FIREQRTQQQRSGPQGIAPVPPPRDLSWEAGHGGNVALPLKETSVENFLGATTPALAKLQ
VPREQLSEVLEQICSGSCGPIFRANMNTGDPSKPKSVILKALKEPAGLHEVQDFLGRIQF
HQYLGKHKNLVQLEGCCTEKLPLYMVLEDVAQGDLLSFLWTCRRDVMTMDGLLYDLTEKQ
VYHIGKQVLLALEFLQEKHLFHGDVAARNILMQSDLTAKLCGLGLAYEVYTRGAISSTQT
IPLKWLAPERLLLRPASIRADVYSIMKSCWRWREADRPSPRELRLRLEAAIKTADDEAVL
QVPELVVPELYAAVAGIRVESLFYNYSML

>gi568815586r:10601291_10823069|GENSCAN_predicted_CDS_2|1350_bp
atgaatgataggaatgagattcaaatggaagccaaactccaaagtcttaccattatagca
caggaaattctatgcaggttctttattacccttaggagacatgcacgtttcctgctcact
aaactaggaaggcaaggaatggcaaggtcaggaattactcacagctgtgctgtgtgcatt
ctctgtgggcctagcagggaaggggacagccctgtggcaatgggcatgacacggatgctc
ctggaatgcagtctcagtgacaagttgtgtgtcatccaggagaagcagtatgaagtgatt
atcgtcccaactttgttggttactatcttcctcatccttcttggggtcatcctgtggctt
tttatcagagaacaaagaactcaacagcagcgttctggacctcaaggcattgcccctgtt
cctccacctagggacctaagctgggaagcaggacatggaggaaatgtggctttgccactt
aaggagacatccgtggaaaactttctgggagctaccacacctgccctggctaagctgcag
gtgccgcgggagcaactctctgaagttctggagcagatttgcagtggtagctgtgggccc
atctttcgagccaatatgaacactggggacccttctaagcccaagagtgttattctcaag
gctttaaaagaaccagctgggctccatgaggtacaagatttcttagggcgaatccaattc
catcaatacctggggaaacacaaaaacctggtgcagctggaaggctgctgcactgaaaag
ctgccactctatatggtgttggaggatgtggcccagggggacctgctcagctttctctgg
acctgtcggcgggatgtgatgactatggatggtcttctctatgatctcacagaaaaacaa
gtatatcacatcggaaagcaggtccttttggcgctggaattcctgcaggagaagcatttg
ttccatggggatgtggcagccaggaatattctgatgcaaagtgatctcactgctaagctc
tgtggattaggcctggcttatgaagtttacacccgaggggccatctcctctactcaaacc
atacctctcaagtggcttgccccagaacggcttctcctgagacctgctagcatcagagca
gatgtgtacagtatcatgaagtcctgctggcgctggcgtgaggctgaccgcccctcacct
agagagctgcgcttgcgcctagaagctgccattaaaactgcagatgacgaggctgtgtta
caagtaccagagttggtggtacctgaactgtatgcagctgtggccggcatcagagtggag
agcctcttctacaactatagcatgctttga

>gi568815586r:10601291_10823069|GENSCAN_predicted_peptide_3|165_aa
MSIDQQQVQPLQLQQRFPVARVWRVPETSEPADFPSREPARTPSFAGSHAPQWEGGTEKH
FLEAGTPCPTQPCHYFPSDWAGAIGNRSDQRRDTVVNGVIVGIKTSIKEVCVKDKYQLRN
SRAGWIPKQIASLGLLAGLCSMQMCEESGFCARGGRDDTDQEYNI

>gi568815586r:10601291_10823069|GENSCAN_predicted_CDS_3|498_bp
atgagtatagatcagcaacaagtccagcctttacagttgcagcagaggtttcctgtggct
cgagtgtggcgagtcccggaaacctcggagcccgcagacttcccttcgcgggagcccgcc
cgaactccatcctttgccggcagccacgccccgcagtgggaaggagggactgaaaagcat
ttccttgaggctggcacaccttgccctacccaaccctgtcattatttcccctccgactgg
gccggtgccatcggaaaccggagtgaccagaggagggacacggtagtgaacggggtaata
gtgggaatcaaaaccagtatcaaggaagtttgtgttaaagacaaatatcagttaaggaat
agcagagcagggtggattccaaagcagattgcttcacttggtttattggctggactgtgc
tccatgcagatgtgtgaagagtctgggttttgtgcaagaggaggaagagatgacacggat
caagaatataacatctaa

>gi568815586r:10601291_10823069|GENSCAN_predicted_peptide_4|88_aa
MTGVLVRSNTRDMRAQGGGHLKKLQEEGRPSASHGEASVNQTCQHLDLGLPTSRPEKRNF
CRGEYSLLEETDKETTNEKRSGTEKKEG

>gi568815586r:10601291_10823069|GENSCAN_predicted_CDS_4|267_bp
atgactggtgttcttgtgaggagcaacaccagagacatgcgtgcacaggggggtggccat
ttgaagaagttgcaagaggaaggaaggccatctgcaagccacggagaggcctcagtgaac
caaacctgccagcaccttgatcttggacttccaacctccagacctgagaaaaggaatttc
tgtcgtggagaatatagcctattagaagaaacagacaaggaaactacaaatgagaaacgc
tctggtaccgagaagaaagaaggatag

>gi568815586r:10601291_10823069|GENSCAN_predicted_peptide_5|816_aa
MPSLTTPIKIVLEVLARAIRQEKEIKSIQLGKEEVKLSLFTDGMIIYLENPTVSAQNLLK
LISNFGKVSGYKINVQKSQALLYTSNRQTESQITSKLPFTIATKRIKYLGIQLTRDVKDL
FKENYKPMLNKIKEDTNKWKNIPDSWIGRINIVKMHILPKLGPLQVLQRLVRNQKFTISN
VKRSFEKLGEAVEKRYSVNTVLKKESSQRSDNKQAKLTRVVVPSNDWTQLEARAKKPVEA
VQSGSFSGAQSREKNGNERDPLITKFTLNRRWSPCGAKGISNPNPAFSRQTSNSNSTVAY
FCRKPRWGRGPRSHGHRGRRLFSHRRRQRRRGEKSSRGLRVPSAGRLPCRRSQTGTVADS
RGGKCLRLEKKGEAVWERGPWVGMDVGEGPTRGPGALGSLLGIWERRAFPLGKRRALEGD
GEGKIEKLLARKGQVSVPDALPTWAWERRHAPVGHMPAKRCCNRLRETWVLVPRLPPPLC
AHGCKQPFSVHQWLHASLFPDCGNARGVAAEKTLSEVGVLVKAPLSEQLSIEVNSAEKQI
TAIKKNNPRKYLRSVGDGETVEFDVVEGEKGAEAANVTGPDGVPVEGSRYAADRRRYRRG
YYGRRRGPPRNYAGEEEEEGSGSSEGFDPPATDRQFSGARNQLRRPQYRPQYRQRRFPPY
HVGQTFDRRSRVLPHPNRIQSYPWSLPYPLPHQQLLKPLNGQIKAGEIGEMKDGVPEGAQ
LQGPVHRNPTYRPRYRSRGPPRPRPAPAVGEAEDKENQQATSGPNQPSVRRGYRRPYNYR
RRPRPPNAPSQDGKEAKAGEAPTENPAPPTQQSSAE

>gi568815586r:10601291_10823069|GENSCAN_predicted_CDS_5|2451_bp
atgccctctctcaccactcctattaaaatagtgttggaagttctggccagggcaatcagg
caagagaaagaaataaagagtattcaattaggaaaagaggaagtcaaattgtccctgttt
acagatggcatgattatatacttagaaaaccccactgtctcagcccaaaatctccttaag
ctgataagcaacttcggcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagca
ttactatacaccagtaacagacaaacagagagccaaatcacgagtaaactcccattcaca
attgctacaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctc
ttcaaggagaactacaaaccaatgctcaacaaaataaaagaggacacaaacaaatggaag
aacattccagattcatggataggaagaatcaatattgtgaaaatgcacatactgcccaag
ttgggccctcttcaggtgctgcagagattggtgagaaaccagaagttcacaatcagtaat
gtcaaaagaagctttgagaagctgggagaggctgtggaaaaaaggtacagtgtgaacaca
gtcttgaaaaaagaaagctctcagaggagtgataacaaacaagccaagttaactagagtt
gtagtgccctccaatgattggacccagctggaagccagagcaaagaagcctgtagaagct
gttcaaagtggttcattctctggggcacagagcagggagaaaaatggaaatgagagggat
cctctgattaccaaatttactctcaacaggcgttggtccccatgtggagcaaaaggcatc
agtaaccccaatccagctttttctcgacagacttctaactcaaacagcactgtggcctat
ttctgcaggaaaccccggtggggacgcggcccccgcagccacgggcaccgcggccgccgc
ctctttagccaccgccgccggcagcgaagacgcggagaaaaaagttctcggggtctccgg
gtccccagcgctggccgcctcccttgccggcgctcccagacgggcactgttgcggattcg
cgtggtggaaaatgcctgcgtttggagaagaaaggagaggcagtctgggagaggggacct
tgggtagggatggatgtaggggaaggaccgacacgtgggcctggcgctttggggtccttg
cttgggatctgggaaaggagagcatttcctttggggaagaggagggcgctagaaggagac
ggggagggaaagatagaaaagcttcttgccaggaagggtcaggtgtctgtccctgacgcc
cttcccacatgggcatgggaaagacgccatgctcctgttggccacatgccagcaaagaga
tgttgcaatcgtctccgggaaacttgggtcttagtcccacgtcttcctccccctctctgt
gcccacggctgcaaacagccattcagtgtccaccaatggctacacgcttccctcttcccg
gactgtgggaacgcacgtggtgtagcagctgaaaaaactttgtcggaagtgggggtattg
gttaaagctcctctgtcagagcagctctcaattgaagtaaatagcgcagagaagcagata
actgccatcaagaagaataacccacggaaatatctgcgcagtgtaggagatggagaaact
gtagagtttgatgtggttgaaggagagaagggtgcagaagctgccaatgtgactggcccg
gatggagttcctgtggaagggagtcgttacgctgcagatcggcgccgttacagacgtggc
tactatggaaggcgccgtggccctccccggaattacgctggggaggaggaggaggaaggg
agcggcagcagtgaaggatttgacccccctgccactgataggcagttctctggggcccgg
aatcagctgcgccgcccccagtatcgccctcagtaccggcagcggcggttcccgccttac
cacgtgggacagacctttgaccgtcgctcacgggtcttaccccatcccaacagaatacag
agttacccctggtctctcccttacccgttacctcaccaacaacttctaaagccattaaat
gggcagatcaaggctggtgagattggagagatgaaggatggagtcccagagggagcacaa
cttcagggaccggttcatcgaaatccaacttaccgcccaaggtaccgtagcaggggacct
cctcgcccacgacctgccccagcagttggagaggctgaagataaagaaaatcagcaagcc
accagtggtccaaaccagccgtctgttcgccgtggataccggcgtccctacaattaccgg
cgtcgcccgcgtcctcctaacgctccttcacaagatggcaaagaggccaaggcaggtgaa
gcaccaactgagaaccctgctccacccacccagcagagcagtgctgagtaa

>gi568815586r:10601291_10823069|GENSCAN_predicted_peptide_6|833_aa
XLEPEPTELGVVALGQIWMQPKVQPGVIMTPREAPKETGCSQGAACPSCIPQVLRILHFQ
AEEAGLEHPSISSAFSSHQTLLTRAWDFRHTKQIRLHATGFRDPSTEAHMRAIKAVIIFL
LLLIVYYPVFLVMTSSALIPQGKLVLMIELRVTKAFESRTSSTLMEAMVSISLDAVRTSD
TVLYGLRGLEYSEDKKMWESLELPRDWLNGFDQNADSDMNNKVRVEVVSDEDEELLGTEV
KPRDLVPCIPAAPAMAKRGQGTAQVMASGGTSPKFWQLPCGVESAGAQKSIIEVWEPPPR
FQRMCGNAWMSRQRCAAGAGPSWRTSAKAVWKGNVGLETSHRVPTWALPSGAVRRGSPSS
TPQNGRSTNNLHHVPGKTADTQHQPVKAVKREVVPCKPTEAELPRAVAAHLLHQHNLDNP
DFVQGNNVPSGSQVGAFRNSLLADLTVTALELPNYSVRSHGPGSRLAHIDSDSRSAQLQT
GSHDTSVQASPWNPTLQIYLNKPRVQNCYSTSQYQADLMDPSSRTTATDPGSRTDFVDPG
PKTQAYYYIMDPGARPAQILIQTPNQSLQRFCYMAHTESLDELTGEGFSLLKPVYKEWKR
DLVPCIPATPAMTKRGHGTARAIASEGGSPKPWQLPCGIEPAGPQKSIIKVWEPPPRFQR
MYGNSWMSRQKLAAGSGPSWRTSARAVWKGNVRLEPPHRVPTGTPPSGAVRIGPPSSRPQ
NGRSTDSLLHVPGKVTDTQHQPMKAARNGTIPCKATGAELPKAMGAHFLHQHDLDKLAKR
VKAVHKAMTVMAMIPTGEPTVIQTFGNPHSNAKETLPFSSSFYRKRTRKGEIK

>gi568815586r:10601291_10823069|GENSCAN_predicted_CDS_6|2502_bp
ntcttagagccagagcccacagagctgggggtggtggcattggggcagatctggatgcag
cccaaggtacagccaggagtgattatgactcctagagaagctcctaaggagactggatgt
tctcaaggagcagcctgtccatcttgcatcccccaggttctccggattctccatttccaa
gcagaagaagcaggactggagcacccttccatatcatcagccttctcctcgcaccagacc
ttactcaccagggcctgggactttagacacaccaagcagattcgactgcatgctacaggg
ttcagagaccccagtacagaggcccacatgagggccataaaggcagtgatcatctttctg
ctcctcctcatcgtgtactacccagtctttcttgttatgacctctagcgctctgattcct
cagggaaaattagtgttgatgattgagctcagggtcaccaaagcctttgagagcaggact
agcagcaccttgatggaagccatggtatccatctcccttgatgctgtaagaacctcagac
acagtcctttatgggttgagaggtttggagtactcagaagacaaaaagatgtgggagagt
ttggaacttcctagagactggttgaatggctttgaccaaaatgctgatagcgatatgaac
aataaggtccgggttgaggtggtctcagatgaagatgaggaacttttgggaactgaagta
aagcctagggacttagtgccctgcatcccagctgctccagccatggctaaaaggggccaa
ggtacagctcaggtcatggcttcagggggtacaagccccaagttttggcagcttccatgt
ggtgttgagtctgcaggtgcacagaagtcaataattgaggtttgggaacctccacctaga
tttcagagaatgtgtggaaatgcgtggatgtccaggcagaggtgtgctgcaggggctggg
ccctcatggagaacttctgctaaggcagtgtggaagggaaatgtggggttggagacctca
cacagagtccccacttgggcactgcctagtggagctgtgagaagagggtcaccatcctct
acaccccaaaatggtagatccaccaacaacttgcaccatgtacctggaaaaactgcagac
actcaacaccagcctgtgaaagcagtcaagagggaggttgtaccttgcaaacccacagag
gcggagctgcccagggctgtggcagcccaccttttgcatcagcataacctagataaccct
gattttgttcaggggaataatgtgcccagtggatctcaagtgggagcatttcggaactct
ttgctggctgatctgactgtcacagcactggagctccctaactattcagttagatcccat
ggcccaggatccaggctggcccacatagactcagactccaggtctgcccagctccagact
ggttcccatgacaccagtgtccaggccagcccctggaaccccacactacagatttacttg
aacaaacccagggtccagaactgctacagtacatcccaataccaggctgacctcatggac
ccaagctccaggaccactgctacagatccaggatccaggacagactttgtggatccagga
ccaaagacccaagcctactattatatcatggacccaggtgccagacctgctcaaatatta
atccagacaccaaaccagtctctccagagattctgttacatggcccacacagaatctcta
gatgaactgactggtgaagggttttccctgctgaagccagtctataaagaatggaaaagg
gacttggtgccctgcatcccagctactccagccatgactaaaaggggccatggtacagct
cgggctattgcttcagagggtggaagtcccaagccttggcagcttccatgtggtattgag
cctgcaggtccacagaaatctataattaaggtttgggaacctccacctagatttcagagg
atgtatggaaactcctggatgtccaggcagaagttagctgcagggtctgggccctcatgg
agaacttctgctagggcagtgtggaagggaaatgtgagattggagcccccacacagagtc
cctactgggacaccacctagtggagctgtgagaatagggccaccatcctccagaccccag
aatggtagatctaccgacagcttgctccatgtgcctggaaaagtcacagacactcaacac
cagcccatgaaagcagccaggaatgggactataccctgcaaagccacaggggcggagctg
cccaaggccatgggagcccactttttgcatcagcatgacctggataagctggcgaaacgc
gtgaaggccgtacacaaggcaatgacggtgatggcaatgattcccacgggagaaccaact
gtgatccagacctttggcaatcctcacagtaatgcaaaggagactttaccattctcttcc
tcattttacagaaaaagaactagaaagggagaaatcaagtaa