GENSCAN 1.0 Date run: 6-Nov-116 Time: 00:04:16 Sequence gi568815597f:161526588_161708214 : 181627 bp : 43.34% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 PlyA - 205 200 6 1.05 1.07 Term - 4213 4031 183 1 0 103 49 105 0.300 5.64 1.06 Intr - 11591 11504 88 1 1 97 54 68 0.177 4.27 1.05 Intr - 13971 13866 106 1 1 1 30 120 0.189 -3.23 1.04 Intr - 18371 18114 258 0 0 97 100 115 0.884 11.03 1.03 Intr - 22091 21834 258 0 0 91 109 243 0.994 24.13 1.02 Intr - 22444 22424 21 2 0 110 105 28 0.940 4.22 1.01 Init - 23149 23110 40 1 1 81 97 61 0.694 4.78 1.00 Prom - 24415 24376 40 -6.66 2.00 Prom + 34730 34769 40 -6.16 2.01 Init + 38639 38641 3 1 0 72 115 0 0.178 1.10 2.02 Intr + 54779 54962 184 1 1 90 92 24 0.490 2.36 2.03 Term + 56503 56597 95 2 2 92 42 125 0.862 6.29 2.04 PlyA + 58564 58569 6 1.05 3.00 Prom + 59576 59615 40 -4.76 3.01 Init + 60895 60948 54 0 0 99 90 3 0.829 0.89 3.02 Intr + 61835 61855 21 1 0 113 119 -6 0.800 2.74 3.03 Intr + 63013 63232 220 0 1 -24 65 211 0.897 5.47 3.04 Intr + 64557 64811 255 1 0 98 86 243 0.997 22.52 3.05 Intr + 65542 65655 114 2 0 92 72 120 0.984 11.22 3.06 Intr + 66839 66895 57 0 0 94 109 24 0.764 3.96 3.07 Intr + 74353 74479 127 2 1 77 103 74 0.236 7.54 3.08 Intr + 79682 80711 1030 2 1 18 -86 2139 0.047 180.07 3.09 Term + 80725 81630 906 0 0 -34 42 1240 0.547 99.50 3.10 PlyA + 81824 81829 6 1.05 4.08 PlyA - 81845 81840 6 1.05 4.07 Term - 85818 85636 183 0 0 90 45 105 0.636 3.94 4.06 Intr - 93016 92837 180 1 0 94 28 99 0.395 4.56 4.05 Intr - 94209 94130 80 1 2 43 113 -11 0.262 -3.83 4.04 Intr - 99815 99558 258 0 0 102 100 114 0.885 11.43 4.03 Intr - 103448 103191 258 0 0 91 109 283 0.995 28.13 4.02 Intr - 103801 103781 21 2 0 113 105 28 0.951 4.52 4.01 Init - 104507 104468 40 2 1 81 97 61 0.852 4.78 4.00 Prom - 105778 105739 40 -6.66 5.00 Prom + 116075 116114 40 -6.16 5.01 Init + 120029 120031 3 1 0 72 115 0 0.087 1.10 5.02 Intr + 136583 136766 184 1 1 90 92 24 0.385 2.36 5.03 Intr + 143665 143685 21 2 0 113 119 5 0.539 3.62 5.04 Intr + 144805 145062 258 2 0 98 65 269 0.752 23.03 5.05 Intr + 146388 146642 255 1 0 105 91 243 0.860 23.72 5.06 Intr + 147373 147486 114 2 0 92 72 120 0.999 11.22 5.07 Intr + 148670 148726 57 0 0 94 109 13 0.851 2.86 5.08 Intr + 150741 150778 38 1 2 116 84 33 0.994 3.68 5.09 Term + 150889 150966 78 0 0 97 34 109 0.973 4.16 5.10 PlyA + 151448 151453 6 1.05 6.03 PlyA - 152517 152512 6 1.05 6.02 Term - 158566 158258 309 1 0 -49 54 287 0.314 6.66 6.01 Init - 162851 162708 144 2 0 69 75 87 0.975 5.62 6.00 Prom - 163690 163651 40 -5.16 7.00 Prom + 163832 163871 40 -6.66 7.01 Init + 167580 167725 146 2 2 68 98 85 0.839 7.09 7.02 Intr + 170225 170547 323 2 2 42 40 183 0.202 4.40 7.03 Term + 171198 171370 173 1 2 87 41 132 0.300 6.39 7.04 PlyA + 171478 171483 6 1.05 8.00 Prom + 172223 172262 40 -2.06 8.01 Init + 180678 180756 79 2 1 104 106 113 0.898 13.93 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 79704 80711 1008 2 0 85 -86 2112 0.929 187.01 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:161526588_161708214|GENSCAN_predicted_peptide_1|317_aa MWQLLLPTALLLLVSAGMRTEDLPKAVVFLEPQWYRVLEKDSVTLKCQGAYSPEDNSTQW FHNESLISSQASSYFIDAATVDDSGEYRCQTNLSTLSDPVQLEVHIGWLLLQAPRWVFKE EDPIHLRCHSWKNTALHKVTYLQNGKGRKYFHHNSDFYIPKATLKDSGSYFCRGLFGSKN VSSETVNITITQAAKSYHYNPSRKNQIPQNYPYSTKAEEDIGNHTTRRPVELAIWASVMD CLLPGSSIVTDRIRSGMLPAPQLFETPHWIRRVSDSATVIGAEDPRETGRHRVYLDMARP LYRSRGALEESPGKMEP >gi568815597f:161526588_161708214|GENSCAN_predicted_CDS_1|954_bp atgtggcagctgctcctcccaactgctctgctacttctagtttcagctggcatgcggact gaagatctcccaaaggctgtggtgttcctggagcctcaatggtacagggtgctcgagaag gacagtgtgactctgaagtgccagggagcctactcccctgaggacaattccacacagtgg tttcacaatgagagcctcatctcaagccaggcctcgagctacttcattgacgctgccaca gtcgacgacagtggagagtacaggtgccagacaaacctctccaccctcagtgacccggtg cagctagaagtccatatcggctggctgttgctccaggcccctcggtgggtgttcaaggag gaagaccctattcacctgaggtgtcacagctggaagaacactgctctgcataaggtcaca tatttacagaatggcaaaggcaggaagtattttcatcataattctgacttctacattcca aaagccacactcaaagacagcggctcctacttctgcagggggctttttgggagtaaaaat gtgtcttcagagactgtgaacatcaccatcactcaagcagctaagagttatcactacaac cctagtcggaaaaaccaaatacctcaaaattacccgtacagcactaaggcagaagaggac attgggaaccacacaacgcggagacctgtggagctggcaatatgggcaagtgtcatggac tgtctactgccaggaagctccattgtcaccgacaggatcagaagtggcatgctcccagct ccgcagctgttcgagactccgcactggatccgccgggtgtcggattcagcaacagtgatt ggtgcggaggatccgagggagacgggccgacacagggtctaccttgacatggctcgaccc ctctacagaagcagaggggcgctcgaagagtcgccagggaaaatggagccttaa >gi568815597f:161526588_161708214|GENSCAN_predicted_peptide_2|93_aa MGRIRQALREGCDCCALGASSLQGVMGILSFLPVLATESDWADCKSPQPWGHMLLWTAVL FLGTVLGLGDLSVNMTDSLPDRKYEVNELIVKQ >gi568815597f:161526588_161708214|GENSCAN_predicted_CDS_2|282_bp atgggtagaatccgccaagctttgagagaaggctgtgactgctgtgctctgggcgccagc tcgctccagggagtgatgggaatcctgtcattcttacctgtccttgccactgagagtgac tgggctgactgcaagtccccccagccttggggtcatatgcttctgtggacagctgtgcta ttcctgggcactgttctaggacttggagacttatcagtgaacatgactgacagccttcct gaccggaaatacgaggtcaatgaacttattgtgaagcaataa >gi568815597f:161526588_161708214|GENSCAN_predicted_peptide_3|927_aa MGGERWLTPVIPEFWEAELLLLGHLWINVLQEDSVTLTCRGTHSPESDSIPWFHNGNLIP THTQPSYRFKANNNDSGEYTCQTGQTSLSDPVHLTVLSEWLVLQTPHLEFQEGETIVLRC HSWKDKPLVKVTFFQNGKSKKFSRSDPNFSIPQANHSHSGDYHCTGNIGYTLYSSKPVTI TVQAPSSSPMGIIVAVVTGIAVAAIVAAVVALIYCRKKRISALSGYPECREMGETLPEKP AILDIKNLLFHIHTANTINQTTVMKRCSNMRNAYVTGYMRTIIIRQEASAMQAPRELAVG IDLGTTYSCVGVFQQGRVEILANDQGNRTTPSYVAFTDTERLVGDAAKSQAALNPHNTVF DAKRLIGRKFADTTVQSDMKHWPFQVVSEGGKPKVRVCYRGEDKTFYPEEISSMVLSKMK ETAEAYLGQPVKHAVITVPTYFSNSQRQATKDAGAIAGLKVLPIINEATAAAIAYGLDRR GAGKRNVLIFDLGGGTFDVSVLSIDAGVFEVKATAGDTHLGGEDFDNRLVNHFMEEFRRK HGKDLSGNKRALRRLRTACERAKRTPSSSTQATLEIDSLFEGVDFYKSITRARFEELCSD LFRSTLEPVEKALRDAKLDKAQIHDFGSTRIPKVQKLLQDFFNGKELNKSINPDEAVAYG AAVQAAVLMGDKCEKVQDLLLLDVAPLSLGLETAGGVMTTLIQRNATIPTKQTQTFTTYS DNQPGVFIQVYEVERAMTKDNNLLGRFELIGIPPAPHGVPQIEVTFDIDANGILSVTATD RSTGKANKITNDKGRLSKEEVERMVHEAEQYGAEDEAQRDRVAAKNSLEAHVFHVKGSLQ EESLRDKIPEEDRRKVQDKCQEVLAWLEHNQLAEKEEYEHQKRELEQICRPIFSRLYGGP GVPGGSSCSAQAHQGDPSTGPIIEEVD >gi568815597f:161526588_161708214|GENSCAN_predicted_CDS_3|2784_bp atgggcggggaacggtggctcacacctgtaatcccagaattttgggaggctgagctcctg ttgctgggacacctgtggatcaacgtgctccaagaggactctgtgactctgacatgccgg gggactcacagccctgagagcgactccattccgtggttccacaatgggaatctcattccc acccacacgcagcccagctacaggttcaaggccaacaacaatgacagcggggagtacacg tgccagactggccagaccagcctcagcgaccctgtgcatctgactgtgctttctgagtgg ctggtgctccagacccctcacctggagttccaggagggagaaaccatcgtgctgaggtgc cacagctggaaggacaagcctctggtcaaggtcacattcttccagaatggaaaatccaag aaattttcccgttcggatcccaacttctccatcccacaagcaaaccacagtcacagtggt gattaccactgcacaggaaacataggctacacgctgtactcatccaagcctgtgaccatc actgtccaagctcccagctcttcaccgatggggatcattgtggctgtggtcactgggatt gctgtagcggccattgttgctgctgtagtggccttgatctactgcaggaaaaagcggatt tcagctctctcaggataccctgagtgcagggaaatgggagagaccctccctgagaaacca gccattcttgacatcaagaatcttctgttccacatccacacagccaatacaattaatcaa accactgttatgaaaagatgtagcaacatgagaaatgcttatgttacaggttacatgaga acaatcatcatccgacaagaagcttcagccatgcaggccccacgggagctcgcggtgggc atcgacctgggcaccacctactcgtgcgtgggcgtgtttcagcagggccgcgtggagatc ctggccaacgaccagggcaaccgcaccacgcccagctacgtggccttcaccgacaccgag cggctggtcggggacgcggccaagagccaggcggccctgaacccccacaacaccgtgttc gatgccaagcggctgatcgggcgcaagttcgcggacaccacggtgcagtcggacatgaag cactggcccttccaggtggtgagcgagggcggcaagcccaaggtgcgcgtatgctaccgc ggggaggacaagacgttctaccccgaggagatctcgtccatggtgctgagcaagatgaag gagacggccgaggcgtacctgggccagcccgtgaagcacgcagtgatcaccgtgcccacc tatttcagtaactcgcagcgccaggccaccaaggacgcgggggccatcgcggggctcaag gtgctgccgatcatcaatgaggccacggcagcagccatcgcctatgggctggaccggcgg ggcgcgggaaagcgcaacgtgctcatttttgacctgggtgggggcaccttcgatgtgtcg gttctctccattgacgccggtgtctttgaggtgaaagccactgctggagatacccacctg ggaggagaggacttcgacaaccggctcgtgaaccacttcatggaagaattccggcggaag catgggaaggacctgagcgggaacaagcgtgccctgcgcaggctgcgcacagcctgtgag cgcgccaagcgcaccccgtcctccagcacccaggccaccctggagatagactccctgttc gagggcgtggacttctacaagtccatcactcgtgcccgctttgaggaactgtgctcagac ctcttccgcagcaccctggagccggtggagaaggccctgcgggatgccaagctggacaag gcccagattcatgacttcggctccactcgcatccccaaggtgcagaagttgctgcaggac ttcttcaacggcaaggagctgaacaagagcatcaaccctgatgaggctgtggcctatggg gctgctgtgcaggcggccgtgttgatgggggacaaatgtgagaaagtgcaggatctcctg ctgctggatgtggctcccctgtctctggggctggagacagcaggtggggtgatgaccacg ctgatccagaggaacgccactatccccaccaagcagacccagactttcaccacctactcg gacaaccagcctggggtcttcatccaggtgtatgaggttgagagggccatgaccaaggac aacaacctgctggggcgttttgaactcattggcatccctcctgccccacatggagtcccc cagatagaggtgacgtttgacattgatgctaatggcatcctgagcgtgacagccactgac aggagcacaggtaaggctaacaagatcaccaatgacaagggccggctgagcaaggaggag gtggagaggatggttcatgaagccgagcagtacggggctgaggatgaggcccagagggac agagtggctgccaaaaactcgctggaggcccatgtcttccatgtgaaaggttctttgcaa gaggaaagccttagggacaagattcccgaagaggacaggcgcaaagtgcaagacaagtgt caggaagtccttgcctggctggagcacaaccagctggcagagaaggaggagtatgagcat cagaagagggagctggagcaaatctgtcgccccatcttctccaggctctatggggggcct ggtgtccctgggggcagcagttgtagcgctcaagcccaccagggggaccccagcaccggc cccatcattgaggaggttgattga >gi568815597f:161526588_161708214|GENSCAN_predicted_peptide_4|339_aa MWQLLLPTALLLLVSAGMRTEDLPKAVVFLEPQWYSVLEKDSVTLKCQGAYSPEDNSTQW FHNENLISSQASSYFIDAATVNDSGEYRCQTNLSTLSDPVQLEVHIGWLLLQAPRWVFKE EDPIHLRCHSWKNTALHKVTYLQNGKDRKYFHHNSDFHIPKATLKDSGSYFCRGLVGSKN VSSETVNITITQGVFPNVIPPPFPTHDRSRCVMFPTRAHTCGAGNMGKCHGLSTSRKLHC HRQDQKWHGKWYKKAHSGTVLKTSPFGGASHAKGIGLEKLPAPQLFETPHWIRRVSDSAT VIGAEDPRETGRHRVYLDMARPLYRSRGALEESPGKMEP >gi568815597f:161526588_161708214|GENSCAN_predicted_CDS_4|1020_bp atgtggcagctgctcctcccaactgctctgctacttctagtttcagctggcatgcggact gaagatctcccaaaggctgtggtgttcctggagcctcaatggtacagcgtgcttgagaag gacagtgtgactctgaagtgccagggagcctactcccctgaggacaattccacacagtgg tttcacaatgagaacctcatctcaagccaggcctcgagctacttcattgacgctgccaca gtcaacgacagtggagagtacaggtgccagacaaacctctccaccctcagtgacccggtg cagctagaagtccatatcggctggctgttgctccaggcccctcggtgggtgttcaaggag gaagaccctattcacctgaggtgtcacagctggaagaacactgctctgcataaggtcaca tatttacagaatggcaaagacaggaagtattttcatcataattctgacttccacattcca aaagccacactcaaagatagcggctcctacttctgcagggggcttgttgggagtaaaaat gtgtcttcagagactgtgaacatcaccatcactcaaggtgtttttcctaatgttatccct ccccccttccccacccacgacaggtcccggtgtgtgatgttccccacccgtgcacacacc tgtggagctggcaatatgggcaagtgtcatggactgtctacctccaggaagctccattgt caccgacaggatcagaagtggcatggtaaatggtacaagaaagcccattcgggcacagtc ctgaagaccagcccttttggaggtgcttctcatgcaaagggaattgggctggaaaaactc ccagctccgcagctgttcgagactccgcactggatccgccgggtgtcggattcagcaaca gtgattggtgcggaggatccgagggagacgggccgacacagggtctaccttgacatggct cgacccctctacagaagcagaggggcgctcgaagagtcgccagggaaaatggagccttaa >gi568815597f:161526588_161708214|GENSCAN_predicted_peptide_5|335_aa MGRIRQALREGCDCCALGASSLQGVMGILSFLPVLATESDWADCKSPQPWGHMLLWTAVL FLAPVAGTPAAPPKAVLKLEPQWINVLQEDSVTLTCRGTHSPESDSIQWFHNGNLIPTHT QPSYRFKANNNDSGEYTCQTGQTSLSDPVHLTVLSEWLVLQTPHLEFQEGETIVLRCHSW KDKPLVKVTFFQNGKSKKFSRSDPNFSIPQANHSHSGDYHCTGNIGYTLYSSKPVTITVQ APSSSPMGIIVAVVTGIAVAAIVAAVVALIYCRKKRISALPGYPECREMGETLPEKPANP TNPDEADKVGAENTITYSLLMHPDALEEPDDQNRI >gi568815597f:161526588_161708214|GENSCAN_predicted_CDS_5|1008_bp atgggtagaatccgccaagctttgagagaaggctgtgactgctgtgctctgggcgccagc tcgctccagggagtgatgggaatcctgtcattcttacctgtccttgccactgagagtgac tgggctgactgcaagtccccccagccttggggtcatatgcttctgtggacagctgtgcta ttcctggctcctgttgctgggacacctgcagctcccccaaaggctgtgctgaaactcgag ccccagtggatcaacgtgctccaggaggactctgtgactctgacatgccgggggactcac agccctgagagcgactccattcagtggttccacaatgggaatctcattcccacccacacg cagcccagctacaggttcaaggccaacaacaatgacagcggggagtacacgtgccagact ggccagaccagcctcagcgaccctgtgcatctgactgtgctttctgagtggctggtgctc cagacccctcacctggagttccaggagggagaaaccatcgtgctgaggtgccacagctgg aaggacaagcctctggtcaaggtcacattcttccagaatggaaaatccaagaaattttcc cgttcggatcccaacttctccatcccacaagcaaaccacagtcacagtggtgattaccac tgcacaggaaacataggctacacgctgtactcatccaagcctgtgaccatcactgtccaa gctcccagctcttcaccgatggggatcattgtggctgtggtcactgggattgctgtagcg gccattgttgctgctgtagtggccttgatctactgcaggaaaaagcggatttcagctctc ccaggataccctgagtgcagggaaatgggagagaccctccctgagaaaccagccaatccc actaatcctgatgaggctgacaaagttggggctgagaacacaatcacctattcacttctc atgcacccggatgctctggaagagcctgatgaccagaaccgtatttag >gi568815597f:161526588_161708214|GENSCAN_predicted_peptide_6|150_aa MEKREKKADEQGTLEPGEQNGDEFRRFSFCFMYATSGAEKASNPEMPKVVTREYTINIHK HIHGVGFKKCAPRALKELRKFGMKEMGTPDARIDIRLNKAVWAKGIRMSHTESVCGCPEN VMRMKIHQISSILWLPMYLLPLLKIYSQCG >gi568815597f:161526588_161708214|GENSCAN_predicted_CDS_6|453_bp atggagaaaagagaaaagaaagcagatgagcaagggaccttggaacctggggaacaaaat ggtgatgagttccgtaggttttctttttgtttcatgtatgctacatctggagctgaaaaa gccagcaacccagaaatgccaaaggtggtgacccgagaatacaccatcaacattcacaag cacatccatggagtgggcttcaagaagtgtgcccctcgggcactcaaagagcttcggaaa tttggcatgaaggagatgggaactccagatgcgcgcattgatatcaggctcaacaaagct gtctgggccaaaggaataagaatgtcccataccgaatccgtgtgcggctgtcccgaaaac gtaatgaggatgaagattcaccaaataagctctatactttggttacctatgtacctgttg ccacttttgaaaatctacagtcaatgtggatga >gi568815597f:161526588_161708214|GENSCAN_predicted_peptide_7|213_aa MRKNQCKNTENSKNQNASSPPNDHNSSLARAQNWKENENDELTEVGFRSIILIPKPGRYT TKKENVRPISLMDTDEKILNKMLANQIQQHIKKLIHHGQVGFIPGMQGLFNIHKSIKVIH HINRTNDKNHMIISIDAEKAFDKIQHPFMLKTLNKLDLEKTTLNFVWNQKAARIAKTILS KKNKAGGIMQPDFKQNYKATVTKMHDTGTKTDM >gi568815597f:161526588_161708214|GENSCAN_predicted_CDS_7|642_bp atgaggaaaaaccaatgcaaaaacactgaaaactccaaaaaccagaatgcctcttctcct ccaaatgatcacaactcctctctagcaagggcacaaaactggaaggagaatgagaatgac gaattgacagaagtaggcttcagaagcatcatcctgataccaaaacctggcagatacaca actaaaaaagaaaatgtcaggccaatatccctgatggacaccgatgagaaaatcctcaat aaaatgctggcaaaccaaatccagcagcacatcaaaaagcttatccaccatggtcaagtt ggcttcatccctgggatgcaaggcttgttcaacatacacaaatcaataaaggtaatccat cacataaacagaaccaatgacaaaaaccacatgattatctcaatagatgcagagaaggcc ttcgataaaattcaacatcccttcatgctaaaaactctcaataaactagacttagaaaaa actactttaaatttcgtgtggaaccaaaaagcagcccgtatagccaagacaatcctaagc aaaaagaacaaagctggaggcatcatgcaacctgacttcaaacaaaactacaaggctaca gtaaccaaaatgcatgatactggtaccaaaacagatatgtag >gi568815597f:161526588_161708214|GENSCAN_predicted_peptide_8|27_aa MKLGCVLMAWALYLSLGVLWVAQMLLX >gi568815597f:161526588_161708214|GENSCAN_predicted_CDS_8|81_bp atgaagctgggctgtgtcctcatggcctgggccctctacctttcccttggtgtgctctgg gtggcccagatgctactggnn