GENSCAN 1.0	Date run: 6-Nov-116	Time: 17:52:50

Sequence gi568815595r:32383963_32602745 : 218783 bp : 42.72% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   7945   8103  159  0  0   72  115   136 0.330  12.58
 1.02 Term +   8256   8660  405  2  0   23   53   170 0.272   1.20
 1.03 PlyA +   9887   9892    6                               1.05

 2.08 PlyA -  13286  13281    6                               1.05
 2.07 Term -  21393  20951  443  1  2   51   48   207 0.512   7.43
 2.06 Intr -  30455  30343  113  1  2   28   51   164 0.053   5.80
 2.05 Intr -  35483  35427   57  1  0   84  116    44 0.483   3.88
 2.04 Intr -  36752  36605  148  0  1   96   92    63 0.320   5.87
 2.03 Intr -  39668  39599   70  2  1   78   27    65 0.140  -2.76
 2.02 Intr -  40323  40206  118  2  1   78   72    83 0.389   5.25
 2.01 Init -  45038  44872  167  2  2   70   86   189 0.894  16.05
 2.00 Prom -  45242  45203   40                             -11.24

 3.00 Prom +  46957  46996   40                              -5.45
 3.01 Init +  52309  52392   84  0  0   96   35    79 0.157   4.27
 3.02 Intr +  57878  58051  174  1  0   93  103   165 0.874  17.71
 3.03 Intr +  65492  65590   99  1  0  127   77   127 0.824  14.89
 3.04 Intr +  68430  68511   82  2  1  116  109    24 0.563   5.59
 3.05 Intr +  70311  70398   88  1  1   18   56    42 0.108  -7.89
 3.06 Intr +  71436  71730  295  0  1   11   94   239 0.445  12.89
 3.07 Intr +  73914  74041  128  2  2   80   60    86 0.438   3.56
 3.08 Intr +  84882  85080  199  0  1   39   43   197 0.117   8.73
 3.09 Term +  87025  87297  273  0  0  -40   48   399 0.453  17.59
 3.10 PlyA +  90499  90504    6                               1.05

 4.05 PlyA -  93982  93977    6                               1.05
 4.04 Term - 100135  99998  138  1  0  141   43    72 0.931   4.98
 4.03 Intr - 107924 107748  177  2  0   64  111    12 0.457   0.29
 4.02 Intr - 109951 109868   84  1  0   57  100    52 0.623   2.40
 4.01 Init - 118783 118646  138  1  0   87  105    98 0.967  11.60
 4.00 Prom - 119879 119840   40                             -11.54

 5.00 Prom + 121799 121838   40                              -8.65
 5.01 Sngl + 123993 124370  378  2  0   66   45   277 0.704  17.11
 5.02 PlyA + 124594 124599    6                               1.05

 6.10 PlyA - 126698 126693    6                               1.05
 6.09 Term - 142946 142837  110  0  2   80   49   134 0.999   6.39
 6.08 Intr - 144639 144484  156  1  0  102   85    47 0.682   4.86
 6.07 Intr - 145698 145578  121  0  1   71   83   116 0.999   8.55
 6.06 Intr - 146558 146499   60  2  0   72   80    57 0.693   1.31
 6.05 Intr - 157244 157075  170  0  2   64   96    94 0.672   6.54
 6.04 Intr - 161144 160914  231  0  0   92   84   149 0.996  11.72
 6.03 Intr - 162003 161887  117  1  0   79  101    85 0.995   8.42
 6.02 Intr - 186457 186384   74  0  2  112  106    11 0.936   3.43
 6.01 Init - 186808 186663  146  1  2  104   25   169 0.982  11.87
 6.00 Prom - 188929 188890   40                              -6.85

 7.00 Prom + 196870 196909   40                              -7.15
 7.01 Init + 198442 198544  103  0  1   85  113    32 0.813   5.85
 7.02 Intr + 208715 209085  371  0  2   75  -33   333 0.146  13.80
 7.03 Term + 213012 213245  234  2  0   78   47   151 0.210   5.34
 7.04 PlyA + 213249 213254    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  30455  30299  157  1  1   28   46   221 0.840   8.32
S.002 Term - 186178 186069  110  2  2   93   43    60 0.854  -0.31

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595r:32383963_32602745|GENSCAN_predicted_peptide_1|187_aa
MSHGAGLVRTTCSSGSALGPGAGAAQPSASPLEGLLDLSYPRTHAALLKVAQMASSGKVA
AAAPALELSPQSRPGPCSRGPRSPGSQVSPATPLRSLTRLLTETTGKVPFRLSSSSLGRA
ASNPSRGWKSRPGGGAEGAKPDLFPVHGLLFPPNPARQGFPALCAVSLEGGGQRVLCSIV
GKKNWDR

>gi568815595r:32383963_32602745|GENSCAN_predicted_CDS_1|564_bp
atgtcgcacggagccgggctcgtccgcaccacgtgcagcagcggcagcgcgctcggaccc
ggggccggcgcggcccagcccagcgcgagccccttggaggggctgctggacctcagctac
ccccgcacccacgcggccctgctgaaagtggcgcaaatggcttcctccgggaaagttgct
gccgcagcccccgcgctggagctcagtccccagagccgccccggcccttgctcccgcgga
cccaggtcgccagggtcgcaagtttctccggccacccccctgcgaagcctgacccgccta
ctgactgaaaccacgggcaaagttccttttcgtctttcttcctcttctctgggccgcgcc
gcttcgaacccgagccgggggtggaagagtcgcccaggcggcggcgctgagggagcgaaa
cccgacctcttccccgtccacggcctcctctttcctccgaacccagcgcggcagggcttc
cctgctctgtgtgcagtttcgctcgaaggaggaggtcagagggtcttgtgttccatagtg
gggaaaaagaactgggaccgttga

>gi568815595r:32383963_32602745|GENSCAN_predicted_peptide_2|371_aa
MRLYIHKSYQCKYQRRALVGAGLVQSSKPWLVSEWGQVGHRWAAVICGFPRLSGARLLAL
GTLAPCYERLRALERPCVGVTAHKAASQHQLQCETARGSDGANCWAWGLHALLQVNEEWN
KKPAVVNRPSVRQTVGLWRGHLAPFFLLQPFQPCPQFKGMIIVTCRPREQGKDVFSHILI
LLEVAISNEAIKGALVRNVKDEGTSRDLHQTEAVNAVLKYSGKEGAAIRDPALMQDKKAF
VTALVIYCCTTNYHKLGTLTQHSFIFSQFPRSRSLGRSPLFRVSQDCNKRLARLGVLFRA
HMAVSGFQFLVTIGLKARFLAGCQLRAALASRHDPWMLVTWPLTGPLSTRQFTSSEPAGD
SLPLGRTQYFF

>gi568815595r:32383963_32602745|GENSCAN_predicted_CDS_2|1116_bp
atgaggctctacattcacaaaagttaccagtgcaaataccagcgcagagccttggtggga
gcaggtctggtgcagtcaagcaaaccctggctggtgtctgagtggggccaggtgggccac
aggtgggcagctgttatctgcggctttccacgactctcaggtgcaaggctgctcgcccta
ggaaccctggcaccatgttatgagaggcttagagcactggagaggccatgtgtaggtgtt
acagcccataaggccgccagccagcatcaactgcaatgtgagacagctagaggctctgac
ggggctaactgctgggcctgggggctccacgctctgcttcaggtcaatgaagaatggaac
aaaaagccagctgtggtaaaccgcccttcagtcaggcagacagtgggactatggcgtgga
caccttgcccctttcttcctcctccagccttttcaaccgtgtccccagttcaaaggcatg
atcattgtgacctgcaggcccagggaacaaggaaaagatgtcttttctcacattcttata
ctgctggaagttgcaataagtaatgaggcaataaaaggagccttagtaagaaatgtcaaa
gatgaggggacgagcagagacttgcaccaaacagaagctgtgaacgcagtgctgaaatac
agcggtaaagaaggagcagccatcagagatccagctttgatgcaggataaaaaggctttt
gtgactgcgttagttatctactgctgcacaaccaattaccacaaacttggcaccttaaca
caacattcatttatcttctcacagtttccacggagcaggagtctggggagaagtccactg
ttcagagtctcacaagactgcaacaagaggttggccaggcttggggtcctcttccgagct
cacatggctgttagtggctttcagtttcttgtaaccataggtctgaaggcccgttttctt
gctggctgtcagctgagggctgctctggcttctaggcatgacccttggatgcttgtgaca
tggcccctcacaggccctctctcaacaaggcagttcacttcttcagagccagcaggagac
tctctccctttgggaaggacccagtacttcttctaa

>gi568815595r:32383963_32602745|GENSCAN_predicted_peptide_3|473_aa
MGQVSQPVGVNEIAQRLLDDGDFNREARVTLLIAFICVRSSLWTNYSAYSYFEVVTICDL
IMILAFYLVHLFRFYRVLTCISWPLSELLHYLIGTLLLLIASIVAASKSYNQSGLVAGAI
FGFMATFLCMASIWLSYKISCVTQSTGPSGALQRGGRVLQARPLDLCSCAKVLSGCPILA
SAECCRKWKQPLWALSWSSGATLVAAGQLGASGSVWDDLRKQHPWCIWSRVLLMGGDPVY
QLGWSGNMAELTLAQSAPPGQWNGGVSAEEGTVQPSETLAPLGTVGTQRQSRGCGEPHSP
WILGAFEEGWPELWGLGASPARPTTMSCQHNLIAVIWAFNCHLGAFLWTLPWDLLSGHTV
QLASAAGEFTPCEVTLTNGTQEPKKKEEEEEEEEEEEEEEDRRRTRRRGGEEGEGEEEEK
RRKKKEEEEEKWPYSKIGRWMVHPKPKNIPTASIPQVIRHVATFAQEPEQQRT

>gi568815595r:32383963_32602745|GENSCAN_predicted_CDS_3|1422_bp
atggggcaggtttctcagccagtgggtgtcaatgagattgcacaaaggctattagatgat
ggtgactttaacagggaggcacgggtcaccctgctgattgccttcatctgtgtgcggagc
tccctgtggaccaactacagcgcctacagctactttgaagtggtcaccatttgcgacttg
ataatgatcctcgccttttacctggtccacctcttccgcttctaccgcgtgctcacctgt
atcagctggcccctgtcggaacttctgcactatttaatcggtaccctgctcctcctcatc
gcctccattgtggcagcttccaagagttacaaccagagcggactggtagccggagcgatc
tttggtttcatggccaccttcctctgcatggcaagcatatggctgtcctataagatctcg
tgtgtaacccagtccacaggcccctcaggagctttgcagagaggaggacgtgtactccag
gcgaggcctctggacctgtgttcctgtgccaaagtcctgtcaggctgccccatccttgcc
tctgctgagtgttgccggaaatggaagcagcccctgtgggctttgagctggagctctggg
gctactcttgtggctgctggccagctcggggcatcgggttctgtctgggatgatctaagg
aagcagcacccgtggtgtatctggagccgagtgctgctgatgggaggggatcctgtgtac
cagctgggctggagcgggaacatggctgagctgaccctggcacagtcggctcctccaggg
caatggaatggaggagtctcagctgaggagggcacggtccagccctcagagacccttgcc
cctctgggcactgtggggacccagaggcagagcagagggtgtggggaacctcatagtccc
tggatcctgggagctttcgaggaaggctggccagagctctggggcctgggtgctagccct
gcaaggcctacgaccatgtcatgccagcataacctgatagctgtgatttgggccttcaac
tgtcaccttggcgctttcctgtggacactgccatgggacctgctcagcggccacacagtg
caacttgcaagtgctgcaggggaattcactccctgtgaggtaacactgaccaatgggaca
caggaaccaaagaagaaggaggaggaggaggaagaggaagaggaagaagaagaagaagaa
gatagaagaaggacaaggaggagaggaggagaggagggggagggggaggaggaggagaaa
agaagaaaaaagaaagaagaagaagaagaaaagtggccctacagcaaaattggccgctgg
atggtacatcccaaacctaagaatattcctacagcttccatcccccaagtcatcagacat
gtggccacatttgcccaagagcctgaacaacaaagaacctaa

>gi568815595r:32383963_32602745|GENSCAN_predicted_peptide_4|178_aa
MENGAVYSPTTEEDPGPARGPRSGLAAYFFMGRLPLLRRVLKGLQLYCNRASLPDYPGTF
LANVTGNLLIAQSKLLSLLAFICEEVVSQCTLCGGLYFFEFVSCSAFLLSLLILIVYCTP
FYERVDTTKVKSSVFGFIASFMFLLDFITMLYEKRQESQLRKPENTTRAEALTEPLNA

>gi568815595r:32383963_32602745|GENSCAN_predicted_CDS_4|537_bp
atggagaacggagcggtgtacagccccactacggaggaggacccgggccccgccagaggc
ccccggagcggcctcgctgcctactttttcatgggccggctcccattgctccggcgcgtt
ctcaagggcttgcagctgtattgtaatcgagcttctctccctgactaccctggaactttt
cttgctaatgtcactggcaaccttcttatagctcagtctaagttgctgtctctgctggcc
ttcatctgtgaagaagttgtatcacaatgtactttatgtggaggactttatttttttgag
tttgtaagctgcagtgcctttcttctgagtctccttatactgattgtgtattgcactcca
ttttatgagagagttgataccacaaaagtaaaatcatcggtgtttggatttatagcaagt
tttatgttcctacttgactttatcactatgctgtatgaaaaacgacaggagtcccagctg
agaaaacctgaaaataccactagggctgaagccctcactgagccacttaatgcctaa

>gi568815595r:32383963_32602745|GENSCAN_predicted_peptide_5|125_aa
MASAKKGGQKKKGRSAINEVVTQEYTINIHKRTHGVRFKKCAPWALKEIRKFAMKEMVAP
DMCVATRLNKAVWAKGIRNVSYRIRVRLSRIRMKIHQISSILWLPMYRLPLSKIYRQSMW
MRTNC

>gi568815595r:32383963_32602745|GENSCAN_predicted_CDS_5|378_bp
atggcttccgcaaagaagggtggccaaaagaaaaagggccgttctgctatcaacgaggtg
gtgacccaagaatacaccatcaacattcacaagcgcacccatggagtgcgcttcaagaag
tgtgccccttgggcactcaaagagattcggaaatttgccatgaaggagatggtagctcca
gacatgtgcgttgctaccaggctcaacaaagctgtctgggccaaaggaataaggaatgtc
tcataccgaatccgtgtgcgattatccagaataaggatgaagattcaccaaataagctct
atactttggttacctatgtaccggttaccactttcaaaaatctacagacagtcaatgtgg
atgagaactaattgctga

>gi568815595r:32383963_32602745|GENSCAN_predicted_peptide_6|394_aa
MAAVGRVGSFGSSPPGLSSTYTGGPLGNEIASGNGGAAAGDDEDGQNLWSCILSEVSTRS
RSKLPAGKNVLLLGEDGAGKTSLIRKIQGIEEYKKGRGLEYLYLNVHDEDRDDQTRCNVW
ILDGDLYHKGLLKFSLDAVSLKDTLVMLVVDMSKPWTALDSLQKWASVVREHVDKLKIPP
EEMKQMEQKLIRDFQEYVEPGEDFPASPQRRNTASQEDKDDSVVLPLGADTLTHNLGIPV
LVVCTKFVHEKEIMAEDDQVFLMKLQDASPRVPGGSPRTPNRSVSSNVASVSPIPAGSKK
IDPNMKAGATSEGVLANFFNSLLSKKTGSPGGPGVSGGSPAGGAGGGSSGLPPSTKKSGQ
KPVLDVHAELDRITRKPVTVSPTTPTSPTEGEAS

>gi568815595r:32383963_32602745|GENSCAN_predicted_CDS_6|1185_bp
atggcggccgtggggcgagtcggctccttcggttcttctccgccgggattatcctcgact
tacactggcggccccttgggcaacgagatagcgtcgggcaacggtggcgccgcggcaggc
gacgacgaggacgggcagaacctttggtcctgcatcctcagcgaggtctccacccgctcg
cgctccaagctccctgcggggaagaacgtgctactgctgggtgaagatggagctggaaaa
acaagcttaataagaaaaattcagggaatagaggagtataagaaaggaagaggattggaa
tatttgtacttaaatgtgcatgatgaagacagggatgatcaaacaagatgtaatgtttgg
atcttagatggagacctatatcacaaaggcctccttaaattttcactggatgccgtatct
ctgaaggatactctagttatgctggttgttgacatgtcaaagccttggactgctttggat
tctttacagaaatgggcaagtgttgttagagaacatgttgacaaactgaaaatccctcct
gaagaaatgaaacaaatggaacaaaagttgattagagacttccaagaatatgtagagcca
ggagaagacttcccggcttctccccagagaagaaatactgcgtcacaagaagacaaagat
gacagtgtagttttacctctgggtgcggatacacttacacataacttgggcattccagta
ctagtagtttgcacaaagtttgtacatgagaaggaaattatggcagaagatgatcaggtg
tttcttatgaagctacaggatgcctcaccaagagtcccaggaggctccccacgaacacca
aatagatctgtatcatctaatgttgccagcgtgtcacccattcctgctgggtcaaaaaaa
attgatccaaacatgaaagctggagctacaagtgaaggcgttctggcaaatttcttcaac
agtttgttgagtaaaaagactggctctccaggaggccctggtgtgagtggtggtagccct
gcaggtggggctggaggtggaagcagtggtttaccaccatccaccaaaaagtcaggccag
aagcctgtcttagatgttcatgcagaactagacagaattacacgaaaaccagttacagtt
tctcccacaacacctacatctcctacggaaggagaagcttcttga

>gi568815595r:32383963_32602745|GENSCAN_predicted_peptide_7|235_aa
MAVNLHYLDDTECLHLMFVVRVMAYTVHAPQSDEDLYSGPHRAVSHLSRTLSSSLLQAAA
AASRPDWKTDTRCGRSCREKPTVEMEAYTGVSEGRTGRVCMQVPRLFFSTLKPAERGELK
DADEGREPALLSNRPEVPSKKADSSTASIALSKSLLWRILNYIMTQKIEWLDSWLPRQKG
GYALLVLRSLKGYLLHQLTTHKINYLFLHLEEPRGLVDCLGAKINLQEKQSTGSD

>gi568815595r:32383963_32602745|GENSCAN_predicted_CDS_7|708_bp
atggcagttaatttgcattatttagatgatacagaatgtttgcatttaatgtttgtggta
agggtaatggcttatacagtccatgctccacaatcagatgaagacctttattctggtcct
catcgtgcagtaagccacttaagcagaacactttcttcatctctgctccaagcagcagcg
gctgcatccaggcctgactggaagacggacacacgttgtggaagatcttgtagggaaaag
ccaacggttgagatggaggcttataccggtgtgagcgagggtagaacaggaagagtatgt
atgcaagtgccaagactgttcttctcaacgctgaagcctgcggagaggggcgagctgaag
gatgcagatgaaggccgggaacctgcactgctgagcaatcgacccgaagtaccttccaaa
aaagcagattcatccacagcttctatcgcattatccaaatctctcctctggaggatatta
aattacatcatgacacagaagatagaatggctagacagctggcttccccgccagaaggga
ggatatgctcttctagttctcaggagcctgaaagggtacctccttcatcagctgactaca
cataagataaactatctcttcctacacttggaagaaccacgaggtttggtggactgcctt
ggagcaaaaataaaccttcaggagaaacagtccactgggtcagactga