GENSCAN 1.0 Date run: 7-Nov-116 Time: 18:34:17 Sequence gi568815587f:113287885_113500264 : 212380 bp : 45.80% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3401 3602 202 0 1 48 56 164 0.181 7.54 1.02 Intr + 13022 13106 85 2 1 30 85 53 0.035 -0.98 1.03 Intr + 28359 28431 73 2 1 81 84 106 0.619 8.48 1.04 Intr + 35404 35567 164 2 2 92 4 189 0.713 10.59 1.05 Intr + 36110 36131 22 1 1 47 110 4 0.647 -4.38 1.06 Intr + 37640 37761 122 0 2 74 82 224 0.996 20.61 1.07 Intr + 42036 42095 60 2 0 57 116 72 0.969 5.83 1.08 Intr + 47082 47153 72 2 0 70 103 71 0.913 6.40 1.09 Intr + 50890 50950 61 0 1 86 105 27 0.982 2.51 1.10 Intr + 51402 51590 189 1 0 118 58 242 0.905 23.76 1.11 Intr + 52780 52849 70 2 1 71 97 24 0.438 -0.16 1.12 Intr + 53953 54041 89 1 2 108 115 26 0.527 6.91 1.13 Intr + 56388 56556 169 1 1 71 115 161 0.526 16.10 1.14 Intr + 62189 62281 93 2 0 75 86 112 0.984 8.78 1.15 Intr + 64043 64103 61 2 1 68 97 46 0.988 2.24 1.16 Intr + 64186 64323 138 0 0 74 84 112 0.960 10.06 1.17 Intr + 72056 72124 69 1 0 83 98 17 0.737 1.58 1.18 Intr + 74517 74618 102 2 0 83 116 56 0.990 8.27 1.19 Intr + 76444 76576 133 0 1 42 49 65 0.606 -1.78 1.20 Intr + 76951 77176 226 2 1 112 85 241 0.892 23.24 1.21 Intr + 79181 79251 71 2 2 86 41 36 0.674 -2.47 1.22 Intr + 80303 80374 72 0 0 76 75 47 0.561 1.58 1.23 Intr + 80580 80764 185 1 2 67 81 85 0.337 5.21 1.24 Intr + 85240 85293 54 0 0 104 78 33 0.234 3.08 1.25 Term + 90284 90325 42 1 0 147 34 -1 0.037 -2.44 1.26 PlyA + 93337 93342 6 1.05 2.00 Prom + 93524 93563 40 -3.46 2.01 Init + 100001 100185 185 1 2 88 84 339 0.990 32.09 2.02 Intr + 105597 105891 295 0 1 106 78 434 0.999 41.21 2.03 Intr + 107045 107196 152 1 2 90 94 219 0.997 21.66 2.04 Intr + 107394 107524 131 0 2 28 115 30 0.626 0.04 2.05 Intr + 109340 109458 119 0 2 17 105 148 0.953 9.58 2.06 Intr + 110096 110132 37 1 1 104 89 71 0.999 6.64 2.07 Term + 111080 112383 1304 0 2 71 42 1236 0.497 109.06 2.08 PlyA + 115822 115827 6 1.05 3.16 PlyA - 121750 121745 6 1.05 3.15 Term - 123036 122843 194 1 2 100 41 404 0.999 34.48 3.14 Intr - 124859 124672 188 1 2 89 109 272 0.768 28.73 3.13 Intr - 126443 126169 275 2 2 71 107 48 0.619 1.34 3.12 Intr - 126577 126491 87 1 0 85 103 51 0.942 6.47 3.11 Intr - 127727 127537 191 0 2 123 14 397 0.951 35.10 3.10 Intr - 129115 128979 137 0 2 96 88 264 0.998 27.41 3.09 Intr - 130252 130143 110 1 2 99 99 186 0.982 19.88 3.08 Intr - 132160 132036 125 2 2 43 58 96 0.624 2.40 3.07 Intr - 135930 135900 31 0 1 119 80 7 0.648 0.70 3.06 Intr - 136798 136483 316 0 1 99 92 597 0.796 57.27 3.05 Intr - 145217 145123 95 2 2 122 52 1 0.469 -1.34 3.04 Intr - 147304 147262 43 0 1 67 116 61 0.726 5.04 3.03 Intr - 148321 148150 172 2 1 14 -8 182 0.168 0.20 3.02 Intr - 152196 152026 171 1 0 17 92 80 0.075 1.21 3.01 Init - 159348 159129 220 0 1 91 50 108 0.121 6.10 3.00 Prom - 165395 165356 40 -4.56 4.06 PlyA - 165700 165695 6 1.05 4.05 Term - 176274 176170 105 0 0 86 49 63 0.028 0.61 4.04 Intr - 187376 187192 185 0 2 29 121 125 0.734 9.41 4.03 Intr - 187643 187625 19 2 1 71 77 31 0.116 -3.52 4.02 Intr - 203306 203139 168 2 0 89 107 59 0.575 8.04 4.01 Intr - 208075 207972 104 2 2 84 84 14 0.228 0.49 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 164128 164266 139 0 1 96 69 71 0.846 6.04 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:113287885_113500264|GENSCAN_predicted_peptide_1|874_aa XTEKCGGAHESCTPLLSSTQVQLTVSILLTKLNLRSTARHVATGHREIIASYGPVYEIPL HTHMGWLICRKSTDKIQHPFMIKTLSKIGTEGTYLKGFRFTMDADKEKDLQKFLKNVDEI SNLIQEMNSDDPVVQQKAVLETEKRLLLMEEDQEEDECRTTLNKTMISPPQTAMKSAEEI NSALKEKGNEAFAEGNYETAILRYSEGLEKLKDMKVLYTNRAQAYMKLEDYEKALVDCEW ALKCDEKCTKAYFHMGKANLALKNYSVSRECYKKILEINPKLQTQVKGYLNQVDLQEKAD LQEKEAHELLDSGKNTAVTTKNLLETLSKPDQIPLFYAGGIEILTEMINECTEQTLFRMH NGFSIISDNEVIRRCFSTAGNDAVEEMVCVSVLKLWQAVCSRNEENQRVLVIHHDRARLL AALLSSKVLAIRQQSFALLLHLAQTESGRSLIINHLDLTRLLEALVSFLDFSDKEANTAM GLFTDLALEESETPGMPLCDMKLSGWFVGLLKTDPKVSSSSALCQCIAIMGNLSAEPTTR RHMAACEEFGDGCLSLLVWAVEVSRRCLSLLNSQDGGILTRAAGVLSRTLSSSLKIVEEA LRAGVVKKMMKFLKQASQERQPANVLLTNSLECTEGELKEAPPWKRQAFLLDPVAYPAEL SVMMKLLSSEDEVLVGNAALCLGNCMEVPNVASSLLKTDLLQVLLKLAGSDTQKTAVQVN AGIALGKLCTAEPRGNAPFQLPQGHSKPGSPVDCKSLKCLPRAVISNLEESLLHLKLCLL AGNQSLLVQQAEELGFSILKRMWPGWNSITILKPQQRWQVTDSPELLTVFIHDVRTVVPV RRKMRKLRCKAVKQLVQDLDQVEGEQGVSLNFHS >gi568815587f:113287885_113500264|GENSCAN_predicted_CDS_1|2625_bp ntgacagagaaatgtggaggtgctcatgagtcctgcacacctttgctctcctccacgcaa gttcagctgaccgtcagcatccttctgaccaagctcaacctcaggtctactgccagacat gtggccacgggacacagagagatcatagcctcctatggacctgtttatgaaattcccctt cacacgcacatggggtggctcatatgcagaaaaagcactgacaaaatccagcatcccttt atgattaaaactctcagcaaaattggcacagaagggacttacctcaagggattccggttc acaatggatgctgataaagagaaagatttgcagaaatttcttaaaaatgtggatgaaatc tccaatttaattcaggagatgaattctgatgacccagttgtgcaacagaaagctgtcctg gagacagaaaagagactactgcttatggaggaagaccaggaggaggatgaatgcaggacc accttgaacaagactatgatcagtcctccacaaactgctatgaagagtgcagaagaaata aactcagccctaaaagaaaaagggaatgaagcatttgctgaaggcaattatgaaacagct atcctgcgctacagtgagggtttggagaagctgaaggacatgaaagtgctgtacaccaac cgagcccaggcttatatgaaacttgaggactatgagaaggcactggtggattgtgagtgg gctctcaagtgtgatgaaaaatgcacaaaagcatattttcacatgggaaaagccaacctg gccctgaagaactacagtgtgtctagagagtgttataagaagatcttagaaataaacccc aagctgcaaacccaggtgaaaggttacctgaatcaagtagatcttcaggaaaaagcagac cttcaagaaaaggaagcccacgaactgctggattcaggaaagaacacagccgtgaccacc aagaacctcctggagaccctttccaagcctgaccagatccccttgttctatgctgggggg attgagatcctgactgaaatgataaatgagtgcacagaacaaactttattcagaatgcac aatggatttagtatcatcagtgacaacgaggtcataagaaggtgtttttccacagcagga aatgatgcagttgaagaaatggtctgtgtgtctgttctcaagctctggcaagcagtgtgc agcaggaacgaggaaaaccagcgtgtgctagtgatacaccatgacagggccaggctgttg gccgccctcttgtcctccaaggtcctggccatccggcagcagagctttgccctgctgctg catctcgcccagactgagagcggacggagcctgatcatcaaccaccttgacctgaccaga ttattggaagcgctggtgtcatttcttgatttctcggataaggaggccaacactgctatg ggactgttcacagacttggctctggaagaaagtgagaccccagggatgcccctgtgtgac atgaagctatctggttggttcgtgggccttttgaagacagatcccaaggtaagcagctcc tcggctctgtgccagtgcattgccatcatgggaaacctcagtgctgagcccactacccga agacacatggcggcctgtgaggaatttggggatggctgcttgagcctcctggtttgggct gtggaggtgagcagaaggtgcctgtctttactaaacagccaggatggaggaatcctgaca agagctgctggtgttctgagccggaccctttcttcctctctgaaaattgttgaggaggcc ttgcgagcaggagtggtaaagaaaatgatgaaattcctgaagcaggcttcccaggagagg cagcctgctaatgttcttcttaccaactccctggaatgcacagaaggtgaactgaaggag gctccaccatggaagcggcaagcctttctcctggatcctgttgcataccctgcagagttg agcgttatgatgaagctgctcagctcggaggatgaggttctggtgggcaacgctgccctc tgccttggtaactgcatggaggtgcccaacgttgcgtcttccctgctaaagacggacctt ttgcaggtcttgttaaagcttgcaggcagtgacacacagaagacggccgtgcaggtgaac gcaggcattgctctggggaagctgtgcacagctgagcccagaggcaacgctcccttccag ttgccccagggtcactctaaaccagggtcacctgtggactgcaagtccctaaagtgcctg ccgagagcagtcatctccaatttggaagagagtctgctgcacctgaagctctgtctcctc gcaggtaatcagagtcttctggttcaacaagcagaggagctgggattttccatcctaaag aggatgtggccgggatggaactccatcaccatcctgaagccacagcagaggtggcaggta acagacagtccagagctgctcactgtcttcatacatgacgtgaggacggtggttccagtg cgccgcaagatgaggaaactgaggtgcaaagcagttaagcaactggtccaagatctagac caggttgaaggagaacagggagtgtctctaaatttccacagctga >gi568815587f:113287885_113500264|GENSCAN_predicted_peptide_2|740_aa MAADPTELRLGSLPVFTRDDFEGDWRLVASGGFSQVFQARHRRWRTEYAIKCAPCLPPDA ASSDVNYLIEEAAKMKKIKFQHIVSIYGVCKQPLGIVMEFMANGSLEKVLSTHSLCWKLR FRIIHETSLAMNFLHSIKPPLLHLDLKPGNILLDSNMHVKISDFGLSKWMEQSTRMQYIE RSALRGMLSYIPPEMFLESNKAPGPKYDVYSPPTLPPRAGVILDVQLSHSERVLCIHSFA IVIWELLTQKKPYSDITIETDILLSLLQSRVAVPESKALARKVSCKLSLRQPGEVNEDIS QELMDSDSGNYLKRALQLSDRKNLVPRDEELCIYENKVTPLHFLVAQGSVEQVRLLLAHE VDVDCQTASGYTPLLIAAQDQQPDLCALLLAHGADANRVDEDGWAPLHFAAQNGDDGTAR LLLDHGACVDAQEREGWTPLHLAAQNNFENVARLLVSRQADPNLHEAEGKTPLHVAAYFG HVSLVKLLTSQGAELDAQQRNLRTPLHLAVERGKVRAIQHLLKSGAVPDALDQSGYGPLH TAAARGKYLICKMLLRYGASLELPTHQGWTPLHLAAYKGHLEIIHLLAESHANMGALGAV NWTPLHLAARHGEEAVVSALLQCGADPNAAEQSGWTPLHLAVQRSTFLSVINLLEHHANV HARNKVGWTPAHLAALKGNTAILKVLVEAGAQLDVQDGVSCTPLQLALRSRKQGIMSFLE GKEPSVATLGGSKPGAEMEI >gi568815587f:113287885_113500264|GENSCAN_predicted_CDS_2|2223_bp atggctgccgaccccaccgagctgcggctgggcagcctccccgtcttcacccgcgacgac ttcgagggcgactggcgcctagtggccagcggcggcttcagccaggtgttccaggcgcgg cacaggcgctggcggacggagtacgccatcaagtgcgccccctgccttccacccgacgcc gccagctctgatgtgaattacctcattgaagaagctgccaaaatgaagaagatcaagttt cagcacatcgtgtctatctacggggtgtgcaagcagcccctgggtattgtgatggagttt atggccaacggctccctggagaaggtgctgtccacccacagcctctgctggaagctcagg ttccgcatcatccatgagaccagcttggccatgaacttcctgcacagcattaagccgcct ctgctccacctggacctcaagccgggcaacatactcctggacagcaacatgcatgtcaaa atttcagacttcggcctgtccaagtggatggaacagtccacccggatgcagtacatcgag aggtcggctctgcggggcatgctcagctacatcccccctgagatgttcctggagagtaac aaggccccaggacctaaatatgatgtgtacagccccccgaccctgccaccccgggctggg gtgatcttggatgttcaactaagtcattcagaaagggttctctgcatccacagctttgca attgtcatctgggagctactcactcagaagaaaccatactcagacattaccatcgagaca gacatactgctgtcactgctgcagagtcgtgtggcagtcccagagagcaaggccctggcc aggaaggtgtcctgcaagctgtcgctgcgccagcccggggaggttaatgaggacatcagc caggaactgatggacagtgactcaggaaactacctgaagcgggcccttcagctctccgac cgtaagaatttggtcccgagagatgaggaactgtgtatctatgagaacaaggtcaccccc ctccacttcctggtggcccagggcagtgtggagcaggtgaggttgctgctggcccacgag gtagacgtggactgccagacggcctctggatacacgcccctcctgatcgccgcccaggac cagcaacccgacctctgtgccctgcttttggcacatggtgctgatgccaaccgagtggat gaggatggctgggccccactgcactttgcagcccagaatggggatgacggcactgcgcgc ctgctcctggaccacggggcctgtgtggatgcccaggaacgtgaagggtggacccctctt cacctggctgcacagaataactttgagaatgtggcacggcttctggtctcccgtcaggct gaccccaacctgcatgaggctgagggcaagacccccctccatgtggccgcctactttggc catgttagcctggtcaagctgctgaccagccagggggctgagttggatgctcagcagaga aacctgagaacaccactgcacctggcagtagagcggggcaaagtgagggccatccaacac ctgctgaagagtggagcggtccctgatgcccttgaccagagcggctacggcccactgcac actgcagctgccaggggcaaatacctgatctgcaagatgctgctcaggtacggagccagc cttgagctgcccacccaccagggctggacacccctgcatctagcagcctacaagggccac ctggagatcatccatctgctggcagagagccacgcaaacatgggtgctcttggagctgtg aactggactcccctgcacctagctgcacgccacggggaggaggcggtggtgtcagcactg ctgcagtgtggggctgaccccaatgctgcagagcagtcaggctggacacccctccacctg gcggtccagaggagcaccttcctgagtgtcatcaacctcctagaacatcacgcaaatgtc cacgcccgcaacaaggtgggctggacacccgcccacctggccgccctcaagggcaacaca gccatcctcaaagtgctggtcgaggcaggcgcccagctggacgtccaggatggagtgagc tgcacacccctgcaactggccctccgcagccgaaagcagggcatcatgtccttcctagag ggcaaggagccgtcagtggccactctgggtggttctaagccaggagccgagatggaaatt tag >gi568815587f:113287885_113500264|GENSCAN_predicted_peptide_3|784_aa MGTTEQSVAVQPSGPPDFTPGPGEGGAAVFGGWSRSGQVWTCKGYDMLEDPKFPEGIYTD LSLFSICAALFLSVHFLEDADDDDPTVTRMVVTRIEPFPGVREPCRIKQGTCLETLIAIL VLKAIGSMNLGLDTKTVFVWESSSPILPAHPSIARRPPIAFAKWERKVPGVSLLEPKDEK RGDKGFRRPCLTPLQTLEACNKTLHTPAPGLRGFALQTVTVPVHLPTPTLGNPRAWPPSG STALMDPLNLSWYDDDLERQNWSRPFNGSDGKADRPHYNYYATLLTLLIAVIVFGNVLVC MAVSREKALQTTTNYLIVSLAVADLLVATLVMPWVVYLELILGPCPGPAGKPFTYPPAST RKMGIQLLAPAKLVSAAAQGAGREALVRGPHVVGEWKFSRIHCDIFVTLDVMMCTASILN LCAISIDRYTAVAMPMLYNTRYSSKRRVTVMISIVWVLSFTISCPLLFGLNNADQNECII ANPAFVVYSSIVSFYVPFIVTLLVYIKIYIVLRRRRKRVNTKRSSRAFRAHLRAPLKGNC THPEDMKLCTVIMKSNGSFPVNRRRVGSPLAMVLRPQKLANGRSTPETPTLPQLKADSPC TPPSRHEVRHLGSARHGCVRENGWPYQRNKNDNNGYSHFSNIYHIPIISSNPHHDPGSTP DSPAKPEKNGHAKDHPKIAKIFEIQTMPNGKTRTSLKTMSRRKLSQQKEKKATQMLAIVL GVFIICWLPFFITHILNIHCDCNIPPVLYSAFTWLGYVNSAVNPIIYTTFNIEFRKAFLK ILHC >gi568815587f:113287885_113500264|GENSCAN_predicted_CDS_3|2355_bp atgggcacaacagaacagagtgtggccgtgcagcccagcgggcctcccgactttacccca ggccccggggagggtggggctgctgtctttggaggctggagcagaagtgggcaggtttgg acttgcaagggctatgacatgctagaggatcccaagttccctgagggcatctacacagat ctgagtttgttcagcatttgtgcagccctcttcttgtctgttcatttccttgaagatgca gatgatgatgatcctacagtgaccaggatggtagtgacaaggattgagccctttccagga gtcagagagccctgtaggataaagcaggggacctgtctagagaccttaattgcaatcctg gttctgaaagcaataggctctatgaacctgggtcttgacaccaaaacagtatttgtgtgg gagtcctcaagtcccatcttgcctgcccacccctccattgcaagaaggcctcccatcgcc tttgccaaatgggaaagaaaggtccccggtgtcagcctgctggagccgaaggatgagaaa cgtggggacaagggcttccggagaccctgcctcaccccgctgcagaccctggaggcctgc aacaagaccctccacaccccagctccagggcttagaggctttgccctccagacagtgact gtgcctgtccatctacccactcccaccctcggcaacccaagagcctggccacccagtggc tccaccgccctgatggatccactgaatctgtcctggtatgatgatgatctggagaggcag aactggagccggcccttcaacgggtcagacgggaaggcggacagaccccactacaactac tatgccacactgctcaccctgctcatcgctgtcatcgtcttcggcaacgtgctggtgtgc atggctgtgtcccgcgagaaggcgctgcagaccaccaccaactacctgatcgtcagcctc gcagtggccgacctcctcgtcgccacactggtcatgccctgggttgtctacctggagctt atcctggggccctgtccaggacctgcaggaaagccctttacgtaccctcctgcctccacc cgcaaaatgggcatccagctgttagctcctgccaaactggtcagcgcagcagcacaggga gctgggagagaggccctggtacggggcccccatgtggtaggtgagtggaaattcagcagg attcactgtgacatcttcgtcactctggacgtcatgatgtgcacggcgagcatcctgaac ttgtgtgccatcagcatcgacaggtacacagctgtggccatgcccatgctgtacaatacg cgctacagctccaagcgccgggtcaccgtcatgatctccatcgtctgggtcctgtccttc accatctcctgcccactcctcttcggactcaataacgcagaccagaacgagtgcatcatt gccaacccggccttcgtggtctactcctccatcgtctccttctacgtgcccttcattgtc accctgctggtctacatcaagatctacattgtcctccgcagacgccgcaagcgagtcaac accaaacgcagcagccgagctttcagggcccacctgagggctccactaaagggcaactgt actcaccccgaggacatgaaactctgcaccgttatcatgaagtctaatgggagtttccca gtgaacaggcggagagtgggaagcccactggccatggttctgagacctcagaagctggcc aatgggagaagcaccccagaaacccccaccttgcctcagctgaaggcagactcaccgtgc acacctccaagcaggcatgaagtgagacacctcggttctgcaaggcatggatgtgtacga gaaaatggttggccataccaacgtaataaaaatgataataatggctattcacatttctca aacatctaccatatccctattatctcatcaaatcctcaccacgaccccgggagcactccc gacagccccgccaaaccagagaagaatgggcatgccaaagaccaccccaagattgccaag atctttgagatccagaccatgcccaatggcaaaacccggacctccctcaagaccatgagc cgtaggaagctctcccagcagaaggagaagaaagccactcagatgctcgccattgttctc ggcgtgttcatcatctgctggctgcccttcttcatcacacacatcctgaacatacactgt gactgcaacatcccgcctgtcctgtacagcgccttcacgtggctgggctatgtcaacagc gccgtgaaccccatcatctacaccaccttcaacattgagttccgcaaggccttcctgaag atcctccactgctga >gi568815587f:113287885_113500264|GENSCAN_predicted_peptide_4|193_aa XEETRVELNIAPKKDPTLHVETATQRGPLTSHRARVIETGSWRMQQGGLQALIEWWLTTT TGICQMSTLCMALCQVIYKLYVNGFLLKPFEAGDRRGEPAELLPAGALNGAAGPGARDRP RRVAAPDGCRRGGRAWMRRELEASSSRRRLCPRAPYGLKLQRMEEKQKKGEDRVASGYKD VDERAEIGGSYRE >gi568815587f:113287885_113500264|GENSCAN_predicted_CDS_4|582_bp ngagaggagacccgggttgagctgaatattgctccaaaaaaagaccccaccctccatgtg gagactgccactcagcggggtccactaacctcccacagagccagggtcatagaaacagga agctggcgcatgcaacagggtgggctacaggctttgatagaatggtggctaacaacaaca actggtatttgtcagatgtctacactgtgcatggcattatgtcaggtgatctacaagctt tacgtcaatggattcttactgaaaccctttgaggccggggatcgccgaggagagccggcc gagctgctgcccgccggggctctgaacggcgcggcggggccgggagccagggaccggccg aggagagtggcggccccggacggctgccggaggggcggccgcgcgtggatgcggcgggag ctggaagcctcaagcagccggcgccgtctctgcccccgggcgccctatggcttgaagcta cagagaatggaagaaaaacagaagaaaggtgaagacagggtggcaagtgggtataaagat gtggatgagagagctgaaataggtggaagttacagggaatag