GENSCAN 1.0    Date run: 4-Nov-116    Time: 14:36:38

Sequence gi568815589f:34134215_34351529 : 217315 bp : 43.94% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 Intr -    364     24  341  2  2   71   56   428 0.930  33.09
 1.03 Intr -    599    458  142  2  1    4   31   164 0.469   2.23
 1.02 Intr -  17467  17286  182  2  2   36   64   121 0.431   4.09
 1.01 Init -  24933  24924   10  0  1   79   89    -3 0.243  -0.47
 1.00 Prom -  38265  38226   40                              -4.06

 2.00 Prom +  38290  38329   40                              -5.36
 2.01 Init +  44847  44910   64  2  1   99   26   125 0.344   6.75
 2.02 Intr +  67675  67783  109  2  1  102   97    31 0.541   4.74
 2.03 Intr +  78870  79015  146  0  2   21   61   126 0.040   3.13
 2.04 Intr + 100002 100126  125  1  2   76  111    72 0.946   8.60
 2.05 Intr + 106971 107894  924  2  0   90  127   592 0.992  54.73
 2.06 Intr + 115565 115747  183  1  0  109   99   209 0.923  24.08
 2.07 Intr + 116444 116545  102  1  0   82   81   131 0.991  12.17
 2.08 Term + 117178 117318  141  0  0  118   53   126 0.999  10.03
 2.09 PlyA + 118291 118296    6                               1.05

 3.11 PlyA - 118335 118330    6                               1.05
 3.10 Term - 120306 120166  141  0  0   98   43   136 0.966   8.03
 3.09 Intr - 120951 120858   94  2  1   73  101   187 0.998  18.47
 3.08 Intr - 123767 121611 2157  1  0   42   44   591 0.700  37.44
 3.07 Intr - 125491 125382  110  1  2  120  131   241 0.999  30.68
 3.06 Intr - 128958 128887   72  0  0  100  103    28 0.873   5.10
 3.05 Intr - 135148 135043  106  0  1   88   87    36 0.988   3.72
 3.04 Intr - 137716 137595  122  1  2   84   99   142 0.999  14.19
 3.03 Intr - 152490 152403   88  2  1   83   76    62 0.567   4.47
 3.02 Intr - 156175 155960  216  0  0   70   79   132 0.264   8.12
 3.01 Init - 177132 176510  623  0  2   83   90   307 0.280  25.22
 3.00 Prom - 177580 177541   40                              -6.66

 4.00 Prom + 179460 179499   40                              -2.56
 4.01 Init + 180152 180162   11  1  2   68   77    15 0.802  -2.19
 4.02 Intr + 184213 184590  378  1  0   97   17   623 0.865  50.18
 4.03 Intr + 184636 185040  405  1  0   56   51   665 0.851  53.06
 4.04 Term + 185153 185441  289  2  1    0   49   456 0.801  27.75
 4.05 PlyA + 186103 186108    6                               1.05

 5.00 Prom + 189818 189857   40                              -6.46
 5.01 Init + 204826 204952  127  0  1   91   75   104 0.992   9.72
 5.02 Term + 208910 209226  317  0  2  112   49   382 0.999  31.80
 5.03 PlyA + 209465 209470    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589f:34134215_34351529|GENSCAN_predicted_peptide_1|225_aa
MDSGPFTSDCCCYWQGQELSSVLITEKSKASLNSWTLCPNGSSQLRYMTYSGKEGALWMS
ADKEARRTRASSVAAAWQQRGSGGGGGGRSSGCFSRVAGSPSSVVDYLISGGLIDFIAEV
DLTSALTRKITLKTPLISSPMDTVTEADVAIAMALMGDTGFIHHNCTPELQAKELRKVKK
FEQGFITDPTVLSPSHTVGDVLEAKMRHGFSGIPITETGTMGSKL

>gi568815589f:34134215_34351529|GENSCAN_predicted_CDS_1|675_bp
atggatagtggcccctttacctctgactgctgctgctactggcaaggccaagagctgagt
tctgtgctgatcacagagaagagcaaggcctcactgaattcatggactttatgtcccaac
gggagctctcaactgaggtacatgacttactccggcaaagaaggggcactctggatgtct
gcagataaggaggcccgccggacccgcgctagcagcgtggcagcagcgtggcagcagcgc
ggcagcggcggcggcggcggcgggcggtccagcgggtgtttctctcgggtcgcagggtct
cccagcagcgtggtggactacctgatcagcggtggactcatagacttcatagctgaggtg
gaccttacctcagccctgacccggaagatcacgctgaagacaccgctgatctcctccccc
atggacactgtgacagaggctgacgtggccatcgcgatggctctgatgggagatactggt
ttcattcaccacaactgcaccccagagctccaggccaaggagctacggaaggtcaagaag
tttgaacagggcttcatcacggaccccacggtgctgagcccctcccacactgtaggtgat
gtgctggaggccaagatgcggcatggcttctctggcatccccatcactgagacgggcacc
atgggcagcaagctg

>gi568815589f:34134215_34351529|GENSCAN_predicted_peptide_2|597_aa
MAAAALAVATVTAWPGAGRVGGLICQSVSQNSEKLFYVYWFILKAIAKYTDEEICLARIK
NKSHMIISIDAEKAFDKIQHPFMIRTLSKISIRGTYLNVMKAISGPGTFSYLDDVPFKTG
DKFKTPAKVGLPIGFSLPDCLQVVREVQYDFSLEKKTIEWAEEIKKIEEAEREAECKIAE
AEAKVNSKSGPEGDSKMSFSKTHSTATMPPPINPILASLQHNSILTPTRVSSSATKQKVL
SPPHIKADFNLADFECEEDPFDNLELKTIDEKEELRNILVGTTGPIMAQLLDNNLPRGGS
GSVLQDEEVLASLERATLDFKPLHKPNGFITLPQLGNCEKMSLSSKVSLPPIPAVSNIKS
LSFPKLDSDDSNQKTAKLASTFHSTSCLRNGTFQNSLKPSTQSSASELNGHHTLGLSALN
LDSGTEMPALTSSQMPSLSVLSVCTEESSPPNTGPTVTPPNFSVSQVPNMPSCPQAYSEL
QMLSPSERQCVETVVNMGYSYECVLRAMKKKGENIEQILDYLFAHGQLCEKGFDPLLVEE
ALEMHQCSEEKMMEFLQLMSKFKEMGFELKDIKEVLLLHNNDQDNALEDLMARAGAS

>gi568815589f:34134215_34351529|GENSCAN_predicted_CDS_2|1794_bp
atggcggctgcggcactggcggtggctacggtgacggcctggcccggagcgggcagagtt
ggaggtttaatttgccagagtgtttcacagaactcagagaaactcttttatgtttactgg
tttattctaaaggcgattgcaaagtatacagatgaagagatttgtctggcaagaattaaa
aacaaaagtcacatgatcatctcaatagatgcagaaaaagcatttgacaaaatccagcat
ccctttatgattagaactctcagcaaaatcagcatacgagggacatacctcaatgtaatg
aaagccatctctgggccagggactttcagttaccttgatgatgtcccatttaagacagga
gacaaattcaaaacaccagctaaagttggtctacctattggcttctccttgcctgattgt
ttgcaggttgtcagagaagtacagtatgacttctctttggaaaagaaaaccattgagtgg
gctgaagagattaagaaaatcgaagaagccgagcgggaagcagagtgcaaaattgcggaa
gcagaagctaaagtgaattctaagagtggcccagagggcgatagcaaaatgagcttctcc
aagactcacagtacagccacaatgccacctcctattaaccccatcctcgccagcttgcag
cacaacagcatcctcacaccaactcgggtcagcagtagtgccacgaaacagaaagttctc
agcccacctcacataaaggcggatttcaatcttgctgactttgagtgtgaagaagaccca
tttgataatctggagttaaaaactattgatgagaaggaagagctgagaaatattctggta
ggaaccactggacccattatggctcagttattggacaataacttgcccaggggaggctct
gggtctgtgttacaggatgaggaggtcctggcatccttggaacgggcaaccctagatttc
aagcctcttcataaacccaatggctttataaccttaccacagttgggcaactgtgaaaag
atgtcactgtcttccaaagtgtccctcccccctatacctgcagtaagcaatatcaaatcc
ctgtctttccccaaacttgactctgatgacagcaatcagaagacagccaagctggcgagc
actttccatagcacatcctgcctccgcaatggcacgttccagaattccctaaagccttcc
acccaaagcagtgccagtgagctcaatgggcatcacactcttgggctttcagctttgaac
ttggacagtggcacagagatgccagccctgacatcctcccagatgccttccctctctgtt
ttgtctgtgtgcacagaggaatcatcacctccaaatactggtcccacggtcacccctcct
aatttctcagtgtcacaagtgcccaacatgcccagctgtccccaggcctattctgaactg
cagatgctgtcccccagcgagcggcagtgtgtggagacggtggtcaacatgggctactcg
tacgagtgtgtcctcagagccatgaagaagaaaggagagaatattgagcagattctcgac
tatctctttgcacatggacagctttgtgagaagggcttcgaccctcttttagtggaagag
gctctggaaatgcaccagtgttcagaagaaaagatgatggagtttcttcagttaatgagc
aaatttaaggagatgggctttgagctgaaagacattaaggaagttttgctattacacaac
aatgaccaggacaatgctttggaagacctcatggctcgggcaggagccagctga

>gi568815589f:34134215_34351529|GENSCAN_predicted_peptide_3|1242_aa
MASWLYECLCEAELAQYYSHFTALGLQKIDELAKITMKDYSKLGVHDMNDRKRLFQLIKI
IKIMQEEDKAVSIPERHLQTSSLRIKSQELRSGPRRQLNFDSPADNKDRNASNDGFEMCS
LSDFSANEQKSTYLKVLEHMLPDDSQYHTKTGILNATAGDSYVQTEISTSLFSPNYLSAI
LGDCDIPIIQRISHVSGYNYGIPHSCIRGNATCFAYGQTGAGKTYTMIGTHENPGLYALA
AKDIFRQLEVSQPRKHLFVWISFYEIYCGQLYDLLNRRKRLFAREDSKHMVQIVGLQELQ
VDSVELLLEVILKGSKERSTGATGVNADSSRSHAVIQIQIKDSAKRTFGRISFIDLAGSE
RAADARDSDRQTKMEGAEINQSLLALKECIRALDQEHTHTPFRQSKLTQVLKDSFIGNAK
TCMIANISPSHVATEHTLNTLRYADRVKELKKGIKCCTSVTSRNRTSGNSSPKRIQSSPG
ALSEDKCSPKKVKLGFQQSLTVAAPGSTRGKVHPLTSHPPNIPFTSAPKVSGKRGGSRGS
PSQEWVIHASPVKGTVRSGHVAKKKPEESAPLCSEKNRMGNKTVLGWESRASGPGEGLVR
GKLSTKCKKVQTVQPVQKQLVSRVELSFGNAHHRAEYSQDSQRGTPARPASEAWTNIPPH
QKEREEHLRFYHQQFQQPPLLQQKLKYQPLKRSLRQYRPPEGQLTNETPPLFHSYSENHD
GAQVEELDDSDFSEDSFSHISSQRATKQRNTLENSEDSFFLHQTWGQGPEKQVAERQQSL
FSSPRTGDKKDLTKSWVDSRDPINHRRAALDHSCSPSKGPVDWSRENSTSSGPSPRDSLA
EKPYCSQVDFIYRQERGGGSSFDLRKDASQSEVSGENEGNLPSPEEDGFTISLSHVAVPG
SPDQRDTVTTPLREVSADGPIQVTSTVKNGHAVPGEDPRGQLGTHAEYASGLMSPLTMSL
LENPDNEGSPPSEQLVQDGATHSLVAESTGGPVVSHTVPSGDQEAALPVSSATRHLWLSS
SPPDNKPGGDLPALSPSPIRQHPADKLPSREADLGEACQSRETVLFSHEHMGSEQYDADA
EETGLDGSWGFPGKPFTTIHMGVPHSGPTLTPRTGSSDVADQLWAQERKHPTRLGWQEFG
LSTDPIKLPCNSENVTWLKPRPISRQVVIRAHQEQLDEMAELGFKEETLMSQLASNDFED
FVTQLDEIMVLKSKCIQSLRSQLQLYLTCHGPTAAPEGTVPS

>gi568815589f:34134215_34351529|GENSCAN_predicted_CDS_3|3729_bp
atggcatcctggttatatgaatgtctttgtgaagctgaacttgcacagtattattctcat
ttcactgcccttggccttcagaaaatagatgaattagccaagattacaatgaaggactac
tccaaattaggagtccatgacatgaacgaccgcaaacgtctcttccaacttatcaaaatt
attaagattatgcaagaagaagataaagcagtcagtatcccagagcgtcatcttcagaca
agcagcctgcgcatcaaatctcaggaattaagatctggccctcgcagacagctgaatttt
gattctcctgctgacaataaagacagaaatgccagcaatgatgggtttgaaatgtgcagt
ttatcagatttctctgcaaatgaacagaagtccacttacctaaaagtgctagaacacatg
ctaccagatgattcccagtaccatacaaaaacaggaattctgaatgccacagctggtgat
tcctatgtgcaaacagaaatcagcacttcactcttttcaccaaattacctttctgcaata
ctgggggattgtgatattcccattattcaaagaatctctcatgtttcagggtataactat
ggaatccctcattcttgtatcagaggcaatgccacttgctttgcttatggacagacaggt
gctggaaagacctacaccatgataggaactcatgagaacccaggattgtatgctctagct
gccaaagatatcttcaggcaactagaagtgtcccagccaagaaagcacctctttgtgtgg
atcagcttctatgaaatttactgtggacagctttatgacctcctaaatagaagaaaaagg
ctctttgcaagagaagatagcaagcacatggtgcagatagtgggactgcaagagcttcag
gtggacagtgtggagctcctcttagaggtgatcttaaagggcagcaaggagcgcagcact
ggggccactggagttaatgcagactcctcccgctcccatgccgtcatccaaattcagatc
aaagattcagccaagaggacatttggcaggatctcttttattgacttggctggcagtgaa
agagcagcagatgcaagggactcagatagacagacaaagatggaaggtgcagaaataaat
cagagtctactggctctgaaggaatgtatccgagcactggatcaggaacacacccatact
cccttcaggcaaagcaaactaactcaggtcctgaaggactctttcatcggcaatgccaaa
acctgcatgatcgccaacatctcaccaagccacgtggccactgaacacactctcaacacc
ttgcgctatgctgaccgggtcaaagaactaaagaaaggcattaagtgttgcacttcagtt
accagtcgaaatcggacatctggaaactcctctccaaaacgaattcagagctcccctggg
gctttgtcagaggacaaatgttctcccaaaaaagtcaagctgggatttcagcagtcactc
acagtggcagcccctggttccacgagagggaaggtccatcctctgaccagccacccaccc
aacattccttttacttctgcacctaaggtctctggtaaaaggggtggctccagagggagt
ccttcacaagagtgggtcattcatgctagccctgtgaaaggaactgtgcgctctggacat
gtggccaaaaaaaagccagaagagtcagcaccattgtgctctgagaaaaatcgaatgggc
aacaaaactgtccttgggtgggaaagcagggcctcaggcccaggagaaggcctagtgcgt
ggtaagctgtccaccaagtgcaagaaagtgcagacagtgcagccagtacagaagcagctt
gtgtctcgagttgagctctcctttggcaacgcccaccacagggctgagtacagtcaagac
agccagaggggcacgcctgctaggcctgcctctgaagcttggacaaacatcccgccacat
cagaaggagagggaggaacatctgcgtttctatcaccagcagttccaacagccacctctc
ctccaacagaagttaaaataccaaccactgaaaaggtctttacgccagtacaggccccca
gagggtcagctcacgaatgagactccgcctctgttccactcttactctgaaaaccatgat
ggagcccaagtagaggaacttgatgacagtgatttcagtgaagattctttttcacacatc
tctagtcagagggccacaaagcaaaggaacaccctggagaatagcgaagactcattcttc
ctgcaccagacgtggggacagggtcctgagaagcaggtggcagaaagacagcagagtctg
ttttctagccccaggacaggtgacaagaaagatctaactaaaagctgggtggactccagg
gaccccataaaccacagaagagcagcactcgatcacagctgcagcccaagtaaggggccc
gtggactggagcagagagaactctacttcctcagggccttctcccagagacagcctggca
gagaagccatactgttcacaggtagatttcatatatagacaggaaagaggtggaggctct
tcctttgatctcagaaaggatgcctcccaaagtgaggtttctggggagaatgagggcaac
ttgccatccccagaggaagatggtttcactatctcattgtcccacgttgcagttcctgga
tccccagaccaaagagacacagtcaccacacctctgagagaagtcagtgcagacggccca
atccaggtgaccagcactgtgaaaaacggtcatgctgtcccaggagaggatcctaggggg
cagttaggcacgcatgctgaatatgcttctggactcatgtctcccctcaccatgtccctc
ctggagaacccagacaacgaagggtctcctccctcggagcagctggtccaggatggggct
acgcacagtctagtggcagagagcacagggggcccagttgtgagccacacagtgccatct
ggtgatcaagaggcagccttgccagtgtcttcagcaactaggcacctgtggctgtcctca
tctccccctgataataagcctggtggtgatcttccagctctgtccccatcacccatccgt
cagcacccagctgacaagctgcccagcagggaggcagacctaggagaggcctgccagagc
agagagactgtacttttctcccacgaacacatgggtagtgagcagtatgatgctgatgca
gaggagacggggctggatggctcctggggtttcccaggaaagcccttcaccaccatacat
atgggggtaccccattctggacctacactcaccccacgaacaggaagtagtgatgtggct
gaccagctctgggcccaggagagaaaacatcctacaaggcttggttggcaggagtttggt
ttgtccacagaccccatcaagttgccctgcaacagtgaaaatgtcacatggctcaaaccc
aggccgatctcaaggcaggtggtcatccgagcacaccaggaacagctggatgaaatggct
gagctcggcttcaaggaggagacgctgatgagccagctggcttctaatgattttgaagat
tttgtgacccagctggatgaaatcatggttctgaaatccaagtgtatccagagtctgagg
agccagctgcagctctatctcacctgccacgggcccaccgcagcccctgagggaacagtg
ccgtcttag

>gi568815589f:34134215_34351529|GENSCAN_predicted_peptide_4|360_aa
MLASAFCLLAVALATEVKKPAATAAPGTAEKLSPKAATLAEHSAGLAFSLYQAMAKDQAV
ENILVSPVVVASSLGLVSLGGKATTASEAKAVLSAKQLSDEEVHAGVGEPLRSLSNSTAR
NVTWKLCSRLSKQHYNCEHSKINFHDKRSALQSIHEWAVQTTDGKLPKVTKDMECMDGAL
LVNTMFFKPHWNEKFHHKMVENRGFMVTRFYTVGVMVMHQTGLYNYYDNEKEKLQIVEMP
LAHKLSSLIILMPHHVEPLEALKSWLGLTEAIDKNKANLSRMPHKKDLYLTSVFHATAFE
LDTDGNSFDQDIYGSKELRSPKLFYSDHPFIFLVWDTQSGSLLFTGHLVRPKVDKMQDEF

>gi568815589f:34134215_34351529|GENSCAN_predicted_CDS_4|1083_bp
atgttggccagcgccttctgcctcctggcggtggccttggcgaccgaggtgaagaaacct
gcagccacagcagctcctggcaccgcagagaagctgagccccaaggcagccacgctggcc
gaacacagcgccggcctggccttcagcctgtaccaggccatggccaaggaccaggcggtg
gagaacatcctggtgtcgcccgtggtggtggcctcgtcgttggggctcgtgtcgctgggc
ggcaaggcgaccacggcgtcggaggccaaggcagtgctgagtgccaagcagctgagcgac
gaggaggtgcacgccggcgtgggcgagccgctgcgttcactcagcaactccaccgcgcgc
aacgtgacctggaagctgtgcagtcgcctcagcaagcagcactacaactgcgagcactcc
aagatcaatttccatgacaagcgcagtgcgctgcagtccatccacgagtgggccgtgcag
accaccgacggcaagctgcccaaggtcaccaaggacatggagtgcatggatggcgccctg
cttgtcaacaccatgttcttcaagccacactggaatgagaaattccaccacaagatggtg
gaaaaccgtggcttcatggtgactcggttctataccgtgggtgtcatggtgatgcaccag
acaggcctctacaactactatgacaatgagaaggaaaagctgcaaatcgtggagatgccc
ctggcccacaagctctccagcctcatcatcctcatgccccaccacgtggagcccctcgag
gccttaaaaagctggcttggcctgactgaggccattgacaagaacaaggcaaacttgtca
cgcatgccacacaagaaggacctgtacctgaccagcgtgttccacgccaccgcctttgag
ttggacacagacggcaactcctttgaccaggacatctatgggagcaaggagctgcgcagc
cccaagctgttctactccgaccaccccttcatcttcctggtgtgggacacccagagcggc
tccctgctgttcactgggcacctggtccggcctaaggttgacaagatgcaagacgagttt
tag

>gi568815589f:34134215_34351529|GENSCAN_predicted_peptide_5|147_aa
MALRACGLIIFRRCLIPKVDNNAIEFLLLQASDGIHHWTPPKGHVEPGEDDLETALRETQ
EEAGIEAGQLTIIEGFKRELNYVARNKPKTVIYWLAEVKDYDVEIRLSHEHQAYRWLGLE
EACQLAQFKEMKAALQEGHQFLCSIEA

>gi568815589f:34134215_34351529|GENSCAN_predicted_CDS_5|444_bp
atggccttgagagcatgtggcttgatcatcttccgaagatgcctcattcccaaagtggac
aacaatgcaattgagtttttactgctgcaggcatcagatggcattcatcactggactcct
cccaaaggccatgtggaaccaggagaggatgacttggaaacagccctgagggagacccaa
gaggaagcaggcatagaagcaggccagctgaccattattgaggggttcaaaagggaactc
aattatgtggccaggaacaagcctaaaacagtcatttactggctggcggaggtgaaggac
tatgacgtggagatccgcctctcccatgagcaccaagcctaccgctggctggggctggag
gaggcctgccagttggctcagttcaaggagatgaaggcagcgctccaagaaggacaccag
tttctttgctccatagaggcctga