GENSCAN 1.0 Date run: 3-Nov-116 Time: 17:20:56 Sequence gi568815587r:113310730_113524651 : 213922 bp : 46.19% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5514 5586 73 2 1 81 84 106 0.808 8.48 1.02 Intr + 12559 12722 164 2 2 92 4 189 0.713 10.59 1.03 Intr + 13265 13286 22 1 1 47 110 4 0.647 -4.38 1.04 Intr + 14795 14916 122 0 2 74 82 224 0.996 20.61 1.05 Intr + 19191 19250 60 2 0 57 116 72 0.969 5.83 1.06 Intr + 24237 24308 72 2 0 70 103 71 0.913 6.40 1.07 Intr + 28045 28105 61 0 1 86 105 27 0.982 2.51 1.08 Intr + 28557 28745 189 1 0 118 58 242 0.905 23.76 1.09 Intr + 29935 30004 70 2 1 71 97 24 0.438 -0.16 1.10 Intr + 31108 31196 89 1 2 108 115 26 0.527 6.91 1.11 Intr + 33543 33711 169 1 1 71 115 161 0.526 16.10 1.12 Intr + 39344 39436 93 2 0 75 86 112 0.984 8.78 1.13 Intr + 41198 41258 61 2 1 68 97 46 0.988 2.24 1.14 Intr + 41341 41478 138 0 0 74 84 112 0.960 10.06 1.15 Intr + 49211 49279 69 1 0 83 98 17 0.737 1.58 1.16 Intr + 51672 51773 102 2 0 83 116 56 0.990 8.27 1.17 Intr + 53599 53731 133 0 1 42 49 65 0.606 -1.78 1.18 Intr + 54106 54331 226 2 1 112 85 241 0.892 23.24 1.19 Intr + 56336 56406 71 2 2 86 41 36 0.674 -2.47 1.20 Intr + 57458 57529 72 0 0 76 75 47 0.561 1.58 1.21 Intr + 57735 57919 185 1 2 67 81 85 0.337 5.21 1.22 Intr + 62395 62448 54 0 0 104 78 33 0.234 3.08 1.23 Term + 67439 67480 42 1 0 147 34 -1 0.037 -2.44 1.24 PlyA + 70492 70497 6 1.05 2.00 Prom + 70679 70718 40 -3.46 2.01 Init + 77156 77340 185 1 2 88 84 339 0.990 32.09 2.02 Intr + 82752 83046 295 0 1 106 78 434 0.999 41.21 2.03 Intr + 84200 84351 152 1 2 90 94 219 0.997 21.66 2.04 Intr + 84549 84679 131 0 2 28 115 30 0.626 0.04 2.05 Intr + 86495 86613 119 0 2 17 105 148 0.953 9.58 2.06 Intr + 87251 87287 37 1 1 104 89 71 0.999 6.64 2.07 Term + 88235 89538 1304 0 2 71 42 1236 0.497 109.06 2.08 PlyA + 92977 92982 6 1.05 3.16 PlyA - 98905 98900 6 1.05 3.15 Term - 100191 99998 194 1 2 100 41 404 0.999 34.48 3.14 Intr - 102014 101827 188 1 2 89 109 272 0.768 28.73 3.13 Intr - 103598 103324 275 2 2 71 107 48 0.619 1.34 3.12 Intr - 103732 103646 87 1 0 85 103 51 0.942 6.47 3.11 Intr - 104882 104692 191 0 2 123 14 397 0.951 35.10 3.10 Intr - 106270 106134 137 0 2 96 88 264 0.998 27.41 3.09 Intr - 107407 107298 110 1 2 99 99 186 0.982 19.88 3.08 Intr - 109315 109191 125 2 2 43 58 96 0.624 2.40 3.07 Intr - 113085 113055 31 0 1 119 80 7 0.648 0.70 3.06 Intr - 113953 113638 316 0 1 99 92 597 0.796 57.27 3.05 Intr - 122372 122278 95 2 2 122 52 1 0.469 -1.34 3.04 Intr - 124459 124417 43 0 1 67 116 61 0.726 5.04 3.03 Intr - 125476 125305 172 2 1 14 -8 182 0.168 0.20 3.02 Intr - 129351 129181 171 1 0 17 92 80 0.075 1.21 3.01 Init - 136503 136284 220 0 1 91 50 108 0.121 6.10 3.00 Prom - 142550 142511 40 -4.56 4.06 PlyA - 142855 142850 6 1.05 4.05 Term - 153429 153325 105 0 0 86 49 63 0.028 0.61 4.04 Intr - 164531 164347 185 0 2 29 121 125 0.732 9.41 4.03 Intr - 164798 164780 19 2 1 71 77 31 0.115 -3.52 4.02 Intr - 180461 180294 168 2 0 89 107 59 0.287 8.04 4.01 Intr - 207030 206935 96 0 0 56 84 63 0.235 2.91 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 141283 141421 139 0 1 96 69 71 0.846 6.04 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:113310730_113524651|GENSCAN_predicted_peptide_1|778_aa GFRFTMDADKEKDLQKFLKNVDEISNLIQEMNSDDPVVQQKAVLETEKRLLLMEEDQEED ECRTTLNKTMISPPQTAMKSAEEINSALKEKGNEAFAEGNYETAILRYSEGLEKLKDMKV LYTNRAQAYMKLEDYEKALVDCEWALKCDEKCTKAYFHMGKANLALKNYSVSRECYKKIL EINPKLQTQVKGYLNQVDLQEKADLQEKEAHELLDSGKNTAVTTKNLLETLSKPDQIPLF YAGGIEILTEMINECTEQTLFRMHNGFSIISDNEVIRRCFSTAGNDAVEEMVCVSVLKLW QAVCSRNEENQRVLVIHHDRARLLAALLSSKVLAIRQQSFALLLHLAQTESGRSLIINHL DLTRLLEALVSFLDFSDKEANTAMGLFTDLALEESETPGMPLCDMKLSGWFVGLLKTDPK VSSSSALCQCIAIMGNLSAEPTTRRHMAACEEFGDGCLSLLVWAVEVSRRCLSLLNSQDG GILTRAAGVLSRTLSSSLKIVEEALRAGVVKKMMKFLKQASQERQPANVLLTNSLECTEG ELKEAPPWKRQAFLLDPVAYPAELSVMMKLLSSEDEVLVGNAALCLGNCMEVPNVASSLL KTDLLQVLLKLAGSDTQKTAVQVNAGIALGKLCTAEPRGNAPFQLPQGHSKPGSPVDCKS LKCLPRAVISNLEESLLHLKLCLLAGNQSLLVQQAEELGFSILKRMWPGWNSITILKPQQ RWQVTDSPELLTVFIHDVRTVVPVRRKMRKLRCKAVKQLVQDLDQVEGEQGVSLNFHS >gi568815587r:113310730_113524651|GENSCAN_predicted_CDS_1|2337_bp ggattccggttcacaatggatgctgataaagagaaagatttgcagaaatttcttaaaaat gtggatgaaatctccaatttaattcaggagatgaattctgatgacccagttgtgcaacag aaagctgtcctggagacagaaaagagactactgcttatggaggaagaccaggaggaggat gaatgcaggaccaccttgaacaagactatgatcagtcctccacaaactgctatgaagagt gcagaagaaataaactcagccctaaaagaaaaagggaatgaagcatttgctgaaggcaat tatgaaacagctatcctgcgctacagtgagggtttggagaagctgaaggacatgaaagtg ctgtacaccaaccgagcccaggcttatatgaaacttgaggactatgagaaggcactggtg gattgtgagtgggctctcaagtgtgatgaaaaatgcacaaaagcatattttcacatggga aaagccaacctggccctgaagaactacagtgtgtctagagagtgttataagaagatctta gaaataaaccccaagctgcaaacccaggtgaaaggttacctgaatcaagtagatcttcag gaaaaagcagaccttcaagaaaaggaagcccacgaactgctggattcaggaaagaacaca gccgtgaccaccaagaacctcctggagaccctttccaagcctgaccagatccccttgttc tatgctggggggattgagatcctgactgaaatgataaatgagtgcacagaacaaacttta ttcagaatgcacaatggatttagtatcatcagtgacaacgaggtcataagaaggtgtttt tccacagcaggaaatgatgcagttgaagaaatggtctgtgtgtctgttctcaagctctgg caagcagtgtgcagcaggaacgaggaaaaccagcgtgtgctagtgatacaccatgacagg gccaggctgttggccgccctcttgtcctccaaggtcctggccatccggcagcagagcttt gccctgctgctgcatctcgcccagactgagagcggacggagcctgatcatcaaccacctt gacctgaccagattattggaagcgctggtgtcatttcttgatttctcggataaggaggcc aacactgctatgggactgttcacagacttggctctggaagaaagtgagaccccagggatg cccctgtgtgacatgaagctatctggttggttcgtgggccttttgaagacagatcccaag gtaagcagctcctcggctctgtgccagtgcattgccatcatgggaaacctcagtgctgag cccactacccgaagacacatggcggcctgtgaggaatttggggatggctgcttgagcctc ctggtttgggctgtggaggtgagcagaaggtgcctgtctttactaaacagccaggatgga ggaatcctgacaagagctgctggtgttctgagccggaccctttcttcctctctgaaaatt gttgaggaggccttgcgagcaggagtggtaaagaaaatgatgaaattcctgaagcaggct tcccaggagaggcagcctgctaatgttcttcttaccaactccctggaatgcacagaaggt gaactgaaggaggctccaccatggaagcggcaagcctttctcctggatcctgttgcatac cctgcagagttgagcgttatgatgaagctgctcagctcggaggatgaggttctggtgggc aacgctgccctctgccttggtaactgcatggaggtgcccaacgttgcgtcttccctgcta aagacggaccttttgcaggtcttgttaaagcttgcaggcagtgacacacagaagacggcc gtgcaggtgaacgcaggcattgctctggggaagctgtgcacagctgagcccagaggcaac gctcccttccagttgccccagggtcactctaaaccagggtcacctgtggactgcaagtcc ctaaagtgcctgccgagagcagtcatctccaatttggaagagagtctgctgcacctgaag ctctgtctcctcgcaggtaatcagagtcttctggttcaacaagcagaggagctgggattt tccatcctaaagaggatgtggccgggatggaactccatcaccatcctgaagccacagcag aggtggcaggtaacagacagtccagagctgctcactgtcttcatacatgacgtgaggacg gtggttccagtgcgccgcaagatgaggaaactgaggtgcaaagcagttaagcaactggtc caagatctagaccaggttgaaggagaacagggagtgtctctaaatttccacagctga >gi568815587r:113310730_113524651|GENSCAN_predicted_peptide_2|740_aa MAADPTELRLGSLPVFTRDDFEGDWRLVASGGFSQVFQARHRRWRTEYAIKCAPCLPPDA ASSDVNYLIEEAAKMKKIKFQHIVSIYGVCKQPLGIVMEFMANGSLEKVLSTHSLCWKLR FRIIHETSLAMNFLHSIKPPLLHLDLKPGNILLDSNMHVKISDFGLSKWMEQSTRMQYIE RSALRGMLSYIPPEMFLESNKAPGPKYDVYSPPTLPPRAGVILDVQLSHSERVLCIHSFA IVIWELLTQKKPYSDITIETDILLSLLQSRVAVPESKALARKVSCKLSLRQPGEVNEDIS QELMDSDSGNYLKRALQLSDRKNLVPRDEELCIYENKVTPLHFLVAQGSVEQVRLLLAHE VDVDCQTASGYTPLLIAAQDQQPDLCALLLAHGADANRVDEDGWAPLHFAAQNGDDGTAR LLLDHGACVDAQEREGWTPLHLAAQNNFENVARLLVSRQADPNLHEAEGKTPLHVAAYFG HVSLVKLLTSQGAELDAQQRNLRTPLHLAVERGKVRAIQHLLKSGAVPDALDQSGYGPLH TAAARGKYLICKMLLRYGASLELPTHQGWTPLHLAAYKGHLEIIHLLAESHANMGALGAV NWTPLHLAARHGEEAVVSALLQCGADPNAAEQSGWTPLHLAVQRSTFLSVINLLEHHANV HARNKVGWTPAHLAALKGNTAILKVLVEAGAQLDVQDGVSCTPLQLALRSRKQGIMSFLE GKEPSVATLGGSKPGAEMEI >gi568815587r:113310730_113524651|GENSCAN_predicted_CDS_2|2223_bp atggctgccgaccccaccgagctgcggctgggcagcctccccgtcttcacccgcgacgac ttcgagggcgactggcgcctagtggccagcggcggcttcagccaggtgttccaggcgcgg cacaggcgctggcggacggagtacgccatcaagtgcgccccctgccttccacccgacgcc gccagctctgatgtgaattacctcattgaagaagctgccaaaatgaagaagatcaagttt cagcacatcgtgtctatctacggggtgtgcaagcagcccctgggtattgtgatggagttt atggccaacggctccctggagaaggtgctgtccacccacagcctctgctggaagctcagg ttccgcatcatccatgagaccagcttggccatgaacttcctgcacagcattaagccgcct ctgctccacctggacctcaagccgggcaacatactcctggacagcaacatgcatgtcaaa atttcagacttcggcctgtccaagtggatggaacagtccacccggatgcagtacatcgag aggtcggctctgcggggcatgctcagctacatcccccctgagatgttcctggagagtaac aaggccccaggacctaaatatgatgtgtacagccccccgaccctgccaccccgggctggg gtgatcttggatgttcaactaagtcattcagaaagggttctctgcatccacagctttgca attgtcatctgggagctactcactcagaagaaaccatactcagacattaccatcgagaca gacatactgctgtcactgctgcagagtcgtgtggcagtcccagagagcaaggccctggcc aggaaggtgtcctgcaagctgtcgctgcgccagcccggggaggttaatgaggacatcagc caggaactgatggacagtgactcaggaaactacctgaagcgggcccttcagctctccgac cgtaagaatttggtcccgagagatgaggaactgtgtatctatgagaacaaggtcaccccc ctccacttcctggtggcccagggcagtgtggagcaggtgaggttgctgctggcccacgag gtagacgtggactgccagacggcctctggatacacgcccctcctgatcgccgcccaggac cagcaacccgacctctgtgccctgcttttggcacatggtgctgatgccaaccgagtggat gaggatggctgggccccactgcactttgcagcccagaatggggatgacggcactgcgcgc ctgctcctggaccacggggcctgtgtggatgcccaggaacgtgaagggtggacccctctt cacctggctgcacagaataactttgagaatgtggcacggcttctggtctcccgtcaggct gaccccaacctgcatgaggctgagggcaagacccccctccatgtggccgcctactttggc catgttagcctggtcaagctgctgaccagccagggggctgagttggatgctcagcagaga aacctgagaacaccactgcacctggcagtagagcggggcaaagtgagggccatccaacac ctgctgaagagtggagcggtccctgatgcccttgaccagagcggctacggcccactgcac actgcagctgccaggggcaaatacctgatctgcaagatgctgctcaggtacggagccagc cttgagctgcccacccaccagggctggacacccctgcatctagcagcctacaagggccac ctggagatcatccatctgctggcagagagccacgcaaacatgggtgctcttggagctgtg aactggactcccctgcacctagctgcacgccacggggaggaggcggtggtgtcagcactg ctgcagtgtggggctgaccccaatgctgcagagcagtcaggctggacacccctccacctg gcggtccagaggagcaccttcctgagtgtcatcaacctcctagaacatcacgcaaatgtc cacgcccgcaacaaggtgggctggacacccgcccacctggccgccctcaagggcaacaca gccatcctcaaagtgctggtcgaggcaggcgcccagctggacgtccaggatggagtgagc tgcacacccctgcaactggccctccgcagccgaaagcagggcatcatgtccttcctagag ggcaaggagccgtcagtggccactctgggtggttctaagccaggagccgagatggaaatt tag >gi568815587r:113310730_113524651|GENSCAN_predicted_peptide_3|784_aa MGTTEQSVAVQPSGPPDFTPGPGEGGAAVFGGWSRSGQVWTCKGYDMLEDPKFPEGIYTD LSLFSICAALFLSVHFLEDADDDDPTVTRMVVTRIEPFPGVREPCRIKQGTCLETLIAIL VLKAIGSMNLGLDTKTVFVWESSSPILPAHPSIARRPPIAFAKWERKVPGVSLLEPKDEK RGDKGFRRPCLTPLQTLEACNKTLHTPAPGLRGFALQTVTVPVHLPTPTLGNPRAWPPSG STALMDPLNLSWYDDDLERQNWSRPFNGSDGKADRPHYNYYATLLTLLIAVIVFGNVLVC MAVSREKALQTTTNYLIVSLAVADLLVATLVMPWVVYLELILGPCPGPAGKPFTYPPAST RKMGIQLLAPAKLVSAAAQGAGREALVRGPHVVGEWKFSRIHCDIFVTLDVMMCTASILN LCAISIDRYTAVAMPMLYNTRYSSKRRVTVMISIVWVLSFTISCPLLFGLNNADQNECII ANPAFVVYSSIVSFYVPFIVTLLVYIKIYIVLRRRRKRVNTKRSSRAFRAHLRAPLKGNC THPEDMKLCTVIMKSNGSFPVNRRRVGSPLAMVLRPQKLANGRSTPETPTLPQLKADSPC TPPSRHEVRHLGSARHGCVRENGWPYQRNKNDNNGYSHFSNIYHIPIISSNPHHDPGSTP DSPAKPEKNGHAKDHPKIAKIFEIQTMPNGKTRTSLKTMSRRKLSQQKEKKATQMLAIVL GVFIICWLPFFITHILNIHCDCNIPPVLYSAFTWLGYVNSAVNPIIYTTFNIEFRKAFLK ILHC >gi568815587r:113310730_113524651|GENSCAN_predicted_CDS_3|2355_bp atgggcacaacagaacagagtgtggccgtgcagcccagcgggcctcccgactttacccca ggccccggggagggtggggctgctgtctttggaggctggagcagaagtgggcaggtttgg acttgcaagggctatgacatgctagaggatcccaagttccctgagggcatctacacagat ctgagtttgttcagcatttgtgcagccctcttcttgtctgttcatttccttgaagatgca gatgatgatgatcctacagtgaccaggatggtagtgacaaggattgagccctttccagga gtcagagagccctgtaggataaagcaggggacctgtctagagaccttaattgcaatcctg gttctgaaagcaataggctctatgaacctgggtcttgacaccaaaacagtatttgtgtgg gagtcctcaagtcccatcttgcctgcccacccctccattgcaagaaggcctcccatcgcc tttgccaaatgggaaagaaaggtccccggtgtcagcctgctggagccgaaggatgagaaa cgtggggacaagggcttccggagaccctgcctcaccccgctgcagaccctggaggcctgc aacaagaccctccacaccccagctccagggcttagaggctttgccctccagacagtgact gtgcctgtccatctacccactcccaccctcggcaacccaagagcctggccacccagtggc tccaccgccctgatggatccactgaatctgtcctggtatgatgatgatctggagaggcag aactggagccggcccttcaacgggtcagacgggaaggcggacagaccccactacaactac tatgccacactgctcaccctgctcatcgctgtcatcgtcttcggcaacgtgctggtgtgc atggctgtgtcccgcgagaaggcgctgcagaccaccaccaactacctgatcgtcagcctc gcagtggccgacctcctcgtcgccacactggtcatgccctgggttgtctacctggagctt atcctggggccctgtccaggacctgcaggaaagccctttacgtaccctcctgcctccacc cgcaaaatgggcatccagctgttagctcctgccaaactggtcagcgcagcagcacaggga gctgggagagaggccctggtacggggcccccatgtggtaggtgagtggaaattcagcagg attcactgtgacatcttcgtcactctggacgtcatgatgtgcacggcgagcatcctgaac ttgtgtgccatcagcatcgacaggtacacagctgtggccatgcccatgctgtacaatacg cgctacagctccaagcgccgggtcaccgtcatgatctccatcgtctgggtcctgtccttc accatctcctgcccactcctcttcggactcaataacgcagaccagaacgagtgcatcatt gccaacccggccttcgtggtctactcctccatcgtctccttctacgtgcccttcattgtc accctgctggtctacatcaagatctacattgtcctccgcagacgccgcaagcgagtcaac accaaacgcagcagccgagctttcagggcccacctgagggctccactaaagggcaactgt actcaccccgaggacatgaaactctgcaccgttatcatgaagtctaatgggagtttccca gtgaacaggcggagagtgggaagcccactggccatggttctgagacctcagaagctggcc aatgggagaagcaccccagaaacccccaccttgcctcagctgaaggcagactcaccgtgc acacctccaagcaggcatgaagtgagacacctcggttctgcaaggcatggatgtgtacga gaaaatggttggccataccaacgtaataaaaatgataataatggctattcacatttctca aacatctaccatatccctattatctcatcaaatcctcaccacgaccccgggagcactccc gacagccccgccaaaccagagaagaatgggcatgccaaagaccaccccaagattgccaag atctttgagatccagaccatgcccaatggcaaaacccggacctccctcaagaccatgagc cgtaggaagctctcccagcagaaggagaagaaagccactcagatgctcgccattgttctc ggcgtgttcatcatctgctggctgcccttcttcatcacacacatcctgaacatacactgt gactgcaacatcccgcctgtcctgtacagcgccttcacgtggctgggctatgtcaacagc gccgtgaaccccatcatctacaccaccttcaacattgagttccgcaaggccttcctgaag atcctccactgctga >gi568815587r:113310730_113524651|GENSCAN_predicted_peptide_4|190_aa IAFGLELEHQLSWVSSLLAHPEDFEFTKLHDHVIETGSWRMQQGGLQALIEWWLTTTTGI CQMSTLCMALCQVIYKLYVNGFLLKPFEAGDRRGEPAELLPAGALNGAAGPGARDRPRRV AAPDGCRRGGRAWMRRELEASSSRRRLCPRAPYGLKLQRMEEKQKKGEDRVASGYKDVDE RAEIGGSYRE >gi568815587r:113310730_113524651|GENSCAN_predicted_CDS_4|573_bp atagcttttggacttgaactggaacatcagctctcttgggtctccagcctgctggcccac cctgaagattttgaatttaccaaactccatgatcatgtcatagaaacaggaagctggcgc atgcaacagggtgggctacaggctttgatagaatggtggctaacaacaacaactggtatt tgtcagatgtctacactgtgcatggcattatgtcaggtgatctacaagctttacgtcaat ggattcttactgaaaccctttgaggccggggatcgccgaggagagccggccgagctgctg cccgccggggctctgaacggcgcggcggggccgggagccagggaccggccgaggagagtg gcggccccggacggctgccggaggggcggccgcgcgtggatgcggcgggagctggaagcc tcaagcagccggcgccgtctctgcccccgggcgccctatggcttgaagctacagagaatg gaagaaaaacagaagaaaggtgaagacagggtggcaagtgggtataaagatgtggatgag agagctgaaataggtggaagttacagggaatag