GENSCAN 1.0 Date run: 3-Nov-116 Time: 21:25:18

Sequence gi568815593f:171320407_171556802 : 236396 bp : 47.98% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1282   1403  122  0  2   93   63    58 0.122   3.09
 1.02 Intr +  15965  16079  115  2  1   69  103    87 0.938   8.75
 1.03 Term +  17865  18026  162  2  0   64   47    62 0.408  -2.36
 1.04 PlyA +  19033  19038    6                               1.05

 2.03 PlyA -  19406  19401    6                               1.05
 2.02 Term -  24905  24590  316  1  1    6   36   354 0.842  16.41
 2.01 Init -  25224  24944  281  0  2   97   -7   390 0.002  26.88
 2.00 Prom -  28352  28313   40                              -3.06

 3.00 Prom +  29886  29925   40                              -8.06
 3.01 Init +  32724  32849  126  2  0   50   89    48 0.020   1.31
 3.02 Intr +  45890  46153  264  1  0   64   55   345 0.171  26.51
 3.03 Intr +  46331  46444  114  1  0   39    4   146 0.368   1.94
 3.04 Term +  50904  51071  168  2  0   58   52   161 0.975   7.38
 3.05 PlyA +  53047  53052    6                               1.05

 4.00 Prom +  55447  55486   40                              -5.86
 4.01 Init +  65367  65379   13  2  1  113  103     8 0.504   3.82
 4.02 Intr +  67209  67284   76  1  1   10   59    72 0.289  -4.93
 4.03 Intr +  67527  67648  122  0  2   53   77   116 0.593   7.14
 4.04 Intr +  69645  69724   80  1  2   97   32   116 0.978   6.17
 4.05 Intr +  70899  71018  120  2  0   86  103    79 0.997   9.89
 4.06 Intr +  71300  71393   94  1  1   99   77    -4 0.991  -0.86
 4.07 Intr +  72304  72410  107  2  2   47   80   139 0.991   8.93
 4.08 Intr +  72508  72572   65  0  2   60   89   223 0.996  17.12
 4.09 Intr +  79747  79804   58  1  1   51   87   135 0.999   8.59
 4.10 Intr +  80433  80519   87  2  0   49  106    69 0.964   4.97
 4.11 Intr +  84896  84997  102  1  0  103   87    98 0.999  11.57
 4.12 Intr +  87294  87368   75  2  0   91   78    55 0.835   4.51
 4.13 Intr +  99556  99825  270  0  0   66   69   136 0.012   7.24
 4.14 Intr +  99966 100037   72  2  0  107   98    42 0.012   6.70
 4.15 Term + 100169 100342  174  1  0   63   47    96 0.016   0.76
 4.16 PlyA + 102839 102844    6                               1.05

 5.00 Prom + 108301 108340   40                              -7.26
 5.01 Init + 108956 109214  259  1  1  101   41   224 0.467  16.52
 5.02 Intr + 113067 113185  119  1  2    3   69    35 0.279  -6.72
 5.03 Intr + 115687 115867  181  0  1   94   80   441 0.989  43.34
 5.04 Intr + 128741 128847  107  0  2   47  113   180 0.561  16.33
 5.05 Term + 136133 136399  267  1  0  115   47   508 0.981  44.89
 5.06 PlyA + 137723 137728    6                               1.05

 6.09 PlyA - 138534 138529    6                               1.05
 6.08 Term - 142230 142084  147  0  0   91   42    54 0.306  -1.00
 6.07 Intr - 147040 146896  145  0  1   77   94   102 0.929   9.98
 6.06 Intr - 151422 151362   61  1  1  102   80    78 0.921   6.19
 6.05 Intr - 154211 154091  121  2  1   77    4    73 0.753  -2.13
 6.04 Intr - 154687 154619   69  1  0   52  109    39 0.509   1.78
 6.03 Intr - 157987 157861  127  0  1   51   34   154 0.769   6.98
 6.02 Intr - 163875 163717  159  2  0   85   82    80 0.597   6.20
 6.01 Init - 167263 167208   56  1  2   65   89    62 0.947   4.87
 6.00 Prom - 175978 175939   40                              -6.16

 7.03 PlyA - 176549 176544    6                               1.05
 7.02 Term - 177345 177196  150  0  0  -38   44   357 0.937  16.61
 7.01 Init - 181440 181372   69  0  0   92  106    51 0.991   8.35
 7.00 Prom - 189950 189911   40                              -4.06

 8.05 PlyA - 190972 190967    6                               1.05
 8.04 Term - 199216 198895  322  0  1   84   54   112 0.252   1.69
 8.03 Intr - 213092 212830  263  2  2   15   87   140 0.082   2.89
 8.02 Intr - 214894 214761  134  2  2  109   86   -12 0.171   1.06
 8.01 Init - 216881 216818   64  2  1   83   75    61 0.464   5.61
 8.00 Prom - 219538 219499   40                              -1.56

 9.07 PlyA - 219763 219758    6                               1.05
 9.06 Term - 223483 223403   81  1  0  111   44    35 0.653  -0.91
 9.05 Intr - 224495 224398   98  0  2  106   98     4 0.457   2.93
 9.04 Intr - 225423 225383   41  2  2   92   93    26 0.799   1.37
 9.03 Intr - 231397 231274  124  2  1   89   74    61 0.709   4.44
 9.02 Intr - 234947 234839  109  2  1   58   62    47 0.723  -1.04
 9.01 Init - 235437 235327  111  0  0   46  105    91 0.901   6.81

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  15179  15369  191  1  2   83   29    83 0.906   0.38
S.002 Init -  25224  24941  284  0  2   97   77   392 0.956  35.51
S.003 Init -  99750  99517  234  0  0   40   66   344 0.935  23.54
S.004 Init - 190493 190441   53  2  2   69  103    58 0.805   6.03

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593f:171320407_171556802|GENSCAN_predicted_peptide_1|132_aa
PPEGAALDKGPRPFAQFLLKSIGGKGICVSQLRLRFKDQERDTTTSPSHILKGQEAFATL
GASSSIKCFLNQLDLPLSTLTEDMKEKMRMSTVISLRLVTTPRSRDQELSAPFPSSSHGL
QVNKGLASGVIQ

>gi568815593f:171320407_171556802|GENSCAN_predicted_CDS_1|399_bp
cctccagaaggagctgccttggacaaaggccccaggccttttgcacagttcctgttgaaa
agcataggagggaagggcatctgtgtctctcaattgcgtctaagatttaaagatcaagaa
agggacacaactacatctccctcccacattctgaagggccaggaagcctttgccacgctt
ggtgcttcttcctccatcaaatgctttctgaatcaactggatctgccgctgagcacgctc
actgaagacatgaaagagaaaatgagaatgagcacagtgatctcgttaaggcttgtaaca
accccaaggagcagagaccaagagttgtcagccccatttccaagcagcagccatgggctc
caagtgaacaagggacttgcatctggggttatccagtga

>gi568815593f:171320407_171556802|GENSCAN_predicted_peptide_2|198_aa
MSMLGLQKRPAASVLRYGKKKVWLDPNEANEIASANSRQQIRKLIKDGLIICKPVTVHSQ
AQCRKNTLAHRKGRHMGTGKRKGTANAPMPEKVTRYRDSKKINHHMYHSLYLKVKESVFK
DKQILMEHIHKLKADKAGKKLLADQAEACRPKTKEARKQREECLQAKKEGIINTLSKDEE
MKKQKLPLCLYILAFVIT

>gi568815593f:171320407_171556802|GENSCAN_predicted_CDS_2|597_bp
atgagcatgctcgggcttcagaagaggcctgccgctagtgtcctccgctatggcaagaag
aaggtctggttggaccccaatgaggccaatgaaatcgccagtgccaactcccgacagcag
atccggaagctgatcaaagatgggctgattatctgtaagcctgtgactgtccattcccag
gctcaatgccggaaaaacaccttggcccaccggaagggcaggcacatgggcacgggtaag
cgaaagggtacagccaatgccccaatgccagagaaggtcacaagataccgtgactctaag
aagatcaatcaccacatgtatcatagcctgtacctgaaggtgaaggagagtgtgttcaaa
gacaagcagattctcatggaacatatccacaagctgaaggcagacaaggccggcaagaag
ctcctggctgaccaggctgaggcctgcaggcctaagaccaaggaagcacgaaagcaacgt
gaagagtgcctccaggccaagaaggaggggatcatcaacactttgtcgaaggacgaagag
atgaagaaacaaaagctccccctttgtctgtacatactggcctttgtgattacatag

>gi568815593f:171320407_171556802|GENSCAN_predicted_peptide_3|223_aa
MPWVTAEKASALCHRAAHTPGPGPRGPEREHSERRSQQEVAQCRLRTGMRGAFGKPQGTV
ARVHIGQVIMSICTKLQNKEHVIEALHRAKFKFPGRQKIHISKKWGFIKFNANEFEDMVA
EKRLILDGCGRAELSFAAATGATPIAGHFTPGTFINQVQAAFWELRLLLQIQQSKDEWIV
LTMTRIPLGSVLGRGAISRTQTGVDDPGKVLPGPEEGSWTTSP

>gi568815593f:171320407_171556802|GENSCAN_predicted_CDS_3|672_bp
atgccgtgggttactgcagaaaaagccagtgccctctgccacagagctgcacatacacct
ggcccaggccccagagggcctgaaagagaacattcggagaggcgaagccagcaagaggtg
gcacaatgtaggctccggacaggcatgcgaggtgcctttggaaagccccagggcactgtg
gccagggttcacattggccaagttatcatgtccatctgtaccaagctgcagaacaaggag
catgtgatagaagccctgcacagggccaagttcaagtttcctggccgccagaagatccac
atctcaaagaagtggggcttcatcaagttcaatgccaatgaatttgaggacatggtggct
gagaagcggctcatcctagatggctgtgggcgggctgagctgagctttgctgctgccaca
ggagccactcctattgctggccacttcacccctggaaccttcattaaccaggtccaggca
gccttctgggagctgcgtctgttgctgcaaattcagcaatccaaggatgagtggattgtt
ctcaccatgaccagaattcccctgggatctgtgctcggcagaggtgccatctcccgaacc
cagactggagttgatgatccagggaaggtgcttccgggtcctgaagagggctcctggacc
accagcccctga

>gi568815593f:171320407_171556802|GENSCAN_predicted_peptide_4|504_aa
MAHACDSSGGVGPVYECACSVGARGVPGRSACRHPMEDSMDMDMSPLRPQNYLFGNCWGE
LERGRAGPGGGCELKADKDYHFKVDNDENEHQLSLRTVSLGAGAKDELHIVEAEAMNYEG
SPIKVTLATLKMSVQPTVSLGGFEITPPVVLRLKCGSGPVHISGQHLVAVEEDAESEDEE
EEDVKLLSISGKRSAPGGGSKVPQKKVKLAADEDDDDDDEEDDDEDDDDDDFDDEEAEEK
APVKKSIRDTPAKNAQKSNQNGKDSKPSSTPRSKGQESFKKQEKTPKTPKGPSSVEDIKA
KMQASIEKGGSLPKVEAKFINYVKNCFRMTDQEPLRQPGGRSAREEPRGALMPQGAPRSA
PEQQSLQQQQPARREQQQRRRRRRRRRRRPVPAARSGHVQAGLGAAASLPPSDVFSALRL
HLPVPLTASPSVSRRCLHFLLLCFQVQGAEGRLRVRPAPAGNRPRARGSAAGPEEALAVA
ATVGLGWEEIPQSRRAQPQQYTQL

>gi568815593f:171320407_171556802|GENSCAN_predicted_CDS_4|1515_bp
atggctcacgcatgcgacagcagcggaggggtggggccagtgtacgagtgcgcgtgctcg
gtgggagcccgcggagtacctggaaggagtgcgtgccgccacccgatggaagattcgatg
gacatggacatgagccccctgaggccccagaactatcttttcggtaactgctggggggag
ctggagcgaggccgagcggggcctggtggcggttgtgaactaaaggccgacaaagattat
cactttaaggtggataatgatgaaaatgagcaccagttatctttaagaacggtcagttta
ggggctggtgcaaaggatgagttgcacattgttgaagcagaggcaatgaattacgaaggc
agtccaattaaagtaacactggcaactttgaaaatgtctgtacagccaacggtttccctt
gggggctttgaaataacaccaccagtggtcttaaggttgaagtgtggttcagggccagtg
catattagtggacagcacttagtagctgtggaggaagatgcagagtcagaagatgaagag
gaggaggatgtgaaactcttaagtatatctggaaagcggtctgcccctggaggtggtagc
aaggttccacagaaaaaagtaaaacttgctgctgatgaagatgatgacgatgatgatgaa
gaggatgatgatgaagatgatgatgatgatgattttgatgatgaggaagctgaagaaaaa
gcgccagtgaagaaatctatacgagatactccagccaaaaatgcacaaaagtcaaatcag
aatggaaaagactcaaaaccatcatcaacaccaagatcaaaaggacaagaatccttcaag
aaacaggaaaaaactcctaaaacaccaaaaggacctagttctgtagaagacattaaagca
aaaatgcaagcaagtatagaaaaaggtggttctcttcccaaagtggaagccaaattcatc
aattatgtgaagaattgcttccggatgactgaccaagagcccctgcgccagcccggaggg
cgcagcgctcgggaggagccgcgcggggcgctgatgccgcagggcgcgccgcggagcgcc
ccggagcagcagagtctgcagcagcagcagccggcgaggagggagcagcagcagcggcgg
cggcggcggcggcggcggcggaggcgcccggtcccggccgcgcggagcggacatgtgcag
gctgggctaggagccgccgcctccctcccgcccagcgatgtattcagcgccctccgcctg
cacttgcctgtcccactgaccgcttctccatctgtttcccgcaggtgtttacacttcctg
ctgctgtgcttccaggtacagggcgcggaagggcggctccgcgtgcgccctgcgccggcg
gggaaccggccaagggcaaggggctctgccgcggggcccgaggaggcgctggctgtcgcg
gccactgtcggcctcggctgggaggagatcccgcagagccggcgggctcagccgcaacaa
tatacccagctctaa

>gi568815593f:171320407_171556802|GENSCAN_predicted_peptide_5|310_aa
MGPGMGASAQPVGANVSFTLNDSNPESVPWARVQEELGELRFQVERVADFIGNPHWRGVK
VSSLNPEFGEALKALAGPMLLLLEEPGRSKAMGGESSGQMPTRCGPQEHWSSSVPEQQDV
EGGGTQVLVAEENVDFRIHVENQTRARDDVSRKQLRLYQLYSRTSGKHIQVLGRRISARG
EDGDKYAQLLVETDTFGSQVRIKGKETEFYLCMNRKGKLVGKPDGTSKECVFIEKVLENN
YTALMSAKYSGWYVGFTKKGRPRKGPKTRENQQDVHFMKRYPKGQPELQKPFKYTTVTKR
SRRIRPTHPA

>gi568815593f:171320407_171556802|GENSCAN_predicted_CDS_5|933_bp
atgggccctggcatgggggcaagtgcccagccagtgggagccaacgtgagcttcactctc
aatgacagtaatccagagagcgtcccttgggccagagtccaggaagagttaggggagctg
cgttttcaggtggagagagtggctgactttatcgggaacccccactggagaggagtgaag
gtctccagtttgaatccagagtttggagaagctctgaaagccttggctggcccgatgctg
ctgctgctggaggagccaggaaggtcaaaggccatgggtggggagtcttccggacaaatg
cccaccaggtgtgggcctcaggagcactggagttcttctgtccctgagcagcaggatgtg
gaaggaggcgggacccaggtgctggttgccgaggagaacgtggacttccgcatccacgtg
gagaaccagacgcgggctcgggacgatgtgagccgtaagcagctgcggctgtaccagctc
tacagccggaccagtgggaaacacatccaggtcctgggccgcaggatcagtgcccgcggc
gaggatggggacaagtatgcccagctcctagtggagacagacaccttcggtagtcaagtc
cggatcaagggcaaggagacggaattctacctgtgcatgaaccgcaaaggcaagctcgtg
gggaagcccgatggcaccagcaaggagtgtgtgttcatcgagaaggttctggagaacaac
tacacggccctgatgtcggctaagtactccggctggtacgtgggcttcaccaagaagggg
cggccgcggaagggccccaagacccgggagaaccagcaggacgtgcatttcatgaagcgc
taccccaaggggcagccggagcttcagaagcccttcaagtacacgacggtgaccaagagg
tcccgtcggatccggcccacacaccctgcctag

>gi568815593f:171320407_171556802|GENSCAN_predicted_peptide_6|294_aa
MDPCGNKRLQLMEAATTARHTLTTILPVHFTDGKTEAHRGKGALLRSHRTSKAERMPAPE
LQVQCSLLHTTSHYEIQASLESSARFPYWRSANKQEKDVMEDCVEHFVGWPRSHPAGKME
GKTGQKKTRDEFAVVVQASSAENIQRCKHDDFIKVRALGDTCAGDGSPSNRGNSDLGGTL
ERTPVEEEEPLKRNEGAGMFGSVPGLSPLDASGSPPAPSCDNWGVGDNQRCLHALPKWPW
GTKSPQSWKPQPGSQAVLGQQGRALPPSGPQFGGLMFVTDLQANCMVGEQPSHL

>gi568815593f:171320407_171556802|GENSCAN_predicted_CDS_6|885_bp
atggacccatgtggcaacaagagactgcagctgatggaagctgccaccaccgccagacac
accctcactaccatcctcccagtccattttacggatggaaaaaccgaggcccacagaggt
aaaggcgctctcctgcggtcacacaggacatccaaggcagaacgcatgccagcacctgag
ctccaggtccagtgctctctgctccacactacaagtcattatgagatccaggcctccttg
gagtcttctgctagattcccctactggaggtcggccaacaagcaagagaaggacgtcatg
gaggactgcgtggaacattttgtgggctggcccagaagtcaccctgctggaaagatggaa
ggcaagacagggcaaaagaagactagagatgagtttgcggtggtggtccaggcatcatct
gcagagaatattcaaagatgtaaacatgatgatttcatcaaggtccgcgcgttaggggac
acatgtgcaggcgatggcagcccatcaaaccgtggaaattctgacctaggaggtacattg
gagaggacaccggttgaagaagaggagcccctcaagaggaatgaaggggccgggatgttc
ggcagcgtccctggactctctccactagatgccagtggcagtcccccggctcccagttgt
gacaactggggagttggagacaaccaacgttgtctccacgcactgccgaaatggccctgg
ggaacaaaatcgccccagagctggaaacctcagccaggaagccaggccgtcttgggacag
caagggcgagccttgccgccatctggtccccagttcggcggcctaatgtttgtaacagac
ctgcaagctaattgcatggttggggagcagccttcacatctttag

>gi568815593f:171320407_171556802|GENSCAN_predicted_peptide_7|72_aa
MNENNNEWKHIREAGKLPIKGQKKKEEEKEGEKEEEEEEEEEEEEEEEEEEEEGEGEGEG
EEEGEEEEEGGT

>gi568815593f:171320407_171556802|GENSCAN_predicted_CDS_7|219_bp
atgaatgaaaacaacaatgaatggaaacacatcagagaagctggcaaacttcctataaag
ggccagaaaaagaaagaagaagaaaaagaaggggagaaggaggaagaggaagaggaggag
gaggaagaggaagaggaggaagaggaagaagaagaagaaggagaaggagaaggagaagga
gaagaagaaggagaagaagaagaagaaggcggcacttag

>gi568815593f:171320407_171556802|GENSCAN_predicted_peptide_8|260_aa
MTHPQGKDIINMDELRALLDGVLNLLNYWSCLIMVQPYNRINTENDVKEACLLTWKVIHN
ILLINKKYKSLYHRYIHTYVHHSTIDSIKDMEPTPTSIVNQIKKMWYIYTMKHYAAIKKE
IMSFAATWIQLEAIILIELMREQKPKYHKLSHEFLLGTLHSLRSPPTASLAIALGRFDTL
TDIPLNVLAPLSWTSSSPVPLLHPESAIRTSHHRNCTTSTIKDPISTFFTGSPLSVSGAR
ELLYCPLRPQTVLKHISLCL

>gi568815593f:171320407_171556802|GENSCAN_predicted_CDS_8|783_bp
atgacacatccccagggtaaggacatcatcaacatggatgaactcagggctctgttagat
ggagtcctgaatcttctgaattactggtcatgcttaattatggtacaaccatacaataga
ataaacactgaaaatgatgttaaagaagcatgtctattgacatggaaagtcattcacaat
atattgctaattaacaagaaatataaatcattgtaccacagatacattcacacgtatgtt
catcatagcactattgacagtatcaaagacatggaaccaacaccgacatcaatagtgaac
cagataaagaaaatgtggtacatatataccatgaaacactatgcagccataaaaaaggag
atcatgtcctttgcagcaacatggatacagctggaggccattatcctaatcgaattaatg
cgagaacagaaacccaaataccacaagctctcacatgaattcctcttgggcactcttcac
tccctgaggagccccccaactgcatctcttgccattgccctgggtcgctttgacaccctc
acagacatccctctcaatgtcctggcccctctgtcttggacttcctcgtcaccagtgccc
ctcctccaccccgaatcagccatcaggactagtcatcacagaaactgcaccacttccacc
atcaaggacccaatcagtaccttcttcaccggcagccctctctctgtctctggggctcgg
gagctcctgtactgcccgctccgtccccagactgttctcaaacacatctctctctgcctc
tga

>gi568815593f:171320407_171556802|GENSCAN_predicted_peptide_9|187_aa
MNLSVLIEDAKSRFCHYKVAQSTPGNELTLDSGNELEHCSSPGQIQDTSETQRGQMTYPR
SHCEDAVDSDLNTASHHPEEVVQKENVSVCTHDSLQRNLESIDTERQSLIAPQPQAPREL
SHLVPPGPALFPLPALSGITFQINMCAHILLSESVVCGLEGVLTKANSPSSEVTFQVFQE
ASLTCQA

>gi568815593f:171320407_171556802|GENSCAN_predicted_CDS_9|564_bp
atgaatctgtctgttctcattgaggatgcaaaatcaagattttgccactacaaggtggca
caaagtaccccgggcaatgaattgaccctggacagtggcaatgagctcgagcactgttca
tcccctgggcaaattcaggacacgtctgagacccagaggggccaaatgacttacccaaga
tcacattgtgaggacgcagtggactcggatttgaacacagcttcacaccaccctgaggaa
gtggttcagaaagaaaatgtcagtgtttgtactcatgacagtcttcaaagaaatttggaa
agcatagacactgaacggcagtccttaattgccccccaaccccaggccccaagagaactg
agccacctggtgcctccaggacctgccctgttccccttgccagcactttctggaattacc
ttccaaataaacatgtgtgctcacatcctcctttcagaatcagttgtgtgcgggctggag
ggggtcctgactaaagccaactcacccagctcagaggttaccttccaggtcttccaagaa
gcttccctgacctgccaagcctga