GENSCAN 1.0 Date run: 3-Nov-116 Time: 03:44:24 Sequence gi568815596f:33415137_33659031 : 243895 bp : 40.47% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4986 5067 82 2 1 77 76 87 0.894 7.58 1.02 Intr + 15987 16097 111 1 0 99 47 57 0.534 2.13 1.03 Term + 16589 16710 122 0 2 140 42 35 0.644 1.76 1.04 PlyA + 18071 18076 6 1.05 2.04 PlyA - 18184 18179 6 1.05 2.03 Term - 22571 22504 68 0 2 55 37 61 0.212 -5.18 2.02 Intr - 24590 24456 135 0 0 103 47 90 0.475 6.02 2.01 Init - 25246 25120 127 1 1 88 91 36 0.440 4.27 2.00 Prom - 42879 42840 40 -2.35 3.03 PlyA - 43028 43023 6 1.05 3.02 Term - 43605 43493 113 1 2 67 54 70 0.043 -0.76 3.01 Init - 58865 58670 196 2 1 71 4 171 0.087 6.14 3.00 Prom - 60036 59997 40 -5.45 4.05 PlyA - 60046 60041 6 1.05 4.04 Term - 63389 62938 452 0 2 19 36 203 0.315 2.56 4.03 Intr - 67412 67285 128 1 2 63 92 84 0.291 5.70 4.02 Intr - 90212 89960 253 0 1 25 15 253 0.329 7.27 4.01 Init - 94560 94527 34 0 1 76 91 15 0.560 0.68 4.00 Prom - 97895 97856 40 -7.55 5.00 Prom + 99640 99679 40 -5.15 5.01 Init + 100001 100070 70 1 1 77 64 67 0.557 4.46 5.02 Intr + 104542 104588 47 2 2 29 95 35 0.420 -4.49 5.03 Intr + 104733 104878 146 2 2 60 84 112 0.982 6.16 5.04 Intr + 105417 105548 132 0 0 72 66 142 0.982 9.24 5.05 Intr + 106819 106966 148 1 1 82 52 46 0.632 -0.28 5.06 Intr + 108743 108916 174 1 0 66 95 73 0.928 5.01 5.07 Intr + 112001 112276 276 1 0 77 113 279 0.997 26.09 5.08 Intr + 119187 119264 78 2 0 72 86 100 0.667 7.03 5.09 Intr + 123958 124074 117 0 0 56 107 58 0.505 4.24 5.10 Intr + 124131 124187 57 2 0 62 91 48 0.546 0.66 5.11 Intr + 128376 128491 116 2 2 100 89 62 0.846 5.83 5.12 Intr + 134468 134615 148 2 1 99 78 50 0.980 4.42 5.13 Intr + 140395 140431 37 0 1 117 94 17 0.680 2.12 5.14 Intr + 143075 143200 126 0 0 60 95 48 0.602 2.43 5.15 Intr + 143536 143894 359 2 2 115 107 340 0.701 32.65 5.16 Term + 151717 151869 153 0 0 36 47 142 0.359 1.84 5.17 PlyA + 152867 152872 6 1.05 6.10 PlyA - 153070 153065 6 1.05 6.09 Term - 153685 153568 118 0 1 71 54 165 0.826 8.43 6.08 Intr - 170244 169607 638 0 2 41 29 496 0.009 28.96 6.07 Intr - 171542 171426 117 2 0 79 81 124 0.991 10.54 6.06 Intr - 172184 172104 81 2 0 37 92 81 0.614 2.32 6.05 Intr - 173383 173199 185 2 2 70 69 148 0.940 9.69 6.04 Intr - 177078 176944 135 1 0 46 89 139 0.523 9.42 6.03 Intr - 180501 180353 149 2 2 49 98 97 0.674 5.76 6.02 Intr - 184268 184033 236 2 2 26 90 306 0.790 20.16 6.01 Init - 187141 187022 120 1 0 35 50 105 0.516 1.64 6.00 Prom - 189283 189244 40 -4.35 7.00 Prom + 189319 189358 40 -6.45 7.01 Init + 195571 195668 98 0 2 93 101 41 0.405 5.75 7.02 Intr + 197619 197713 95 0 2 43 90 117 0.380 6.09 7.03 Intr + 217475 217541 67 0 1 120 93 38 0.605 4.54 7.04 Intr + 217941 218104 164 0 2 44 70 68 0.717 -0.70 7.05 Intr + 221379 221599 221 1 2 49 40 297 0.301 18.10 7.06 Intr + 221744 221992 249 1 0 -55 40 255 0.140 3.11 7.07 Term + 234696 234830 135 2 0 78 38 103 0.455 1.34 7.08 PlyA + 235007 235012 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:33415137_33659031|GENSCAN_predicted_peptide_1|104_aa MVKTDDLPYKQLQIFSLPRILSSEKFGVLFYKDVNSDTLVITALCMNTARICGIPKCLER LGKEGQSSIVSDRKQGALTIPLLVVTALIPHLITGRTCDLLLAS >gi568815596f:33415137_33659031|GENSCAN_predicted_CDS_1|315_bp atggtgaagacagatgatctgccttacaagcagctgcagatcttttccttacctcgaatc ctttcttcagagaaatttggagtattgttctataaagatgtcaacagtgacacactggtt atcacagcactttgtatgaatacagccagaatttgtggcattccaaaatgtcttgagagg cttggaaaggaaggtcagtcatccattgtgtcagacagaaaacagggggccctcacaatc ccactcttggtagtcacagccttgatccctcacctgatcacaggaaggacttgtgacttg cttctagccagctga >gi568815596f:33415137_33659031|GENSCAN_predicted_peptide_2|109_aa MDMLLNLSDLTFLNYKMWLAMEPADRWLQRQKEIMEIKGLTSSSFPLLLSSWLTLSSLEE LPDLGNQNTGPPAKCEFLINSKYYFRIRKKTASGIHTEPEPSPGLLCRF >gi568815596f:33415137_33659031|GENSCAN_predicted_CDS_2|330_bp atggacatgttacttaatctctctgacctcacttttctcaactacaaaatgtggctggca atggaacctgccgatagatggttgcagaggcagaaggagataatggagataaaaggcttg acatcgagctcattccctctgctgctgtcaagctggctgactctcagttcccttgaagag ttgccagatttgggaaatcagaatacagggcccccagctaagtgtgagtttctgataaac agcaaatactattttcgtataagaaagaagacggccagtgggatccacactgagcctgag ccgagtcctggcctcctgtgccggttctag >gi568815596f:33415137_33659031|GENSCAN_predicted_peptide_3|102_aa MNTIVNCAREGSTLHAPYESLMPDDLRWNSFILKSSPSPHSRSMEKLSSMKLVPGAKYIR DHWCRACNGAGGKRVSYLLTCFYPIFCTVALVTEEPALPHTI >gi568815596f:33415137_33659031|GENSCAN_predicted_CDS_3|309_bp atgaacactattgtgaactgtgcacgcgagggatctacactgcatgctccttatgagagt ctaatgcctgatgatctgaggtggaacagtttcatcctgaaatcatctccatcccctcat tcccggtccatggaaaaattgtcttccatgaaactggtccctggtgccaaatacattagg gaccactggtgtagagcatgcaatggtgctggtggaaaaagggtcagttaccttctcact tgtttctaccctattttctgcactgtggccttggtaactgaagagcccgctcttccacat accatctga >gi568815596f:33415137_33659031|GENSCAN_predicted_peptide_4|288_aa MTIKYLMEHRAAWVTRAKLHLKEKKKKNPVCGPNALEVPGLKFPEIKSHSLTPLLFHLIL SGFDADNNGGGGGGGGDDDDDDGDDMEEEEEEESFRNKGAILRQNFQECRWFCWARSVHT EEQTAVFIPQETTKLSLKDIPFYYGRLDDSGDTISPEGSLPLSTCGKSETIPGTNSLKND SGGTPSPSSQTGKSQTTWPRADPRTSHKCHTEEQRSWDYFSSLATLHFVDSSTFEGKEGG ELLNVRLAICLDRASETASIREHWLVRYSRPTVLRETRSQKCRALGHF >gi568815596f:33415137_33659031|GENSCAN_predicted_CDS_4|867_bp atgacaataaaatatcttatggagcacagagcagcctgggtaacaagagcaaaactccat ctcaaagaaaaaaaaaaaaaaaatcctgtctgcggtcctaatgctctggaagtacctggt ctcaaattccctgaaataaagtcacattcccttactcctttgttgttccacttaattctg agtggatttgatgctgataataatggtggtggtggtggtggtggtggtgatgatgatgat gatgatggtgatgatatggaggaggaggaagaggaggaatcatttaggaataaaggtgct atactcaggcagaactttcaagaatgccgatggttttgttgggcaaggagcgtgcacact gaagagcagactgcagttttcatacctcaggaaacaaccaaattgtctctaaaagacatt ccattttattatgggaggctcgatgattccggagacacaatttcaccagagggttctctt ccactatcaacctgtggaaaatctgaaactataccaggaacaaactccctaaagaatgac tcaggaggaaccccctcccccagcagtcagacagggaagtcacagaccacatggccgcgg gctgacccaaggacctcacacaaatgccataccgaggaacagagaagttgggactatttt tcctcattggccactctacattttgtggattcttctacatttgaaggaaaagaaggtgga gaacttctcaatgtcagactagccatttgtttagacagggcatccgaaactgcttccatc agagaacattggttggtgcggtactcaagacccactgtgctgagggaaactcgatcgcag aagtgccgtgccctgggtcatttttaa >gi568815596f:33415137_33659031|GENSCAN_predicted_peptide_5|727_aa MGSSGLGKAATLDELLCTCIEMFVGSYWLISVEEGKTNPELVYIEGTAAGVCKGKEVQGF FFFNWFTYRNATGESCNEFRLKICYFMRYWILKFPAEFNLDLGLIRMTEEFREVASQLGY EKHVSLIDISSIPSYDWMRRVTQRKKVSKKGKACLLFDHLEPIELAEHLTFLEHKSFRRI SFTDYQSYVIHGCLENNPTLERSIALFNGISKWVQLMVLSKPTPQQRAEVITKFINVAKN WNEMTELVSSNGNYCNYRKAFADCDGFKIPILGVHLKDLIAVHVIFPDWTEENKVNIVKM HQLSVTLSELVSLQNASHHLEPNMDLINLLTLSLDLYHTEDDIYKLSLVLEPRNSKSQPT SPTTPNKPVVPLEWALGVMPKPDPTVINKHIRKLVENVLRLSRNLMEPLKTTLWQSVFRN YDHDHDGYISQEDFESIAANFPFLDSFCVLDKDQDGLISKDEMMAYFLRAKSQLHCKMGP GFIHNFQEMTYLKPTFCEHCAGFLWGIIKQGYKCKDCGANCHKQCKDLLVLACRRFARAP SLSSGHGSLPGSPSLPPAQDEVFEFPGVTAGHRDLDSRAITLVTGSSRKISVRLQRATTS QATQTEPVWSEAGWGDSGSHTFPKMKSKFHDKAAKDKGFAKWENEKPRVHAGVDVVDRGT EFELDQDEGEETRQDGENLSDMQFIHRSMKKELLGVFYKATGLNTSKIKYHEQKKEGKGN FGSEKIE >gi568815596f:33415137_33659031|GENSCAN_predicted_CDS_5|2184_bp atgggatcaagtggccttgggaaagcagcaacattagatgaactgctgtgcacttgcatt gagatgtttgttggcagttattggttaataagtgtggaggaaggcaaaactaatcctgaa ttggtatacattgagggcacagctgcaggggtgtgcaaaggaaaggaggtgcaaggattt tttttctttaactggtttacgtatcgaaatgccactggagaaagctgcaatgaatttcga ttaaagatctgctacttcatgaggtactggattctgaagtttcctgcagagtttaatttg gatcttggtttgattcgtatgactgaggaatttcgggaagtagctagtcaactaggatat gaaaaacacgtcagcctcatcgacatatccagcattccttcctatgactggatgagaaga gtcacacagaggaaaaaagtatccaagaagggaaaagcctgtctgctgtttgaccatctg gagcccattgaattggctgagcacctcacttttctggagcataaatcttttagaaggatc tcattcactgattaccaaagctatgtcatccatggctgcctggagaataatccaaccttg gaaagatcgattgctttatttaatggaatctctaagtgggtccagttgatggttcttagc aaaccaaccccccagcaaagggcagaagtcatcacaaagtttatcaatgttgcaaagaac tggaatgaaatgacagagttggtctcctccaacggcaattactgcaattaccgcaaggcc tttgccgactgcgatggcttcaaaatccccatccttggagtacacttgaaagacttgata gctgtccatgtcattttcccagactggacagaggagaacaaagtgaacattgtgaaaatg caccagctctccgttaccctgagtgaactagtctccctgcagaatgcctctcaccactta gaacccaacatggatttgatcaacctgctcacgctttccctggacctctatcacactgaa gatgatatttacaaactgtcactggtgctggagcctagaaattctaaatcgcagcctacc tcccctacgacgcccaacaagcctgtggtacccctggagtgggcattaggggtgatgcca aagccagaccccacggtcatcaacaagcacataaggaaattagtggagaatgttctgcga ttgtccaggaatctgatggaacccctgaaaacaaccctctggcagtctgtatttagaaac tatgatcacgaccatgatgggtacatttcccaagaggactttgaaagtatagctgccaat tttcccttcttggattccttctgtgttctggacaaagatcaggatggcctaattagtaaa gatgaaatgatggcttacttcctgagagctaaatcccaactacactgtaaaatgggacca ggatttatccataattttcaggagatgacctatctcaagccaaccttctgcgaacactgt gcgggatttctctggggcataatcaagcaaggatacaaatgcaaagactgtggagccaat tgtcacaaacagtgcaaagacctcctggttctggcctgcaggagatttgcccgggcgccc tccttgagcagtggtcatgggtcactgcctggaagcccctcgctgcccccagcgcaggat gaggtgtttgagttccctggagtcactgctggacacagggatttagacagcagagccatc acactggttacaggctcttctcgcaagatctctgtgaggctacagagggccaccaccagc caggccacccagactgaacctgtctggtcagaggctggctggggggactcggggtcccac accttccctaaaatgaaatccaagttccatgacaaagcagcaaaggacaaaggctttgcc aaatgggaaaatgagaagcccagggtgcatgctggtgtggatgttgtagaccggggcacg gagtttgaacttgaccaggatgaaggagaagagaccagacaggatggtgagaatctaagt gacatgcagtttatccacaggagcatgaagaaggaattacttggagtattctacaaagca actggcctgaacacttcaaaaataaaatatcacgaacagaaaaaggagggaaagggcaat tttggatcagaaaagattgaatga >gi568815596f:33415137_33659031|GENSCAN_predicted_peptide_6|592_aa MTLRIPDDFMFQLQSEERVELARIAISQKRQQQTEQVTYMWAGVGRSVGPSFLRTLRTCA VRELRRQLWIWKFRGSRACASVRVAGEEGSLTTRKFEYHSSMECDLMETDILESLEDLGY KGPLLEDGALSQAVSAGASSPEFTKLCAWLVSELRVLCKLEENVQATNSPSEAEEFQLEV SGLLGEMNCPYLSLTSGDVTKRLLIQKNCLLLLTYLISELEAARMLCVNAPPKKAQEGGG SEVFQELKGICIALGMSKPPANITMFQFFSGIEKKLKETLAKVPPNHVGKPLLKKPMGPA HWEKIEAINQAIANEYEVRRKLLIKRLDVTVQSFGWSDRAKRCHRGRRGKMAPSSKQEAE EEGEVAMNIPHTEDEEVMNKEAGEVDVVAMTMVAEGEEEEISIKEAGQMEGVVEEVATKM VVIEIQVSSQVAIMVATAVVAIKAEVMVASKHLLHIQEVDTRVVATSRTIDTKMAGTMVI VVVVVVGEVVVEAEVVVQAREEAGEEEGARIITKGVNLNSISSMEVISIIILDLDREDIT LVEATEPYILLELKSSSQVQYPTSIISASITWELRNASSQAPPQTNRIRNCE >gi568815596f:33415137_33659031|GENSCAN_predicted_CDS_6|1779_bp atgacacttcggataccagatgatttcatgtttcaactgcagtctgaagagagggttgag ttggccaggattgcaatcagccaaaagagacagcaacaaactgaacaggtcacctacatg tgggcaggggtggggcggtcagtgggaccaagtttcttgcgcacgctccggacatgcgca gtgcgggagctgaggcgccagctgtggatttggaagttccggggaagtcgcgcatgcgcg agtgtacgcgttgccggcgaagaggggagcctgacgactcggaaatttgaataccacagt agcatggagtgtgacctcatggagactgacatcttggagtcgttggaagatctaggttac aagggcccattgttggaagatggagcgctctctcaggcagtctctgctggagccagttcc cccgagtttaccaaactctgtgcttggctggtgtctgaattaagagtgctctgtaaacta gaggaaaacgtgcaagcaactaacagtccgagtgaagctgaagaattccagcttgaggtg agtgggctactaggggagatgaactgcccgtatctttcactgacatctggggatgtgacc aagcgccttctcattcagaagaactgcctcctcttgctcacatacctcatctcagaacta gaagctgccagaatgctctgtgtgaatgctcctccaaaaaaagctcaagaaggaggcggt agtgaggtctttcaagagttgaaaggcatatgtattgctctaggaatgtccaaacctcca gccaatataactatgttccaattcttcagcgggattgaaaaaaaattaaaggaaacatta gcaaaagttccacctaatcatgtgggaaagcctttactgaagaagccaatgggaccagcc cactgggaaaagatagaagcaattaaccaagccatagccaatgaatatgaagtccggaga aagctgctaataaaacgtttggatgtcactgtacaatcctttggctggtctgacagagct aagagatgccaccgtggcagaagaggcaagatggcccccagcagcaaacaggaggccgag gaggagggagaggtggctatgaacattcctcatacggaggacgaggaggtcatgaacaag gaggcgggagaggtggacgtggtggctatgaccatggtggccgagggggaggaagaggaa ataagcatcaaggaggctggacagatggagggagtggtggaggaggtggctaccaagatg gtggttatcgagattcaggtttccagccaggtggctatcatggtggccacagcagtggtg gctatcaaggcggaggttatggtggcttccaaacatcttcttcatatacaggaagtggat accagggtggtggctaccagcaggacaatagataccaagatggcgggcaccatggtgatc gtggtggtggtcgtggtgggcgaggtggtcgtggaggccgaggtggtcgtgcaggccagg gaggaggctggggaggaagagggagccagaattatcaccaagggggtcaatttgaacagc atttccagcatggaggttatcagtataatcattctggatttggacagggaagacattaca ctagttgaggctaccgaaccttacattttgctagagctcaaaagcagttctcaagttcag tacccaactagcatcatcagcgccagcatcacctgggaacttagaaatgcaagttctcag gccccaccccagaccaaccgaatcagaaactgtgaatga >gi568815596f:33415137_33659031|GENSCAN_predicted_peptide_7|342_aa MVAAAPKLGQAFRSLTWDSWLHRFQGMEPWPMRYGTVNRAIKSLSILLEAAEKGDPETWY TSKKGQESFEGIKEVIRDLSMVTIGSRIEQTQFRGQKIKDSAKEGPEVRADSRREAACLF GKKNLPTLAYDLWSYLFFSCFKSPKEPEQLQKLFIGELSFETTDESLRSHSEQWGMLMDC VVMRDPNTKCSRGFGFVTYATVEEVDAAMNARPHKYGKIEVIEIMTDRGSGKKRAFAFAT FDNHESVDKIGIQKHHTMNGHSCKVRKALSKQEMASASPAKEVKVVLETFVVVVEVVLEK LAIDLHPILWEWKYSITYSEFEDIPFPYTTELPHFVLPMPIK >gi568815596f:33415137_33659031|GENSCAN_predicted_CDS_7|1029_bp atggtggcagctgctcccaagttggggcaggcctttcgttctctgacctgggattcttgg cttcacagattccaaggaatggaaccttggcccatgcgctatggcacagtgaacagggcc atcaaaagcctctccattcttctggaagctgcagagaaaggggacccagaaacctggtat accagcaaaaaggggcaggaatcctttgaagggataaaagaagtaatcagagacctgtcc atggtgacaatagggtcaagaattgaacagacacagttcagaggccaaaagatcaaagat tcagcaaaagaaggccccgaagtgcgtgctgattccagaagagaggctgcttgtctgttt gggaagaaaaacctcccaactcttgcttatgatctctggtcctatctctttttctcctgc ttcaagtctccaaaagagcccgaacagctgcagaagctctttattggagagctgagcttt gaaacaactgatgagagcctgaggagccattctgagcaatggggaatgctcatggactgt gtggtcatgagagatccgaacaccaagtgctccaggggctttgggtttgtcacatatgcc actgtggaggaggtggatgcagccatgaatgcaaggccacacaagtatggaaaaattgaa gtgattgaaatcatgactgaccgaggcagtggcaagaaaagggcctttgcctttgcaacc tttgacaaccatgagtccgtagataagattggcattcaaaaacaccatactatgaatggc cacagctgtaaagttaggaaagccctgtcaaagcaagagatggctagtgcttcaccagcc aaagaggtcaaagtcgttttggaaacttttgtggtggttgtggaggtggttttggagaaa ctggccattgaccttcatcccatcctttgggaatggaaatacagcatcacgtattctgaa tttgaggacattccattcccatataccacggaattgccccattttgtgctcccaatgcca attaagtag