GENSCAN 1.0 Date run: 4-Nov-116 Time: 02:50:54 Sequence gi568815584f:49793737_49994261 : 200525 bp : 42.45% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.19 Intr - 2208 2055 154 2 1 66 92 145 0.881 11.85 1.18 Intr - 6960 6684 277 1 1 70 86 368 0.967 30.45 1.17 Intr - 8837 8717 121 2 1 56 36 104 0.593 1.15 1.16 Intr - 12397 12285 113 2 2 49 90 55 0.366 0.98 1.15 Intr - 18935 18813 123 0 0 22 43 115 0.015 0.04 1.14 Intr - 21121 21018 104 0 2 66 96 35 0.020 1.00 1.13 Intr - 32219 32131 89 2 2 52 111 35 0.075 0.05 1.12 Intr - 34618 34555 64 0 1 72 79 74 0.617 2.70 1.11 Intr - 35071 34880 192 0 0 20 106 243 0.975 16.99 1.10 Intr - 35526 35318 209 0 2 59 115 134 0.989 10.15 1.09 Intr - 39760 39687 74 2 2 68 64 54 0.877 -0.79 1.08 Intr - 40713 40627 87 1 0 49 95 85 0.594 4.32 1.07 Intr - 44470 44403 68 0 2 70 69 48 0.802 -1.27 1.06 Intr - 47130 46982 149 0 2 71 98 189 0.981 16.31 1.05 Intr - 50723 50661 63 2 0 62 83 57 0.571 0.60 1.04 Intr - 52529 52404 126 2 0 93 110 114 0.999 14.06 1.03 Intr - 57929 57827 103 1 1 89 92 115 0.999 11.26 1.02 Intr - 58139 58071 69 1 0 95 69 67 0.268 2.88 1.01 Init - 59017 58959 59 1 2 91 60 94 0.275 8.00 1.00 Prom - 62412 62373 40 -2.75 2.04 PlyA - 65244 65239 6 1.05 2.03 Term - 68580 68307 274 2 1 80 43 260 0.961 14.66 2.02 Intr - 74249 74151 99 1 0 134 85 -4 0.279 2.21 2.01 Init - 83338 83310 29 1 2 69 106 21 0.182 1.12 2.00 Prom - 85092 85053 40 -7.45 3.00 Prom + 93352 93391 40 -6.65 3.01 Init + 100001 100446 446 1 2 94 84 752 0.694 70.93 3.02 Term + 117156 117288 133 0 1 50 48 129 0.693 1.68 3.03 PlyA + 117455 117460 6 1.05 4.00 Prom + 117782 117821 40 -6.55 4.01 Init + 142993 143340 348 0 0 49 57 217 0.068 11.93 4.02 Intr + 149759 149960 202 1 1 46 71 79 0.021 -0.06 4.03 Intr + 158744 158882 139 0 1 87 45 100 0.035 4.20 4.04 Intr + 164180 164234 55 2 1 77 94 -3 0.014 -2.74 4.05 Intr + 170117 170335 219 1 0 101 69 167 0.176 13.68 4.06 Intr + 172653 172685 33 2 0 87 80 39 0.013 0.50 4.07 Intr + 185864 185945 82 1 1 63 78 81 0.052 2.89 4.08 Term + 186671 186942 272 0 2 81 43 119 0.335 1.36 4.09 PlyA + 187500 187505 6 1.05 5.06 PlyA - 187960 187955 6 1.05 5.05 Term - 188568 188469 100 2 1 119 55 113 0.505 7.72 5.04 Intr - 194850 194617 234 2 0 91 52 117 0.452 4.28 5.03 Intr - 196597 196398 200 1 2 93 62 98 0.563 4.93 5.02 Intr - 198028 197900 129 1 0 29 12 146 0.388 1.07 5.01 Intr - 199137 199042 96 0 0 99 95 73 0.258 8.39 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:49793737_49994261|GENSCAN_predicted_peptide_1|748_aa MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDFKATLLLESGIRIHT TEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHLIIELYDRG SSTPVPQTGTGLWNQATRQEGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHAR AAEPLLTLERLTEIVASAPKGELLKRVLNPLLPYGPALIEHCLLENGFSGNVKVDEKLET KDIEKVLVSLQKAEDYMKTTSNFSGKEIDKLKGELIEMNLQIVDRAIQVVRSALANQIDW TEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLSEEEDDDVDGDVNVEKNET EPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYYDHKRYAAKKTQKTVEAAE KAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSENYLIIGGRDQQQNEIIVK RYLTPVINKQWNKEEITKDATKHFEMNEMKIQCVKTYEMQEEQPLGGEPIPPRTLTEAGT MALCYSAAWDARVITSAWWVYHHQVDESCVWRHQGERKVRVQDEDMETLASCTSELISEE MEQLDGGDTSSDEDKEEHETPVEVELMTQVDQEDITLQSGRDELNEELIQEESSEDEGEY EEVRKDQDSVGEMKDEGEETLNYPDTTIDLSHLQPQREMKKKKLPSDSGDLEALEGKDKE KESTVHIETHQNTSKNVAAVQPMKRGQK >gi568815584f:49793737_49994261|GENSCAN_predicted_CDS_1|2244_bp atgaagagccgctttagcaccattgacctccgcgccgtactcgcggagctgaatgctagc ttgctaggaatgagagtaaacaatgtttatgatgtggataataagacataccttattcgt cttcaaaaaccggactttaaagctacacttttacttgaatctggcatacgaattcataca acagaatttgagtggcctaagaatatgatgccgtctagttttgccatgaagtgccgaaaa catttgaagagtcggagattagtcagtgcaaaacagcttggtgtggatagaattgtagat tttcaatttggaagtgatgaagctgcttaccatttaatcattgagctctatgatagggga tcttcaacccctgtgccgcagactggaactggtctgtggaaccaggccacacggcaggag gggaacattgttcttacagattatgagtacgtaattttaaatattctaaggtttcgaact gatgaggcagatgatgttaaatttgctgttcgtgaacgctatccacttgatcatgctaga gctgctgaacctttgcttactttggaaaggttgactgaaatagtagccagcgcacctaag ggtgaactactgaagagggtgcttaacccattacttccctatggaccagctctcattgaa cactgtcttttagaaaatggattctcgggtaatgtcaaagtggatgaaaaacttgaaact aaagatattgaaaaagtacttgtttctctgcagaaagcagaagactatatgaaaacaaca tccaacttcagtgggaaggaaatagacaaactgaaaggagagctcatagaaatgaaccta caaatagttgacagagccattcaggtagttcgaagtgctttagctaaccagatagattgg acagaaattgggttaattgtgaaagaagcccaggctcaaggagaccctgttgcaagtgca atcaaagaattaaaactacaaacaaaccatgttacaatgctgctaagaaatccatacttg ttatcagaggaggaagatgatgatgttgatggtgacgtcaatgttgagaaaaatgaaact gaaccaccaaaaggaaaaaagaaaaaacaaaagaataaacagctgcagaagcctcagaaa aataagcccttacttgtagatgttgatctcagcttgtcagcatatgccaatgccaaaaag tattatgatcacaagagatatgctgctaagaaaacacaaaagactgttgaagctgctgag aaggcattcaagtcagcagaaaagaaaacaaagcaaacattaaaagaagttcagactgtt acctctattcaaaaagcaagaaaagtatattggtttgagaaatttctgtggttcattagc tcagagaactatctaattataggtggacgagatcagcaacagaatgaaataattgtgaaa agatacttgacaccagttataaataagcaatggaacaaagaggaaatcacaaaagatgcc acaaaacactttgagatgaacgaaatgaaaatacagtgtgtcaaaacttatgagatgcaa gaagagcagcccttagggggagaacccatccccccacggaccttgactgaagctggcaca atggcactttgctacagtgctgcttgggatgcacgagttatcactagtgcttggtgggtg taccatcatcaggtagatgagtcttgtgtttggagacatcagggtgaacgaaaagtcaga gtacaggatgaagacatggagacactggcaagttgtacaagtgaactcatatcagaagaa atggaacaattagatggaggtgacacgagcagtgatgaggataaagaagaacatgaaact cctgtggaagtagaactcatgactcaggttgaccaagaggatatcactcttcagagtggc agagatgaactaaatgaggagctcattcaggaagaaagctctgaagacgaaggagaatat gaagaggttagaaaagatcaggattctgttggtgaaatgaaggatgaaggggaagagaca ttaaattatcctgatactaccattgacttgtctcaccttcaaccccaaagggaaatgaaa aagaaaaaacttccaagtgactcaggagatttagaagcgttagagggaaaggataaagaa aaagaaagtactgtacacattgaaactcatcagaacacaagcaaaaatgttgcggctgtg cagccaatgaaacgaggacaaaag >gi568815584f:49793737_49994261|GENSCAN_predicted_peptide_2|133_aa MLLNNPWAKKCVGPPPRVFSHGPAIRACAAQVQPQLCLVPLPCAKAHQLTGCLGICNRLI AAPYRRIFKTSLRKAVITFGANRGCSFTHRQSEKNGRRITESKKYVLLKNFQVLPAAVQD AACSYQHYSPALS >gi568815584f:49793737_49994261|GENSCAN_predicted_CDS_2|402_bp atgcttctgaataatccatgggccaaaaagtgcgttgggccccctcctcgggtgttctct cacggcccagcaatcagggcctgtgcggctcaagttcagcctcagctttgtctggtaccc cttccctgcgcgaaagctcaccagttgacgggatgcttggggatttgcaaccgcctcatc gctgccccctaccggcgcatattcaaaacgtctctgcgaaaagctgttattaccttcgga gccaacaggggctgctcgtttacccacagacagtcggaaaaaaatggccgccggatcaca gagagcaaaaagtacgttctcttaaagaatttccaagttcttcccgcggccgttcaggat gctgcttgcagttaccaacattattcccctgcactgagctga >gi568815584f:49793737_49994261|GENSCAN_predicted_peptide_3|192_aa MGKVLSKIFGNKEMRILMLGLDAAGKTTILYKLKLGQSVTTIPTVGFNVETVTYKNVKFN VWDVGGQDKIRPLWRHYYTGTQGLIFVVDCADRDRIDEARQELHRIINDREMRDAIILIF ANKQDLPDAMKPHEIQEKLGLTRIRDRNCNKDAVGGHYPKRINTETENLMPHILTYKWEL NTGYTQTQRWEQ >gi568815584f:49793737_49994261|GENSCAN_predicted_CDS_3|579_bp atggggaaggtgctatccaaaatcttcgggaacaaggaaatgcggatcctcatgttgggc ctggacgcggccggcaagacaacaatcctgtacaagttgaagctgggccagtcggtgacc accattcccactgtgggtttcaacgtggagacggtgacttacaaaaatgtcaagttcaac gtatgggatgtgggcggccaggacaagatccggccgctctggcggcattactacactggg acccaaggtctcatcttcgtagtggactgcgccgaccgcgaccgcatcgatgaggctcgc caggagctgcaccgcattatcaatgaccgggagatgagggacgccataatcctcatcttc gccaacaagcaggacctgcccgatgccatgaaaccccacgagatccaggagaaactgggc ctgacccggattcgggacaggaactgcaacaaggatgcagttggaggccattatcctaag cgaattaacacagaaacagaaaacctaatgccgcacattcttacttataagtgggagcta aacactgggtacacacagacacaaagatgggaacagtag >gi568815584f:49793737_49994261|GENSCAN_predicted_peptide_4|449_aa MWKQLWNWVIGRGWYRLESSEADRKMWESLELPREMLNGFDKNADSDMNNKVQAEMVSDG DEELVGNWSKGDSCYVLAKRLAAFCPCPRDLWNFELERDDLAYLMEEISKQQRIQKLSLG DRVKTLSLKNNNNNNNNNNKKPKKTGRLHLSIKRYVTHYEVLYIRHDLLVRCISYHLGKS QNEGTSREWCRTQAQPVRASFLDLEYQGKKQRVKNSSEDASGGSVICGQQRGNQGTERLR NSSRITKLTKDLPRTLVVTDQALWLEELAQQRQLRSGEGWKPHTRAPLLLQGCQRSSHTQ RLARPCVLSFLPAGTSGAGSVMLEEHDRGEVEDGFFANNRNMQRLLRAVLVGSEVGGCGA LEHEPTQIAGQPFPSQDSSGNLLTALFWLPYLEHLMFLDVPSSSDNHPYDVPMATMLTLG LPFWDGNPAQGPPSVLILSQVLIASQWDQ >gi568815584f:49793737_49994261|GENSCAN_predicted_CDS_4|1350_bp atgtggaagcaactttggaactgggtaataggcagaggttggtaccgtttggagagctca gaagcagacaggaaaatgtgggaaagtttggaacttcctagagagatgttgaatggcttt gacaaaaatgctgatagtgatatgaacaataaggtccaggccgagatggtctcagatgga gatgaagaactggttgggaattggagcaaaggtgactcttgttatgttttagcaaagaga ctggcagcattttgtccctgccctagagatttgtggaactttgaactcgagagagatgac ttagcgtatctgatggaagaaatttctaaacagcaaaggattcaaaagcttagcctgggt gacagagtgaagacactgtctcttaaaaacaacaacaacaacaacaacaacaacaacaaa aaacccaaaaaaacaggacgtttgcatttgagcatcaagcgctatgtaacacattatgaa gtcctatatattagacatgatttactagtcagatgtattagttaccacctagggaagagc caaaatgaagggacctccagggaatggtgcaggactcaagctcagccagtcagagcctcc tttctggaccttgaatatcaagggaagaagcagagggtgaaaaatagttcagaggacgct agtggtggcagtgtcatctgtgggcagcaaaggggaaaccaaggcacagagagattaaga aactcatccaggattacaaagctaactaaggacttgcccagaactctggtggttacagac caagcactatggttggaggagctagcccaacagaggcagctgcgatcaggggaagggtgg aagccgcacaccagagcacccttgctgctccagggctgtcaacgatcttctcatactcag agactcgctcgcccctgtgtcctttcattccttcctgcaggcacttctggagcgggtagc gtgatgctggaggagcatgacagaggagaggtggaggatgggttttttgccaataacagg aacatgcagaggctgctcagagcagttttggttggtagtgaagttggaggatgtggagcc cttgaacatgagccaactcagatagcagggcagccattcccttcccaggactcctctgga aacctgctcactgctttgttttggttgccttatcttgagcatctgatgtttcttgatgta ccttctagctctgataaccacccctatgatgtccccatggccaccatgcttactctggga ttgcccttctgggatggtaaccctgcccaaggccctccttctgtccttattctgagccag gtgttaatagccagccaatgggatcaataa >gi568815584f:49793737_49994261|GENSCAN_predicted_peptide_5|252_aa EKELCHEKWKTPEKLFQRQESSRQAGLGKILPPSCDHLARQPKNKAVTMETKKRQKDTGS SLSRPAVHEALSEAHNFVFATWVHSSIVSFSLPSGINFLNLVDKALVAQGNKNYGWLATA AEWGWSSPNHHEGQGIILPGDRAEAPLRAVEQHDSGSEETLDGPLRLGSLTSTSLMGNCT VLGDIVLVVALRAFLCSFSPSIPQLDFLPAFHNSILHTGSKYLCTLTRHWLLVVTVLRAV GNMKSNETRSLL >gi568815584f:49793737_49994261|GENSCAN_predicted_CDS_5|759_bp gagaaagaattgtgccatgaaaaatggaaaactcctgagaagctgttccagcgtcaggag agcagcagacaagctggactggggaagattcttcctccatcttgtgatcatctggcaaga cagcccaagaataaagccgtcaccatggagactaagaagagacagaaagacactgggtct tcattgtctcgtccagctgttcatgaggccttatctgaagcccacaactttgtttttgca acatgggtccactcttccatagttagtttctcacttccctctggcattaactttctcaat ctagtggataaggcacttgtggcccagggtaacaagaactatggttggttggcaacagca gctgagtggggctggtcttctccgaatcatcatgaaggacagggaattattttgcctgga gacagggctgaagcacccctacgagcagttgaacagcatgactctggcagcgaagaaacc ctggatgggcctctgaggctggggagcctgaccagtacctcactgatgggaaattgcact gtccttggagacattgtccttgttgtcgccttaagagctttcctttgttctttctctccc agtataccgcagcttgattttctccctgctttccacaatagcattctgcacacaggatcg aaatacctctgtacactcaccagacactggctgctggttgtcacggtgctaagagctgtg ggaaacatgaagagcaatgagaccaggtccctgctctga