GENSCAN 1.0 Date run: 4-Nov-116 Time: 23:33:46 Sequence gi568815595f:40661545_40861796 : 200252 bp : 40.68% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 8571 8730 160 2 1 81 87 61 0.463 5.34 1.02 Intr + 30232 30356 125 2 2 84 72 80 0.026 5.38 1.03 Term + 34795 34905 111 0 0 80 48 79 0.033 0.68 1.04 PlyA + 35979 35984 6 1.05 2.10 PlyA - 36237 36232 6 1.05 2.09 Term - 37855 37522 334 0 1 -89 48 535 0.327 24.90 2.08 Intr - 38473 38372 102 0 0 35 77 128 0.389 4.77 2.07 Intr - 39882 39736 147 2 0 51 41 114 0.289 1.43 2.06 Intr - 40672 40507 166 2 1 35 84 99 0.117 2.30 2.05 Intr - 43894 43861 34 1 1 101 83 50 0.086 2.68 2.04 Intr - 49103 48899 205 1 1 68 110 131 0.950 11.58 2.03 Intr - 59426 59208 219 1 0 70 82 73 0.044 1.40 2.02 Intr - 61753 61690 64 2 1 126 77 5 0.074 0.06 2.01 Init - 65102 64988 115 2 1 68 110 39 0.722 4.53 2.00 Prom - 66172 66133 40 -5.95 3.03 PlyA - 66858 66853 6 1.05 3.02 Term - 71102 70912 191 0 2 37 43 225 0.539 9.53 3.01 Init - 77122 77041 82 1 1 71 76 51 0.374 3.38 3.00 Prom - 81492 81453 40 -4.25 4.07 PlyA - 81622 81617 6 1.05 4.06 Term - 84663 84477 187 2 1 16 38 185 0.201 2.28 4.05 Intr - 93865 93673 193 2 1 48 60 140 0.182 4.83 4.04 Intr - 95710 95495 216 2 0 82 -3 182 0.104 6.05 4.03 Intr - 97526 97375 152 1 2 73 86 32 0.018 0.39 4.02 Intr - 108934 108782 153 0 0 38 63 153 0.140 6.07 4.01 Init - 115530 115307 224 0 2 65 63 139 0.106 6.63 4.00 Prom - 118192 118153 40 -7.55 5.00 Prom + 120203 120242 40 -6.75 5.01 Init + 120932 121024 93 1 0 71 3 224 0.311 12.63 5.02 Intr + 125303 125636 334 1 1 76 45 269 0.053 15.62 5.03 Intr + 139952 140063 112 0 1 95 67 22 0.001 -0.68 5.04 Intr + 140126 140256 131 2 2 77 34 78 0.003 0.72 5.05 Term + 156920 157074 155 0 2 112 40 165 0.819 11.20 5.06 PlyA + 157079 157084 6 1.05 6.04 PlyA - 158340 158335 6 1.05 6.03 Term - 166390 166301 90 1 0 70 43 110 0.100 1.44 6.02 Intr - 168411 168291 121 2 1 95 58 51 0.050 2.38 6.01 Init - 193731 193604 128 0 2 82 55 94 0.173 5.18 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 48579 48511 69 0 0 87 55 78 0.854 1.36 S.002 Sngl - 119209 118817 393 1 0 60 32 254 0.855 13.00 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:40661545_40861796|GENSCAN_predicted_peptide_1|131_aa MWGHSGGMFSAASKEHQQIPSWVALLPTLTLVTPGHKPTPPGEGSPCAPCWVHSVMGTVA PMQRSCSPRTGEASLQIREAERLPDSFSFENCFQRASSIYLPNYLHQILADKITSVASVS RTKQWLIQQLL >gi568815595f:40661545_40861796|GENSCAN_predicted_CDS_1|396_bp atgtggggccactctgggggaatgttttctgctgctagcaaggaacaccagcaaatccca tcttgggtggctctgctccctacactcacacttgtcactcccggtcataaacccactcct cctggggaaggttcaccatgtgctccctgctgggttcattctgtcatgggaactgttgct cccatgcagagaagctgctcaccaaggactggagaggccagcctgcagatcagagaagca gagagacttccagacagtttctcatttgaaaattgtttccagagggcttccagcatctac ctacccaattacctacatcaaatccttgctgataagataactagtgttgcttctgtttcc cggactaaacaatggttaatacaacaactactgtaa >gi568815595f:40661545_40861796|GENSCAN_predicted_peptide_2|461_aa MDVSSSRILLYLQIPLRAPLYKQERELPSVPLIPGGWTDGAGLCNSAVILLYASTQTVFM DDPYASIEACKLGHQGSNSLLVHTHTSSLFPRVETGHKLAPLVPRQVWFQALMQTANHCL HFSISATSLAAHRTQKTSQCPLPKAAVSFSLQMRDNHLLDGAMRHQFHHGTSSGVWTPPF IFLSMEEPVATSIQIKGPINQVFEKQVVFRYIKCTTPTVNPNENYGLWEIIRCKGRFINC NKCTTVVQDVESGGGCACVGTEVYGNALLLSMSVEVQRALIWSPTGLGSHILMALLGVAH VAAHMGWSFVPEAFPGSPGAWTVQGQGTTSGENLAGEDSLKSPGAVQGISCKCVAASHAQ RLCPTTIVNNTVPKRKAEGDAKGDKAKVKDEPQRRSMRFSAKPAPPKPEPKPKKAPAKKG EKIPKGKKGKADAGEEGNNPAENGDAKTDQAQKAEGAGGAK >gi568815595f:40661545_40861796|GENSCAN_predicted_CDS_2|1386_bp atggatgtcagcagcagcaggatactcctgtaccttcaaattccccttagagctccccta tacaagcaggaaagggagcttccgtcagtgcctttaattccagggggctggacagatggg gcagggctgtgcaattcagcagtcattcttttatatgcatcaacacaaactgtgttcatg gatgatccctatgcaagtatagaggcttgcaagcttgggcaccaaggtagtaactctctt ttagtccacacgcacacctccagtctcttccccagggtggaaacaggccacaagctggcc ccattagtgcccaggcaggtctggttccaggccctcatgcagactgcgaaccactgtctc cacttctccatctccgccacctctctggctgcccatagaacacaaaagacatctcagtgt ccattgcctaaggcagcagttagcttcagccttcagatgagggacaaccatctccttgac ggagccatgaggcatcaattccatcatggaacttcttcaggggtttggactcctcccttt atctttctttctatggaggagccagttgctaccagtattcagattaaaggacccatcaat caggtttttgagaaacaggtggtgtttcgttacattaaatgtacaacaccaacagtgaat cctaatgaaaactatggactttgggagataataaggtgtaagggtagattcatcaactgt aacaaatgtaccactgtggtgcaggatgttgagagtgggggaggctgtgcatgtgtgggg acagaagtatatgggaacgctcttctcttatctatgagtgtggaagtccagagggccctg atatggtcccccacaggccttggcagccacatcctcatggctttactgggtgtagcccat gtggctgctcatatgggttggagttttgtgcctgaagctttcccaggcagccctggagcc tggacagttcaaggtcaagggacaacaagtggtgaaaaccttgctggtgaggactctctg aagagtcctggggctgtgcagggcatctcatgcaagtgtgttgctgcatcccacgcccag cgcttatgtcccaccaccatcgtcaacaacaccgtgcccaagagaaaggctgaaggggat gctaaaggagataaagccaaggtgaaggatgaaccacagagaagatccatgaggttttct gctaaacctgctcctccaaagccagagcccaagcctaaaaaggcccctgcaaagaaggga gagaagatacccaaagggaaaaagggaaaagctgatgctggcgaggaggggaataaccct gcagaaaacggagatgccaaaacagaccaggcacagaaagctgaaggtgctggaggtgcc aagtga >gi568815595f:40661545_40861796|GENSCAN_predicted_peptide_3|90_aa MQRASEINCLGSDYTHVTNSPTPPLSPGTTRYQENKELTDPLKAAAGLCKFQKTGEKLSS QSVRGENLPPNTSPLGNAKVQIMEEGFNLT >gi568815595f:40661545_40861796|GENSCAN_predicted_CDS_3|273_bp atgcagagagccagtgaaataaactgcttgggctcagattatacccatgtcaccaactca ccaacacctcctttgtctccaggaaccaccaggtaccaggaaaacaaagaattaacagat cctttgaaagcagcagcaggcctctgcaaattccagaagacaggtgaaaaactgagttct caaagtgtgaggggagaaaacctgcctccaaacacatccccactggggaacgcaaaagtc cagattatggaggaaggatttaaccttacctag >gi568815595f:40661545_40861796|GENSCAN_predicted_peptide_4|374_aa MHFVIWKALALSLARVAAISVGQKQNLEEVAMPRKHGSAQDRLRCPAGGKDEGAHYAEKD PLEVGVVTRRNPSDQTSCGYKEKETSDVFGALGTIQLPAEAGEQGFEGKLFIQEMISEST SQKLGKLPVHSDGSFFCCAELQTTAQGNKRGYKQMEEHSMLMGRKNQYHENGHTAQGLKK MIRAAEVLQDKLENVGLGNVTIPGSEFITKLIGVPEKASQMDMCDPVPHFTTESSLGHER FHPKAPAIGQSPRGGPDLFLNIVKETPTALSVCTGSFLQSEAGQGQCLLAVNRVPLAVLV QRVEFPSPGNPEGYFCGQESLQVERGAGAQDKKAPGYKQGIRNHQLKLQLTSRVAEVIQH QSWQHLLSRTNFLE >gi568815595f:40661545_40861796|GENSCAN_predicted_CDS_4|1125_bp atgcattttgtcatatggaaggctctagcactttctctggctagggtggcagctatcagt gtagggcaaaagcaaaacctggaagaggtggccatgcccagaaagcatggcagtgcacag gacaggctgcggtgcccagcaggagggaaggatgaaggtgcccactatgcagagaaggac ccactggaagtgggagtggtgaccaggagaaaccccagtgaccagaccagctgtggctat aaagaaaaagaaacctcagatgtgtttggagctcttggtactattcagcttcctgcagaa gcaggggaacaaggatttgagggcaagttgtttattcaggaaatgatctcagaaagcacc agtcaaaaactagggaagttgcctgttcactctgatggtagtttcttttgctgtgcagaa ctacaaaccactgctcaaggaaataaaagaggatacaaacaaatggaagaacattccatg ctcatgggtaggaagaatcaatatcatgaaaatggccatactgcccaaggtctcaagaag atgatcagggcagcagaagtccttcaagacaaactggaaaatgtggggcttgggaatgtt acgataccaggttcagagttcatcaccaaactcattggtgtcccagagaaagcaagtcag atggacatgtgtgacccagttccccattttacaactgagagcagtctagggcatgaaagg tttcacccaaaggccccagctattgggcagtcacccaggggtggacctgacctcttcctc aacattgtcaaggaaacaccaacagccttgtcagtgtgcacaggatccttcctgcagtca gaggcaggacagggacagtgtctccttgctgtgaacagagtccctctagctgtcttggta cagagggtagagtttcccagccctggcaatcctgagggttacttctgtggccaagaaagc cttcaggtggagagaggtgcaggggctcaagataagaaggcaccagggtacaaacagggt atcaggaaccatcaactgaagctacagttgacctccagggtggcagaagtgatacagcat cagagttggcagcacctgctgtcacgaacaaacttcctggaataa >gi568815595f:40661545_40861796|GENSCAN_predicted_peptide_5|274_aa MRNPLPLLAPSFDDDDNDDDDDDDGDDDNLCGMDWYWFVAWGLGTPGLGYQVEEIAKQQR IQDVIWLLLMAYNQIQEQINELNLELKVKREEEYKNLENLQTGHVIEKEKSFSGEEFKQS VEQPLAGEISVTKMEPSANIQGIPVLCPSVADLHRPHLSGSQILCLQTVLSHWVWAEEGS SGSVQQWMCSSSTAAPLGRPSGMAMTLTGSSDIFPSLFSFRFTGRDCESQTLTTHETLMV ADDGGDEIYNGNEVTNSERCNHQRNTTGCTENRK >gi568815595f:40661545_40861796|GENSCAN_predicted_CDS_5|825_bp atgaggaaccctctcccacttcttgctccttcctttgatgatgatgataatgatgatgat gatgatgatgatggtgatgatgacaacctctgtggcatggactggtactggtttgtggcc tgggggttggggacccctggcctagggtatcaggttgaagaaattgccaaacagcaaagg attcaagatgtgatctggctgcttctaatggcctacaatcagatacaggagcaaataaat gagttaaatttggaacttaaagttaaaagagaagaagaatataaaaatttggaaaatttg cagactggccatgtgatagagaaggaaaaatcattttcaggagaggaattcaagcagtct gtggagcaaccacttgctggagagatcagtgtaactaaaatggaaccaagtgctaatatc caaggcattcctgttttgtgtccttctgtggctgacctccacagaccacatctttcaggc tctcagatcctctgtcttcagactgtgctcagccattgggtatgggcagaggaaggaagc agtggcagtgttcagcagtggatgtgttcctcctcaactgcagctcctctaggcagacct tccggcatggctatgactctcacaggctccagtgatatttttccctcccttttttccttc aggtttacgggcagagactgtgaatcccagacactaactacacatgaaacattgatggtt gctgatgatggtggtgatgaaatttataatggaaatgaggtaaccaattccgagaggtgt aaccaccagagaaacaccactggctgtacagagaacagaaaataa >gi568815595f:40661545_40861796|GENSCAN_predicted_peptide_6|112_aa MREKQYVMILKASAPHLYPVHSAHIPLAKASITARSRVNRMEKEKDNYVCGPSSMSVFLI IFFKFAAPYYRKKNYMHCLNFPQTPDSFCSDPTPTEAVTQESSMSNAELESV >gi568815595f:40661545_40861796|GENSCAN_predicted_CDS_6|339_bp atgagggagaaacagtacgtgatgattcttaaggcttcagccccgcatttataccctgtc cactctgctcacattccattggccaaggcaagtatcacagccaggtccagagtaaatcgg atggagaaagagaaagataactatgtttgtggcccatcatctatgtctgtgtttcttatc atttttttcaaatttgcagcaccatactatcggaagaagaactacatgcattgcctcaat ttcccccagactcctgatagtttctgctcagatccaacaccaaccgaagcagtgacccag gagtcttcaatgagcaatgcagaactggaaagcgtctaa