GENSCAN 1.0 Date run: 7-Nov-116 Time: 18:01:59 Sequence gi568815575f:154044284_154182677 : 138394 bp : 47.41% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 747 944 198 2 0 47 43 179 0.512 4.47 1.02 PlyA + 2648 2653 6 1.05 2.05 PlyA - 3677 3672 6 1.05 2.04 Term - 13559 13513 47 0 2 99 34 59 0.012 -1.03 2.03 Intr - 16461 16397 65 2 2 126 84 33 0.004 5.06 2.02 Intr - 53572 53321 252 0 0 104 74 184 0.875 15.15 2.01 Init - 65137 65073 65 1 2 73 81 62 0.789 4.62 2.00 Prom - 65759 65720 40 -7.66 3.02 PlyA - 66767 66762 6 1.05 3.01 Sngl - 67881 67543 339 0 0 40 42 226 0.984 9.23 3.00 Prom - 74323 74284 40 -8.46 4.03 PlyA - 74450 74445 6 1.05 4.02 Term - 78040 77924 117 1 0 94 42 100 0.858 4.54 4.01 Init - 78463 78329 135 1 0 76 110 85 0.951 9.65 4.00 Prom - 86060 86021 40 -5.46 5.00 Prom + 87081 87120 40 -5.46 5.01 Init + 87336 87384 49 2 1 86 58 8 0.519 -3.21 5.02 Intr + 87901 88055 155 2 2 84 41 149 0.862 9.59 5.03 Intr + 94568 94778 211 1 1 67 33 100 0.452 0.89 5.04 Intr + 95117 95181 65 0 2 73 87 116 0.914 8.44 5.05 Intr + 99926 100112 187 1 1 21 83 226 0.755 14.66 5.06 Intr + 106373 106669 297 0 0 129 91 522 0.931 53.45 5.07 Intr + 108657 108825 169 1 1 118 110 120 0.994 16.20 5.08 Intr + 110291 110456 166 2 1 82 103 259 0.998 26.76 5.09 Intr + 112011 112250 240 2 0 127 115 380 0.999 42.35 5.10 Term + 114533 114643 111 1 0 116 51 123 0.992 9.96 5.11 PlyA + 114719 114724 6 1.05 6.08 PlyA - 114735 114730 6 -4.33 6.07 Term - 115430 115074 357 2 0 93 49 150 0.741 6.11 6.06 Intr - 116185 116050 136 0 1 55 72 116 0.264 7.27 6.05 Intr - 124339 124224 116 1 2 70 32 98 0.168 1.65 6.04 Intr - 124670 124648 23 0 2 85 113 12 0.672 0.66 6.03 Intr - 125583 125484 100 0 1 80 80 54 0.657 3.48 6.02 Intr - 132066 131467 600 0 0 77 98 648 0.072 57.41 6.01 Intr - 135863 135764 100 1 1 104 84 46 0.850 5.91 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:154044284_154182677|GENSCAN_predicted_peptide_1|65_aa MREKQLSHDSTTVKHMGKGGTCRRYEDSVAEPKKICKIRETGPKSQTGRSTVCRWQIEGP EPPVP >gi568815575f:154044284_154182677|GENSCAN_predicted_CDS_1|198_bp atgcgagagaagcagctctcacacgattccaccacagttaaacacatgggcaaaggaggg acctgccggaggtatgaggattccgtagcagagcccaagaaaatctgcaagatcagagaa actgggcctaagagccagactggcagaagtacggtttgccggtggcaaatagaaggacct gaaccaccagtgccctag >gi568815575f:154044284_154182677|GENSCAN_predicted_peptide_2|142_aa MRRQYTPTRMAKIQNTDNTKCCPAITANDGRARSGAEGGARARRCSSARAGQEGGARRRP CGVPASAARALPPLGERAVVKAVRKMAAAAAAAPSGGGGGGEEERLYHTCLLKCHLPNVA FPEHTHVYLLPKNSFVIEFNQS >gi568815575f:154044284_154182677|GENSCAN_predicted_CDS_2|429_bp atgagacgccagtacacacctactagaatggccaaaatccagaacactgataacaccaaa tgctgcccggccatcacagccaatgacgggcgggctcgcagcggcgccgagggcggggcg cgggcgcgcaggtgcagcagcgcgcgggccggccaagagggcggggcgcgacgtcggccg tgcggggtcccggcgtcggcggcgcgcgcgctccctcctctcggagagagggctgtggta aaagccgtccggaaaatggccgccgccgccgccgccgcgccgagcggaggaggaggagga ggcgaggaggagagactgtatcacacatgtctgctcaagtgtcaccttcccaatgtggcc ttccccgagcacacccatgtctatctgctgcccaagaactcctttgtgattgaatttaac cagagttga >gi568815575f:154044284_154182677|GENSCAN_predicted_peptide_3|112_aa MRQKLVERQGEIDESIDKVRDFNTPLSEINRSSRQKIRKDIVELTNTINQPNIIDIYRIL YPTAAEYMFFSSSRGTFTEMDHILGHKAHLNKCKRIKSHSVFSQTTLELNKK >gi568815575f:154044284_154182677|GENSCAN_predicted_CDS_3|339_bp atgaggcaaaaactggtagaacgacaaggagaaattgatgagtccattgacaaagttaga gacttcaatacccctctgtcagaaattaacagatccagcaggcagaaaatcaggaaggac atagttgaactcaccaacaccatcaatcaacctaatataattgacatctatagaatactt tatccaacagcagcagagtacatgttcttctcaagctcccgtgggacattcaccgagatg gaccacattctgggccataaagcacaccttaacaaatgtaaaagaataaaatcacacagt gttttctctcagaccacattggaattaaacaagaaatga >gi568815575f:154044284_154182677|GENSCAN_predicted_peptide_4|83_aa MLESVKEASPEDMSPSPVTVPGLRVFDGYRVSRCPYRVGQLRNPQYKTQTRMTMIEMSRV PSSHPPQEPNSSQESIIPGTAVL >gi568815575f:154044284_154182677|GENSCAN_predicted_CDS_4|252_bp atgctggagagtgtcaaggaggcctcccctgaggacatgagtccctcccctgtgactgtg cccggactgagggtctttgatggctacagagtttcacggtgtccctacagagttggtcag ctcaggaacccccagtacaagactcagacccgcatgaccatgattgagatgtctcgggtg ccctcgagccaccccccccaggagccaaattcttcccaagagtccattatcccggggaca gctgtcttgtga >gi568815575f:154044284_154182677|GENSCAN_predicted_peptide_5|549_aa MGFHHVDQAGLKLLTSGQLCSSLKQLTRVSWQLSLVLDFSKISPQINNNTDKGLRTTVFG TWLLGTIQPPGNPVDSTYKSSPESATSYHPTVASQTWSPRFCPGPYSSQTAARGVHPNQL QIASLFLSKSDKAPHSIQKRQLQLRSTGSPSAAASRDLEQASIKRRDPQVMRQGRLPSGT GLSIAMAQQWSLQRLAGRHPQDSYEDSTQSSIFTYTNSNSTRGPFEGPNYHIAPRWVYHL TSVWMIFVVTASVFTNGLVLAATMKFKKLRHPLNWILVNLAVADLAETVIASTISIVNQV SGYFVLGHPMCVLEGYTVSLCGITGLWSLAIISWERWMVVCKPFGNVRFDAKLAIVGIAF SWIWAAVWTAPPIFGWSRYWPHGLKTSCGPDVFSGSSYPGVQSYMIVLMVTCCIIPLAII MLCYLQVWLAIRAVAKQQKESESTQKAEKEVTRMVVVMIFAYCVCWGPYTFFACFAAANP GYAFHPLMAALPAYFAKSATIYNPVIYVFMNRQFRNCILQLFGKKVDDGSELSSASKTEV SSVSSVSPA >gi568815575f:154044284_154182677|GENSCAN_predicted_CDS_5|1650_bp atggggtttcaccatgttgaccaggctggcctcaaacttctgacctcagggcagctgtgc tcctctcttaagcaactgacccgtgtctcctggcaactctctttggtactggacttctcc aagatcagtcctcaaattaataacaacactgacaagggtcttcggaccaccgtgtttggt acctggctgcttgggaccatccagccaccaggaaatcctgttgactccacttacaaaagc tctccagagtctgctacctcttaccaccctactgtcgcctcccagacctggtctccccgc ttctgccctggcccctacagctcgcagacagctgccagaggagtccatccgaatcaactt cagattgcgtcactttttctttcaaaatccgacaaagctcctcattccattcagaaaagg cagctgcagctgcggagcacgggatctccatctgcagctgcatcccgggacctagaacag gccagtataaagcgccgtgaccctcaggtgatgcgccagggccggctgccgtcggggaca gggctttccatagccatggcccagcagtggagcctccaaaggctcgcaggccgccatccg caggacagctatgaggacagcacccagtccagcatcttcacctacaccaacagcaactcc accagaggccccttcgaaggcccgaattaccacatcgctcccagatgggtgtaccacctc accagtgtctggatgatctttgtggtcactgcatccgtcttcacaaatgggcttgtgctg gcggccaccatgaagttcaagaagctgcgccacccgctgaactggatcctggtgaacctg gcggtcgctgacctagcagagaccgtcatcgccagcactatcagcattgtgaaccaggtc tctggctacttcgtgctgggccaccctatgtgtgtcctggagggctacaccgtctccctg tgtgggatcacaggtctctggtctctggccatcatttcctgggagagatggatggtggtc tgcaagccctttggcaatgtgagatttgatgccaagctggccatcgtgggcattgccttc tcctggatctgggctgctgtgtggacagccccgcccatctttggttggagcaggtactgg ccccacggcctgaagacttcatgcggcccagacgtgttcagcggcagctcgtaccccggg gtgcagtcttacatgattgtcctcatggtcacctgctgcatcatcccactcgctatcatc atgctctgctacctccaagtgtggctggccatccgagcggtggcaaagcagcagaaagag tctgaatccacccagaaggcagagaaggaagtgacgcgcatggtggtggtgatgatcttt gcgtactgcgtctgctggggaccctacaccttcttcgcatgctttgctgctgccaaccct ggttacgccttccaccctttgatggctgccctgccggcctactttgccaaaagtgccact atctacaaccccgttatctatgtctttatgaaccggcagtttcgaaactgcatcttgcag cttttcgggaagaaggttgacgatggctctgaactctccagcgcctccaaaacggaggtc tcatctgtgtcctcggtatcgcctgcatga >gi568815575f:154044284_154182677|GENSCAN_predicted_peptide_6|477_aa XYSLSVSKISVNYLLDCIDAIEKAACSLKVMVLKAEHTRSPSATLPSNVPSCRSLSSSED GPSGPSSLADGGLAHNLQDSVRHRILYLSEQLRVEKASRDGNTVSYLKLVSKADRHQVPH IQQAFEKVNQRASATIAQIEHRLHQCHQQLQELEEGCRPEGLLLMAESDPANCEPPSEKA LLSEPPEPGGEDGPVNLPHASRPFILESRFQSLQQGTCLETEDVAQQQNLLLQKKGETET RTHPEGERRVETEAEADLPWQKPRKPAASSPFELHSTAGWVVYKERKFNVKALAAGEDLC TASSYGGRQKGKRAQALMEEQVNGRLQGQLNEIYNLKHNLACSEERMAYLSYERAKEIWE ITETFKSRISKLEMLQQVTQLEAAEHLQSRPPQMLFKFLSPRLSLATVLLVFVSTLCACP SSLISSRLCTCTMLMLIGLGVLAWQRWRAIPATDWQEWVPSRCRLYSKDSGPPADGP >gi568815575f:154044284_154182677|GENSCAN_predicted_CDS_6|1434_bp nnttattctctcagtgtatcgaagatatcagtcaactatcttctggattgcattgatgct attgagaaggcagcctgcagtctaaaagtcatggttttaaaggcggaacacaccaggagc cccagcgcaaccctcccctccaatgtgccttcatgccggtccctgtcatccagcgaagac ggccccagtggcccttccagcctcgcagatggaggcctagcccacaacttacaggatagt gtcaggcaccgcatcctctacctctcagagcagctgagagtggagaaggccagtcgggat ggcaacactgtgagctacctcaagctggtatccaaagcagaccggcaccaggtgccgcac atccagcaggcctttgagaaggtgaaccagcgcgcctctgccaccatcgcccagatcgag cacaggctccaccagtgtcaccagcagctccaggagctggaggaaggctgcaggcccgag ggcttactgctgatggcagaaagcgacccagccaactgcgagccacccagtgagaaggcc ctgctttcagagccccccgagccaggtggggaagacgggccggtcaacctgcctcatgcc agcaggcccttcatcttggagagtcgcttccagagcttacagcaggggacgtgcttagag acagaggatgtggcccagcaacaaaacctgctgttgcagaagaaaggggagacggagaca cggacacacccagagggagagcgccgtgtggagacagaggctgaggccgacctgccgtgg cagaagccaaggaagcccgcagcttcatctccttttgaacttcacagtaccgcaggctgg gtggtttataaagaaaggaagttcaatgtcaaggcgctggcagctggtgaggacctttgt acggcatcatcctatggcggaaggcagaagggcaagagggcgcaagcattgatggaagaa caggtgaatggtcgcctgcagggacagctgaatgagatttacaacctcaaacacaatctg gcctgcagcgaagagagaatggcctatctatcctatgagagagccaaggaaatatgggag atcacggagaccttcaagagccgaatatccaagctggagatgctacagcaagtcacccaa ctggaggcagcggagcacctccaaagccgtcccccgcagatgttgttcaagttcctgagt ccgcgcctctcactggcaaccgtcctcttggtctttgtctccaccttgtgtgcctgcccc tcgtcactgatcagctcacgcctgtgcacctgcaccatgctgatgctgatcgggcttggg gtcctggcctggcagaggtggcgcgccatccctgccacagactggcaggaatgggtcccc tccaggtgtagactgtactccaaggactctgggcctccagcagatggaccttaa