GENSCAN 1.0 Date run: 4-Nov-116 Time: 22:23:23 Sequence gi568815591f:129557352_129855178 : 297827 bp : 44.76% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1153 1266 114 0 0 53 81 117 0.937 7.71 1.02 Intr + 9153 9219 67 2 1 67 33 80 0.045 -1.12 1.03 Intr + 25238 25414 177 0 0 6 116 179 0.040 12.39 1.04 Term + 26304 26845 542 1 2 -29 55 432 0.803 22.62 1.05 PlyA + 29263 29268 6 1.05 2.00 Prom + 45008 45047 40 -4.76 2.01 Init + 54280 54473 194 0 2 97 105 210 0.300 20.04 2.02 Intr + 97403 97517 115 2 1 86 115 -47 0.009 -1.75 2.03 Intr + 99995 100223 229 1 1 29 72 279 0.030 17.84 2.04 Intr + 114078 114192 115 1 1 97 85 87 0.863 8.81 2.05 Intr + 120281 120407 127 2 1 91 103 65 0.953 8.98 2.06 Intr + 133055 133195 141 1 0 104 94 179 0.945 20.65 2.07 Intr + 151724 151882 159 1 0 108 76 106 0.590 11.58 2.08 Intr + 153023 153220 198 1 0 67 109 37 0.863 3.25 2.09 Intr + 154124 154225 102 1 0 112 94 106 0.999 13.97 2.10 Intr + 159868 160025 158 0 2 112 41 92 0.590 5.71 2.11 Intr + 160306 160461 156 1 0 45 64 130 0.171 5.43 2.12 Intr + 169890 170014 125 0 2 113 95 56 0.259 9.03 2.13 Intr + 186822 186878 57 1 0 59 94 55 0.034 2.06 2.14 Term + 197667 197830 164 1 2 119 44 245 0.928 21.30 2.15 PlyA + 199702 199707 6 1.05 3.00 Prom + 200349 200388 40 -6.56 3.01 Sngl + 205049 205408 360 1 0 79 53 215 0.339 11.18 3.02 PlyA + 205919 205924 6 1.05 4.08 PlyA - 206192 206187 6 1.05 4.07 Term - 212434 212273 162 1 0 45 48 102 0.373 -0.16 4.06 Intr - 213172 212958 215 2 2 118 36 61 0.238 2.33 4.05 Intr - 214797 214685 113 2 2 106 31 65 0.324 2.62 4.04 Intr - 217249 217174 76 2 1 97 32 59 0.211 -0.23 4.03 Intr - 221514 221375 140 2 2 101 55 55 0.685 3.61 4.02 Intr - 222254 221738 517 0 1 95 65 107 0.242 1.12 4.01 Init - 222541 222439 103 1 1 56 109 90 0.620 6.35 4.00 Prom - 224588 224549 40 -4.06 5.00 Prom + 226251 226290 40 -6.76 5.01 Init + 228357 228457 101 2 2 79 69 143 0.192 9.24 5.02 Intr + 233659 233764 106 1 1 33 49 84 0.035 -0.78 5.03 Term + 247369 247479 111 0 0 102 49 74 0.678 3.46 5.04 PlyA + 250040 250045 6 1.05 6.00 Prom + 261650 261689 40 -1.86 6.01 Init + 267475 267599 125 0 2 36 75 142 0.408 5.45 6.02 Intr + 267654 267757 104 0 2 65 68 35 0.105 -0.98 6.03 Term + 276842 276957 116 0 2 84 44 131 0.933 7.03 6.04 PlyA + 277227 277232 6 1.05 7.04 PlyA - 277246 277241 6 -0.45 7.03 Term - 277710 277586 125 1 2 76 39 221 0.960 14.55 7.02 Intr - 281984 281856 129 0 0 101 116 62 0.671 10.97 7.01 Init - 297287 297284 4 2 1 100 80 0 0.185 0.56 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 25151 25414 264 0 0 55 116 199 0.912 16.88 S.002 Init + 100001 100223 223 1 1 51 72 260 0.812 19.42 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:129557352_129855178|GENSCAN_predicted_peptide_1|299_aa MALIDVGFVGEKRADGSPLSGQLFTIPGRRAKLAVGNMGSKKEKNENRAKGEAPYKTIRS LDWELYVDGSSFINPQGERCAGYAVVILDAVIEAKSLPQGTSAQKAELIALIWALELSEG PTVLSGIQAYGEAPFEDLNIDFTKMPKCGSNKYLLVPVCTYSGWVEAYPTRTKKARVLLR DLIARFGLPLWIGSENGPAFVADLVQKTAKVMSVDQRLERSPLAATVERTPDCHLDHSHS CEGRGNPSLGPPQLRKTCSTGDLVGETKPRQPLQSDSEEVDKPCSSHTRKPTGPLMAEA >gi568815591f:129557352_129855178|GENSCAN_predicted_CDS_1|900_bp atggccctcattgacgtgggcttcgtgggtgagaaacgggctgatgggtcgcctctttca gggcagttgttcactattccagggaggagggccaagctggcagtgggcaacatgggcagc aagaaggagaagaatgagaaccgagcaaagggggaagccccttataaaaccatcagatct ttagactgggagctgtacgtggacgggagcagcttcatcaacccacaaggagagaggtgt gcgggatatgcggtggtaatcctggatgctgtcattgaagccaaatcattgccccagggc acttcagcccagaaggccgaactcattgctttaatttgggccttagagctgagtgaaggt ccaaccgtcctgtctggcatacaagcttatggagaagccccctttgaagatctcaatata gacttcaccaagatgcccaaatgtggcagtaacaagtatttgctagttccagtgtgtaca tactctgggtgggtggaggcctatccaacacgaaccaagaaagctcgtgttcttctccga gatctcatcgctaggtttggactgcccttatggatcggctcagaaaatgggccggcgttt gtggctgacttggtacagaagacagcaaaggtgatgagtgtggatcaaagactggaacgt agcccccttgcagccacggtggaaaggaccccagactgtcatcttgaccactcccacagc tgtgaaggtagagggaatcccagcctgggtccaccacagctgcgtaaaacctgcagcacc ggagacctggtaggtgagaccaagcctagacaacccctgcaaagtgactctgaagaagtc gacaagccctgctccagtcacacccggaagccgactggtccactcatggccgaagcatga >gi568815591f:129557352_129855178|GENSCAN_predicted_peptide_2|679_aa MAAAGRGSRAVRRVRALSLPSPSSAAAAAAEAAALAIAAGGRRLRGAGAVAVSTAQAHGS AAALRDQLALFMWVYFWTFSSVTLVYFSGVSPIPYYLDYCRFMNFMEEHGVTQTEHMATI EAHAVAQQVQQVHVATYTEHSMLSADEDSPSSPEDTSYDDSDILNSTAADEVTAHLAAAG PVGMAAAAAVATGKKRKRPHVFESNPSIRKRQQTRLLRKLRATLDEYTTRVGQQAIVLCI SPSKPNPVFKVFGAAPLENVVRKYKSMILEDLESALAEHAPAPQEVNSELPPLTIDGIPV SVDKMTQAQLRAFIPEMLKYSTGRGKPGWGKESCKPIWWPEDIPWANVRSDVRTEEQKQR VSWTQALRTIVKNCYKQHGREDLLYAFEDQQTQTQATATHSIAHLVPSQTVVQTFSNPDG TVSLIQVGTGATVATLADASELPTTVTVAQVNYSAVADGEVEQNWATLQGGEMTIQTTQA SEATQAVASLAEAAVAASQEMQQGATVTMALNSSFYWLLMERSSPHHSSLLPPPSPNNQC SLSDFQPKLVKPSRLSYSVSSKQKHEAAAHAVATLAEATLQGGGQIVLSGETAAAVGALT GVQDANGLFMADRAGRKWILTDKATGLVQIPVSMYQTVVTSLAQGNGPVQVAMAPVTTRI SDSAVTMDGQAVEVVTLEQ >gi568815591f:129557352_129855178|GENSCAN_predicted_CDS_2|2040_bp atggcggcggccgggcggggaagccgggccgtgcgtcgcgtgcgtgccctctccctcccc tccccctcctcggcggcggcggcggcggcagaagcggcagcgctcgccattgccgctggt ggcaggaggctgcgaggagccggcgcggtcgcagtctccacggcgcaggcccacggtagc gcagccgctctgagagatcagctggctctatttatgtgggtctatttctggaccttctct tctgttacattggtctatttctctggtgtttcaccaataccatactatcttgattactgt aggtttatgaacttcatggaggaacacggagtgacccaaaccgaacatatggctaccata gaagcacatgcagtggcccagcaagtgcagcaggtccatgtggctacttacaccgagcat agtatgctgagtgctgatgaagactcgccttcttctcccgaggacacctcttacgatgac tcagatatactcaactccacagcagctgatgaggtgacagctcatctggcagctgcaggt cctgtgggaatggccgctgctgctgctgtggcaacaggaaagaaacggaaacggcctcat gtatttgagtctaatccatctatccggaagaggcaacaaacacgtttgcttcggaaactt cgagccacgttagatgaatatactactcgtgtgggacagcaagctattgtcctctgtatc tcaccctccaaacctaaccctgtctttaaagtgtttggtgcagcacctttggagaatgtg gtgcgtaagtacaagagcatgatcctggaagacctggagtctgctctggcagaacacgcc cctgcgccacaggaggttaactcagaactgccgcctctcaccatcgacggaattccagtc tctgtggacaaaatgacccaggcccagcttcgggcatttatcccagagatgctcaagtac tctacaggtcggggaaaaccaggctgggggaaagaaagctgcaagcccatctggtggcct gaagatatcccctgggcaaatgtccggagtgatgtccgcacagaagagcaaaagcagagg gtttcatggacccaggcactacggaccatagttaaaaactgttataaacagcatgggcgg gaagaccttttgtatgcctttgaagatcagcaaacgcaaacacaggccacagccacacat agtatagctcatcttgtaccatcacagactgtagtccagacttttagtaaccctgatggc actgtctcacttatccaggttggtacgggggcaacagtagccacattggctgatgcttca gaattgccaaccacggtcaccgttgcccaagtgaattattctgccgtggctgatggagag gtggaacaaaattgggccacgttacagggaggtgagatgaccatccagacgacgcaagca tcagaggccacccaggcggtggcatcgttggcagaggccgcagtggcagcttctcaggag atgcagcagggagctacagtcactatggcgcttaacagttccttctactggctgttgatg gaaagatcatctccgcatcattcttcactcctgccgccacccagccccaacaaccagtgt agcctttctgacttccagcccaagctggtcaagccttcccgtttatcctattctgtgtcc agtaagcagaagcacgaagctgccgcccatgctgtcgccaccctggctgaggccacctta caaggtgggggacagatcgtcttgtctggggaaaccgcagcagccgtcggagcacttact ggagtccaagatgctaatggcctctttatggcagatcgtgcaggtcgcaagtggatcctg actgacaaagccacaggcctggtccagatccctgtgagcatgtaccagactgtggtgacc agcctcgcccagggcaacggaccagtgcaggtggccatggcccctgtgaccaccaggata tcagacagcgcagtcaccatggacggccaagctgtggaggtggtgacattggaacagtga >gi568815591f:129557352_129855178|GENSCAN_predicted_peptide_3|119_aa MKTSWALLRSLGAGHSCGAVGHGRAEGPWPRGLPSMPTPTPPVGPSQCGPCLLAQPERHD AGKRLRPAGAQHLPDCERGSPQMAWAKAARVRETRRGVTQSPSASRTQSRRRGRTRLAG >gi568815591f:129557352_129855178|GENSCAN_predicted_CDS_3|360_bp atgaagacttcgtgggccctgctgaggtccttgggagcagggcattcgtgcggggctgtg gggcacggccgagctgagggtccctggcccaggggcctcccgtccatgccgacgcccact cctcccgtaggacccagccagtgcgggccctgtcttctagcccagcctgaacgccacgat gcagggaagcgcctgaggcctgccggggctcaacacctgcccgactgtgagcgcggaagc ccgcagatggcctgggcgaaggcggcccgggtgcgggagacgcgacgcggggtcactcag agcccgagcgcatcccgaacccagagccgccgtcgggggcgcacccgattggccggctga >gi568815591f:129557352_129855178|GENSCAN_predicted_peptide_4|441_aa MRLFPNRGRFCLASLVWTPRGAERRLCAQPGAGAGGGAHGAPGSRVRQWRLPSAQRCSRA GAGAGAAAVRVRPVLARCLPARSLAGVGLRFAALVGSGGSARLGGGCGERRGPGAGPSSV LSGLRGWFEAASVARSRPAQSLGESGSQTSRVPPLRGHSSREPRRTRMRHRSPPSTAPFL LIALVAPRFPALPQDRRLDGLPLAPHRQGAAAPPRFAPVFAKLLRAKILRARWRILGPGH SALRVRITLYLSPTHLAQLLCEGPALVGSQIALHCLGPRPLSQPLPSLRRVEMRSGTIDR LQPQAADGARLVRPQLAQTEASPAPGGSCLPPPVFGNGRTHTGEVTGSGGSRLANYGART QPAPCAQPAREGPAMLDLLFSARKEGTQVAHGLDLYASSIPLHGFVPPDASEPETKPPLT FPELVFAVGAPLSRSRATGSE >gi568815591f:129557352_129855178|GENSCAN_predicted_CDS_4|1326_bp atgcggctctttccaaaccggggccgtttttgcctggcttctctggtgtggactccgcgc ggagctgagaggcggctgtgcgcccagcctggagccggcgcaggcggaggcgcccacggg gcgcctgggtcccgggtccgccagtggcgtctgcccagcgcccagcgctgttcccgggcg ggggcgggggcgggagcggctgcggtccgagtgcggccggtgcttgcgaggtgcctaccc gcgaggagtttggctggggtcgggctgcgtttcgccgccctggtgggctcaggcggaagc gccaggctgggagggggctgcggagagcgccggggccccggagctggcccctccagcgtc ttgtcagggctgcgcggctggttcgaggctgcttccgtggcccggagccgccccgcccag tctctgggggaaagcggctctcagacctcccgtgtgcctccacttcgcggccactcatcc cgggaaccccggcgcacgcgaatgagacaccgttccccgccctcaactgccccattcctg ttaatagcgctagtggcacccaggttcccagcccttccccaagatcggcggctggacggc ttgcctctggcgcctcaccggcaaggtgcggcggcgcctcccaggtttgcgcccgtcttc gcaaagctcctgagggcgaaaatcctgcgagccaggtggcggattctcgggccgggtcac tcagctctacgggtgcgcatcaccctctacctgtcacccacccatctggctcagctgctg tgtgagggcccagcgctggtgggcagccagatcgccttacactgcctggggccacggcca ttatcccagcctctgcccagcctcaggcgggtcgaaatgcgatccggcaccattgatcgc ctgcagccccaggctgccgatggagcccggctggtgcggccacagctggcacagaccgag gcctccccagctcctggggggagctgcttgcctccccccgtttttggcaatggtagaact cacactggtgaggtaacaggatccggtggttctagacttgccaactatggggcgaggact cagccggcaccctgtgcacagccagcgagggaagggccggccatgctggacctgctgttc tccgcgaggaaggaggggactcaggtggcccatgggctggacctctatgcctccagcatt ccacttcatggatttgttcctcctgacgcctcagaacctgagacaaagcctccactcacc ttcccagagcttgtgtttgcagtgggagctcccctgagccggagccgggccacagggtca gagtga >gi568815591f:129557352_129855178|GENSCAN_predicted_peptide_5|105_aa MRSVQDLGVPFSLHPRLTAGSSPDAYRLFAPSLSAFMDIISSDSQDNPILRVRKLRLGEI EWLAQAIQLLCIEMKRNTNQRCLLNVLGVSCSHHHFIERFLDGKR >gi568815591f:129557352_129855178|GENSCAN_predicted_CDS_5|318_bp atgcgctctgtccaggacctaggagtccctttctccctccaccctcggctcacagcgggc agctcccccgacgcgtaccgcctctttgcccccagcctgagtgctttcatggacatcatc tcatctgattctcaggataaccccattttacgagtgagaaaactgagacttggagagatt gagtggcttgctcaagccatccagctactctgcatcgaaatgaagagaaacacaaaccag cgatgtctcctcaacgtcctgggcgttagctgttcccaccatcacttcatagagaggttt ctggatgggaagcgttag >gi568815591f:129557352_129855178|GENSCAN_predicted_peptide_6|114_aa MRPAAAILSCTGSGHPQPAHALVVAKPQLLLAVGSGLRFPARASDRRQEEGARESPEGGR RSLHSINLRRERPAGPGLDFTLYPKKKNWFRKHFPKFLQQNTVAAGPGSVKGSN >gi568815591f:129557352_129855178|GENSCAN_predicted_CDS_6|345_bp atgaggcctgccgcagccattctcagctgcaccgggtccgggcatccacagccggctcac gcgctggtggtcgccaagcctcagctcttgctggctgttggttcagggcttcggttccca gccagagcaagtgataggagacaggaagagggagctagagagagccctgaaggtggacgt cgcagtcttcactcaatcaatctccgcagggaacgccctgcaggcccaggtctggatttc acgctctaccccaagaaaaagaactggttcagaaaacacttccccaagtttctccaacag aacactgtggctgcagggcctggtagtgtgaagggcagcaactga >gi568815591f:129557352_129855178|GENSCAN_predicted_peptide_7|85_aa MNLTNIFESFLPQLLAYPNPIDPLNGDAAAMYLHRPEEYKQKIKEYIQKYATEEALKEQE EGTGDSSSESSMSDFSEDEAQDMEL >gi568815591f:129557352_129855178|GENSCAN_predicted_CDS_7|258_bp atgaatcttaccaatatatttgagtccttcctgcctcagttattggcctatcctaacccc atagatcctctcaatggtgacgctgcagccatgtacctccaccgaccagaagaatacaag cagaaaattaaagagtacatccagaaatacgccacggaggaggcgctgaaagaacaggaa gagggtaccggggacagctcatcggagagctctatgtctgacttttccgaagatgaggcc caggatatggagttgtag