GENSCAN 1.0 Date run: 5-Nov-116 Time: 03:08:05 Sequence gi568815596f:68265352_68495813 : 230462 bp : 39.88% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 8643 8918 276 2 0 56 42 175 0.523 4.83 1.02 PlyA + 9266 9271 6 1.05 2.03 PlyA - 9411 9406 6 1.05 2.02 Term - 19133 19077 57 2 0 96 47 87 0.608 2.11 2.01 Init - 21348 21316 33 0 0 56 77 47 0.664 0.33 2.00 Prom - 25194 25155 40 -2.45 3.06 PlyA - 25409 25404 6 1.05 3.05 Term - 28675 28511 165 1 0 73 48 104 0.303 1.83 3.04 Intr - 34950 34870 81 0 0 46 85 61 0.262 0.52 3.03 Intr - 51956 51806 151 1 1 35 107 172 0.562 13.04 3.02 Intr - 54115 53871 245 1 2 79 72 324 0.925 25.27 3.01 Init - 56095 56024 72 1 0 118 44 35 0.663 3.22 3.00 Prom - 61965 61926 40 -5.35 4.00 Prom + 62190 62229 40 -3.55 4.01 Init + 76851 76865 15 2 0 79 95 16 0.292 1.29 4.02 Term + 80447 80572 126 1 0 55 40 170 0.704 6.20 4.03 PlyA + 81924 81929 6 1.05 5.00 Prom + 82139 82178 40 -7.05 5.01 Init + 82354 83208 855 0 0 43 25 311 0.397 14.97 5.02 Intr + 85538 85620 83 1 2 102 94 44 0.128 3.92 5.03 Intr + 107959 108064 106 1 1 12 96 82 0.034 0.70 5.04 Intr + 114977 115132 156 1 0 97 115 131 0.993 15.99 5.05 Intr + 115372 115553 182 0 2 57 22 203 0.587 8.34 5.06 Intr + 117191 117282 92 2 2 90 98 32 0.629 3.12 5.07 Intr + 121151 121335 185 0 2 83 41 218 0.517 15.19 5.08 Intr + 123036 123140 105 2 0 51 109 101 0.456 7.99 5.09 Intr + 128756 128825 70 1 1 104 95 47 0.125 4.84 5.10 Term + 130329 130465 137 1 2 127 43 159 0.951 12.50 5.11 PlyA + 131989 131994 6 1.05 6.00 Prom + 132742 132781 40 -8.55 6.01 Sngl + 141160 141708 549 0 0 87 42 321 0.812 23.26 6.02 PlyA + 141768 141773 6 1.05 7.04 PlyA - 145531 145526 6 1.05 7.03 Term - 153781 153612 170 2 2 93 37 107 0.072 3.16 7.02 Intr - 166774 166516 259 1 1 60 7 214 0.147 6.61 7.01 Init - 168567 167803 765 0 0 28 69 339 0.126 18.83 7.00 Prom - 168923 168884 40 -9.25 8.00 Prom + 169219 169258 40 -8.45 8.01 Init + 169617 169697 81 2 0 58 76 105 0.506 7.32 8.02 Term + 180190 180399 210 0 0 74 48 196 0.605 10.51 8.03 PlyA + 180917 180922 6 1.05 9.05 PlyA - 181714 181709 6 1.05 9.04 Term - 182837 182621 217 1 1 93 47 291 0.996 21.13 9.03 Intr - 183033 182849 185 0 2 18 7 214 0.697 3.96 9.02 Intr - 183399 183285 115 2 1 -52 76 253 0.825 9.63 9.01 Init - 183700 183552 149 1 2 76 92 178 0.765 16.61 9.00 Prom - 189778 189739 40 -3.65 10.03 PlyA - 189785 189780 6 1.05 10.02 Term - 190532 190275 258 2 0 29 41 207 0.045 4.67 10.01 Init - 199554 199489 66 0 0 39 110 98 0.685 8.22 10.00 Prom - 200352 200313 40 -6.65 11.00 Prom + 200862 200901 40 -1.25 11.01 Init + 202381 202476 96 0 0 89 83 156 0.967 14.25 11.02 Term + 214529 214594 66 1 0 103 37 105 0.737 3.86 11.03 PlyA + 218406 218411 6 -0.45 12.03 PlyA - 218663 218658 6 1.05 12.02 Term - 220184 220056 129 2 0 54 42 102 0.721 -0.60 12.01 Init - 224448 224404 45 0 0 61 119 54 0.442 6.53 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 91540 91443 98 1 2 79 105 75 0.941 8.20 S.002 Term - 199019 198858 162 2 0 85 49 69 0.853 -0.35 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:68265352_68495813|GENSCAN_predicted_peptide_1|91_aa MGKNPKGIDGEGKILRGLMEKNYVFHQAEKSPEFGPSGIFSGYFLLVVPEPYGPMVPELY AVPGGNCATETQYPEGCSALVPATIDTEDRD >gi568815596f:68265352_68495813|GENSCAN_predicted_CDS_1|276_bp atggggaaaaaccctaaggggattgatggagaaggaaaaatcctaaggggattgatggag aagaattatgtttttcaccaggcagaaaaaagccctgaatttggtccctcaggaatcttc tcagggtattttctgctagtagtaccagagccctatggccctatggtgccagagctctat gcagttccaggagggaactgcgccactgagactcaatacccagaaggctgctcagctctt gttcctgctacaatcgacactgaggatagggattaa >gi568815596f:68265352_68495813|GENSCAN_predicted_peptide_2|29_aa MTIPRETVAQEECLEQRPQEISLTYECEE >gi568815596f:68265352_68495813|GENSCAN_predicted_CDS_2|90_bp atgaccatcccaagggagaccgtggcccaagaggagtgcctggaacaacgaccccaagag atctctcttacatatgaatgtgaagaataa >gi568815596f:68265352_68495813|GENSCAN_predicted_peptide_3|237_aa MPALEPGFKMESSSSRSNALFDTQPLGRSRRCPPPPGAAAPDPRPDMGDLPGLVRLSIAL RIQPNDGPVFYKVDGQRFGQNRTIKLLTGSSYKVEVKIKPSTLQVENISIGGVLVPLELK SKEPDGDRVVYTGTYDTEGVTPTKSGERQPIQITMPSPDWLKLKGLAMPSVGEDIEQLKF SLLFTDIGTFETVWQVKFYNYHKRDHCQWGSPFSVIEYECKPNETRSLMWVNKESFL >gi568815596f:68265352_68495813|GENSCAN_predicted_CDS_3|714_bp atgccggccttagaaccaggttttaaaatggagtcatcatcctccaggtccaatgctctt tttgatacacagccactcggccgcagccggcgctgtcctccgccccccggagccgccgcg ccagaccctcgcccagacatgggggacctgccgggcctcgtgcgcctctccatcgcgctg cgcatccagcctaatgacggcccggtcttttacaaggtggacgggcagcgcttcggccag aaccgcaccatcaagctgctcaccggctcctcctacaaggttgaggtgaagattaaaccc agcacgctgcaggtcgagaatatttccattggtggtgtgcttgtcccactggaactgaag tctaaagagcctgatggggacagagttgtttatacgggtacatatgacacagaaggtgtg accccaacgaagagtggagaacggcaacccatccagatcaccatgccgtcaccagactgg ctaaaattgaaaggactggcaatgccaagtgttggagaggatatagagcaattgaaattc tcattgctgttcacagacattgggaccttcgagacagtgtggcaagtcaagttctacaat taccacaagcgggatcactgccagtggggaagccccttctctgtcattgagtatgaatgc aagcccaacgagacacgcagtctgatgtgggtgaacaaggagtccttcctctga >gi568815596f:68265352_68495813|GENSCAN_predicted_peptide_4|46_aa MNQVKSPGDGAAASRSADGPSSYKAFLVFWSLQCVEATPKQKALSH >gi568815596f:68265352_68495813|GENSCAN_predicted_CDS_4|141_bp atgaaccaggtcaagagcccaggtgatggtgctgctgccagcagaagtgctgatgggcca agctcctacaaagctttcttggtcttctggagccttcagtgtgttgaagccacaccaaag cagaaggcgctttctcattag >gi568815596f:68265352_68495813|GENSCAN_predicted_peptide_5|656_aa MNIDAKILDKILANRIQQHIKKLTYQDQVSFIPGKQAWLNICKSINVIHHINRTKEKKHM IFSINAEKAFDKIQHRFMLKTLNKPAIDGTYLKIIRAIYDKSTANIILNGQKLEAFPVKT STRQGCPLSPFLFNIVLEVLARAIRQEKEIMGIHVGREEVKLPLFADDLILCLENPIISA QKPLKLINNFSKVSGYKINVQKSQAFLYTNNRQAESQIMNEFPFTIAAKRIKYPGIQLTR DVKDLFKENYKPLLNEIREDTYKWKNIPSSWIERINIMKMTILPKALVKISFNLHNDPRG GYDCSLQQISFHSISLFIVHNDPESQMTHPPQSQTPEAALDLEKHNVKGSVFNTWKPMWV VLLEDGIEFYKKKSDNSPKGMIPLKGSTLTSPCQDFGKRMFVFKITTTKQQDHFFQAAFL EERDAWVRDIKKAIKCIEGGQKFARKSTRRSIRLPETIDLGALYLSMKDTEKGIKELNLE KDKKIFNHCFTGNCVIDWLVSNQSVRNRQEGLMIASSLLNEGYLQPAGDMSKSAVDGTAE NPFLDNPDAFYYFPDSGFFCEENSSDDDVILKEEFRGVIIKQGCLLKQAEDPLGAIHLRG CVVTSVESNSNGRKSEEENLFEIITADEVHYFLQAATPKERTEWIRAIQMASRTGK >gi568815596f:68265352_68495813|GENSCAN_predicted_CDS_5|1971_bp atgaacattgatgcgaaaatactcgataaaatactggcaaaccgaatccagcagcacatc aaaaaacttacctaccaagatcaagtcagcttcatccctgggaaacaagcctggctcaac atatgcaaatcaataaatgtaatccatcacataaacagaaccaaagaaaaaaaacacatg attttctcaataaatgcagaaaaggcctttgataaaattcaacatcgcttcatgttaaaa actctcaataaaccagctattgatggaacatatctcaaaataataagagctatttatgac aaatccacagcaaatatcatactgaatgggcaaaagcttgaagcatttcctgtgaaaacc agcacaagacaaggatgccctctctcaccattcctattcaacatagtattggaagttctg gccagggcaatcaggcaagagaaagaaataatgggcattcacgtaggaagagaggaagta aaattgcctctgtttgcagatgacttgattctatgtttagaaaatcccatcatctcagcc caaaaaccccttaagttgataaacaactttagcaaagtctcaggatacaaaatcaatgtg caaaaatcacaagcattcctttacaccaacaatagacaagcagagagccaaatcatgaat gaattcccattcacaattgctgcaaagagaataaaatacccaggaatacagctaacaagg gatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataagagaggac acatacaaatggaaaaacattccatcctcatggatagaaagaatcaatatcatgaaaatg accatactgcccaaagcattagtcaagatctcatttaatcttcacaacgaccctagagga ggctacgattgctctctgcaacagatctcatttcacagtatcagcctcttcattgtgcat aatgaccctgaatcccaaatgacacatcctcctcagagtcagaccccggaggctgcactt gatttagaaaagcacaacgtgaaggggagcgtgttcaatacgtggaaacccatgtgggtt gtattgttagaagatggaattgaattctataagaagaaaagtgacaacagccccaaagga atgatcccgctgaaagggagcactctgactagcccttgtcaagactttggcaaaaggatg tttgtgtttaagatcactacgaccaaacagcaggaccacttcttccaggcagccttcctg gaggagagagatgcctgggttcgggatatcaagaaggccattaaatgcattgaaggaggc cagaaatttgccaggaaatctaccaggaggtccattcgactgccagaaaccattgactta ggtgccttatatttgtccatgaaagacactgaaaaaggaataaaagaactgaatctagag aaggacaagaagatttttaatcactgcttcacaggtaactgcgtcattgattggctggta tccaaccagtctgttaggaatcgccaggaaggcctcatgattgcttcatcgctgctcaat gaggggtatctgcagcctgctggagacatgtccaagagtgcagtggatggaactgctgaa aaccctttcctggacaaccctgatgccttctactactttccagacagtgggttcttctgt gaagagaattccagtgatgatgatgtgattctgaaagaagaattcagaggggtcattatc aagcagggatgtttactgaagcaggcagaagatcccctgggagcaattcacttgagaggc tgtgtggtgacttcagtggagagcaactcaaatggcaggaagagtgaggaagagaacctt tttgagatcatcacagcagatgaagtgcactatttcttgcaagcagccacccccaaggag cgcacagagtggatcagagccatccagatggcctcccgaactgggaagtaa >gi568815596f:68265352_68495813|GENSCAN_predicted_peptide_6|182_aa MELKNTAQELCEAYTSINSQIDQAEERIAEIKDKINEIKCEDKIREKRMKRNEQSLQEIW DYVKRPNLCLIGVPGSDRENGTRLENTLQDIIQDNFPNLARQANIQIQEIQRPPQRYSLR RATPRHIIIRLTKVEMKEKMLRAAREKGQVTHKWKPIKLTADLSAETLQARREWGPIFDI LK >gi568815596f:68265352_68495813|GENSCAN_predicted_CDS_6|549_bp atggagctgaaaaacacagcacaagaactttgtgaagcatacacaagtatcaatagccaa atcgatcaagcagaagaaaggatagcagagattaaagataaaattaatgaaataaagtgt gaagacaagattagggagaaaagaatgaaaaggaatgaacaaagcctccaagaaatatgg gactatgtgaaaagaccaaacctatgtttgattggtgtacctggaagtgacagggagaat ggaaccagattggaaaacactcttcaggatattatccaggacaacttccccaacctagca agacaggccaacattcaaattcaggaaatacagagacctccacaaagatactccttgaga agagcaaccccaagacatataatcatcagattgaccaaggttgaaatgaaggaaaaaatg ttaagggcagccagagagaaaggtcaggttacccacaaatggaagcccatcaaactaaca gctgatctctctgcagaaaccctgcaagccagaagagagtgggggccaatattcgacatt cttaaataa >gi568815596f:68265352_68495813|GENSCAN_predicted_peptide_7|397_aa MVAVGLLEGLLPSCCLQWGDASGGIRSGFRGSSGSSGPLCPVSPRRPTVPTPLSYSWAGP TPRPVVSTRASALLPAASQEPMSTQPQAQLGLTGLPLEHQVHLCRSWLGSLPNLHLIHCC YRENVGRRCGQDCALHGAHRSWEQARAPPFPSWQGRAPQRQLQPPKLQQQPGHPCALRSQ KQAGAPPSQAQRQLPKLWLQIQASLHSQRPEKACPLPLQDRKCLLPLPGFSLLPAPAPIL EQRRGQAQVLSQPSQAKRGREELRPFGDPRPGCSLSQGCDSLFGALQLLESASFQAPEHL PVPAMEAACGAPSPDAALQGASTHTSTWSCLPHSSSWHAQLSSENRVVIPFVEIIPLLFR QSSLAGFSPDPQGLILLVCNGKPSLRHPLPWQCRSTG >gi568815596f:68265352_68495813|GENSCAN_predicted_CDS_7|1194_bp atggtggcggtgggacttctggaggggttgctgccatcatgctgcctgcagtggggagat gccagtggtggcatcaggagtggcttcaggggcagcagtggcagcagtgggcccttgtgc cctgtgtccccaagacggccaactgtaccaaccccactctcatacagctgggcaggaccc actcccaggcctgtagtctccaccagggcctcagccttgctccctgctgcatcccaggaa cccatgagcacccagccacaggcacagctgggactcacagggctgcccctggagcatcag gttcatttgtgcaggagttggctggggtcgctgcccaacctgcacctcatccactgctgc tacagggaaaatgtaggaaggaggtgtggccaggactgtgcactccatggagcacacagg agctgggaacaagcaagagccccgccctttccgagttggcagggcagagcaccccagagg caactgcagccacccaagctgcagcagcaaccagggcacccctgtgctctcaggagccag aagcaggcaggagccccaccatcccaggcacagcggcagctgcccaagctgtggctgcag atccaagcatctctgcactctcagaggcctgagaaggcatgcccccttcccctgcaggac aggaaatgcctgcttccactgcctggcttctccctgctgccagcacctgctccaatcttg gagcaaaggcggggccaagctcaggtgctgtcacagcccagccaggcaaaaagagggaga gaagagctgcggccctttggggatcccaggcctgggtgttccctgagccagggctgtgac tccctctttggagccctgcagcttctggagtctgcaagcttccaggccccagagcatttg ccagtgccagctatggaagctgcttgtggtgcacctagtccagatgcagccttgcaggga gccagcacccacaccagcacctggagttgcttgccccacagcagcagctggcatgcccag ctatcctctgagaatcgggtggtcatcccctttgttgagatcatacccctgctgttccgg cagtcctctttggcaggattttcaccagacccacagggccttatcctgcttgtctgcaat gggaagccctcactcaggcatcctttaccctggcaatgcaggagcacaggctga >gi568815596f:68265352_68495813|GENSCAN_predicted_peptide_8|96_aa MSLPEEHQDQYGKSTVGMGDGEKKGLRCAQRSNQNNKHARRLRSPSCKTGFLAGPMSSQF LNQVHRLCKRHLTELPATLKRVWALPTWEPLATGTS >gi568815596f:68265352_68495813|GENSCAN_predicted_CDS_8|291_bp atgagtctgcctgaggagcatcaagatcagtatggcaaaagcacagtgggcatgggagat ggggagaaaaagggcctgaggtgtgcgcaaaggtcaaatcaaaataacaaacacgcacga cgtcttcgatcaccttcctgtaaaaccgggtttctagccggacccatgtcctcacagttc ctgaatcaagttcaccgtctgtgcaagcgccacctgacagagctccctgcgacactgaaa cgcgtttgggcacttcccacgtgggaaccactcgccacgggcacctcctga >gi568815596f:68265352_68495813|GENSCAN_predicted_peptide_9|221_aa MTASSSMTAVLENKSQENKGDDEPPLVQGSGAIVGPPSPSLAAVFFNRWHHTDAADWSCS TFLSMLLDVAVHPDDHFVLTADRDEKIQENRVVLLSTCIAVGYIFQFHAGRQKLVYRQQL SFQHRVWDVAFEETQGLSVLQDCLEAPLPWPVGGQWQSVPESAMLKKVSSVLCGNRAMLE GSACVNPSFSSLYKANFDNITFYLKKKEERLQQQLKKQQRS >gi568815596f:68265352_68495813|GENSCAN_predicted_CDS_9|666_bp atgacagcctcttcatctatgactgcagtgcttgaaaacaagtcacaagaaaataaaggg gacgacgagccgcccttggttcaggggagcggtgcaattgtcggtccaccttctccaagt ctggcagctgttttctttaaccgatggcaccacacggatgcggccgactggagctgcagc acctttctgtccatgctattagacgtggctgtgcatcctgatgaccacttcgtcctcact gcggaccgggacgagaagatccaggagaaccgggtggtgcttctgagcacctgcattgct gtgggatacatcttccagtttcacgccggtagacagaagctggtgtacaggcagcagctg tcattccagcatcgagtgtgggatgtagcttttgaggaaacccaggggctgtcggtgctc caggactgcctggaagcccccctcccctggcctgtgggtggccagtggcagtctgttcct gaaagtgccatgttaaagaaagtctccagtgttctttgtgggaaccgggccatgctggaa ggctctgcctgcgtgaaccccagctttagcagtctctacaaggccaactttgacaacata accttctacctgaagaagaaagaggagagactgcagcagcagctgaagaagcagcagcgc agttga >gi568815596f:68265352_68495813|GENSCAN_predicted_peptide_10|107_aa MTVRAVCRREIDDDLESGYSWRWDMGYLSEFPPPHEIAVEVAENTAKEPHSRSRGIESLN QEPNRGCCSVFRVTVATVQIEVGQSQLKEDSYTSIPDCAKKTDCKLQ >gi568815596f:68265352_68495813|GENSCAN_predicted_CDS_10|324_bp atgactgtaagagctgtgtgccgaagagaaatagatgatgatctagaaagtggttattcc tggaggtgggacatgggatatctgagtgagttccccccaccccatgagatagctgtagaa gtagctgagaacacagcaaaggagcctcacagccgaagccgtgggattgagagcctaaac caagagcccaatagaggttgctgcagtgtctttagagtaacagtagccacagtgcaaatt gaggtgggccagagtcagctgaaggaagactcctatacctcaatccccgattgtgctaag aaaactgactgcaaattacagtaa >gi568815596f:68265352_68495813|GENSCAN_predicted_peptide_11|53_aa MSGGFELQPRDGGPRVALAPGETVIGRGPLLGPTQGEDNEDEDLYDDPLPFNE >gi568815596f:68265352_68495813|GENSCAN_predicted_CDS_11|162_bp atgtccgggggcttcgagctgcagccgcgggacggcggtccccgggtggccctggcgccc ggggagacggtgatcggccgcgggccgctgctgggacctactcaaggtgaagacaatgaa gatgaagacctatatgatgatccacttccatttaatgaataa >gi568815596f:68265352_68495813|GENSCAN_predicted_peptide_12|57_aa MIKVLKGRDPMLEDLITDKGNGRLTAKSGRYHLTKVITLTTSPQEDKLIDVLPNVVL >gi568815596f:68265352_68495813|GENSCAN_predicted_CDS_12|174_bp atgataaaggttttaaaaggaagagatcccatgctcgaggatctgattactgacaaagga aatggtagattaactgcgaaatctggcagataccaccttaccaaggtgattacacttacc acatcaccacaggaggacaaactgatagatgtgcttcctaatgtggtactctga