GENSCAN 1.0 Date run: 4-Nov-116 Time: 02:46:33 Sequence gi568815595f:186800198_187000554 : 200357 bp : 44.93% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 575 570 6 1.05 1.04 Term - 2603 2544 60 2 0 111 42 34 0.128 -1.10 1.03 Intr - 6340 6228 113 2 2 -37 97 110 0.243 -0.50 1.02 Intr - 6787 6551 237 2 0 41 115 139 0.886 9.49 1.01 Init - 10612 10543 70 1 1 81 105 61 0.911 8.31 1.00 Prom - 15317 15278 40 -5.06 2.00 Prom + 15629 15668 40 -4.16 2.01 Init + 17435 17493 59 1 2 51 77 69 0.394 2.88 2.02 Intr + 24784 25013 230 1 2 72 38 107 0.291 1.61 2.03 Intr + 26355 26425 71 1 2 71 105 120 0.151 10.90 2.04 Intr + 26850 26947 98 2 2 48 92 64 0.712 1.61 2.05 Term + 27020 27365 346 2 1 88 39 167 0.889 5.77 2.06 PlyA + 29068 29073 6 1.05 3.00 Prom + 50458 50497 40 -5.96 3.01 Init + 52841 53075 235 1 1 46 110 203 0.749 14.60 3.02 Term + 53987 54507 521 0 2 80 43 645 0.999 53.66 3.03 PlyA + 56659 56664 6 1.05 4.04 PlyA - 56685 56680 6 1.05 4.03 Term - 74470 74376 95 2 2 86 49 54 0.273 -0.71 4.02 Intr - 89830 89446 385 1 1 33 76 457 0.615 33.42 4.01 Init - 93782 93702 81 2 0 38 98 22 0.630 -0.83 4.00 Prom - 98838 98799 40 -3.96 5.00 Prom + 99075 99114 40 -9.46 5.01 Sngl + 100001 100360 360 1 0 110 41 300 0.989 23.57 5.02 PlyA + 100373 100378 6 1.05 6.04 PlyA - 100418 100413 6 1.05 6.03 Term - 106143 105875 269 1 2 44 37 183 0.017 4.46 6.02 Intr - 115679 115512 168 0 0 72 52 62 0.006 0.92 6.01 Init - 122430 122121 310 0 1 65 42 351 0.315 25.68 6.00 Prom - 127047 127008 40 -2.56 7.03 PlyA - 131666 131661 6 1.05 7.02 Term - 142414 142241 174 1 0 107 48 70 0.499 2.66 7.01 Init - 151174 151091 84 1 0 86 55 82 0.634 3.52 7.00 Prom - 154764 154725 40 -1.86 8.06 PlyA - 154881 154876 6 1.05 8.05 Term - 160074 160021 54 0 0 116 36 57 0.484 0.86 8.04 Intr - 175510 175325 186 1 0 52 96 36 0.021 0.69 8.03 Intr - 187474 187401 74 2 2 131 91 -4 0.577 3.33 8.02 Intr - 196416 196377 40 0 1 89 97 6 0.479 -0.60 8.01 Init - 196758 196579 180 0 0 76 75 84 0.442 5.08 8.00 Prom - 198178 198139 40 -2.76 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:186800198_187000554|GENSCAN_predicted_peptide_1|159_aa MAPGCSASGEGLMLPYDMEEKQKDTIRSKSHVSSRGQGKASSDVSALKAHAFCSATKQDK EDRDLHLRRPAPTKIAGSGPSVDRQGSRGPRAAAPPSLLSLPAPEVTRYVPAYSYRRIPA LLFARRSVAGTEATVGTSVIVFEYESLLYNIYWIPKRRH >gi568815595f:186800198_187000554|GENSCAN_predicted_CDS_1|480_bp atggcgccaggctgttcagcttctggtgagggcctcatgctgccttatgacatggaggag aagcagaaagacactattagaagtaaaagccacgtgtcctcaagagggcaaggcaaagca tcttcagatgtctctgcccttaaagcacacgcgttctgctctgcgacgaagcaggacaag gaggacagggacctgcacctccggaggcccgcacctacgaagatagcgggctcgggacct tcggtggaccggcagggttccagaggcccgcgcgccgccgccccgccctcattgctgagc ctgccagcaccggaagtgacgcgttacgtgcccgcgtattcctaccggcgtattcccgcc ctgcttttcgcccgccgttccgtggcgggaactgaggcgactgtggggacatcagtgatc gtatttgaatatgagtccctgctttataacatatactggattcccaagagaagacattaa >gi568815595f:186800198_187000554|GENSCAN_predicted_peptide_2|267_aa MVSDANSEGSKNSAGIYSGCPGIIRSTEAMDTQGMVKVVAPSVLPFLPHRAFSKIDGHSS VLTAPGIRGPQGWMTSLKRNLSGFPAAAPKSRLEDEGCSSREFSSMDTFEGYEGAEDMEK LDDSHETCSSKSAFLERRESTQGEKEERDEEATATGSTKKERAGASTPADSCGGDGWFPG EGAAPEDLHPGELPSLRSPRLPGSRAPAFPREAMWGPGPCPATQKAAGAAAAAERGRPGP PPGSPRGALRARLAPAARRRTSRDGDP >gi568815595f:186800198_187000554|GENSCAN_predicted_CDS_2|804_bp atggtgagtgatgcaaacagcgagggctctaagaattcagcagggatctacagtggatgc cccggcatcatccggagcacagaggccatggacactcaggggatggtcaaggtcgtagct ccctccgttcttccgttcctccctcacagagctttcagcaagatagatggtcattcttct gtcctcactgctcccggcataaggggaccccagggctggatgacttccctaaagaggaat ctctcagggtttcctgcagccgctcccaagtcccgtctggaggatgaaggctgctccagc agggagtttagctccatggacacctttgaaggctatgaaggtgcagaagacatggaaaag ttggatgacagtcacgaaacctgttcctccaaatccgctttcctggagaggagagaaagc acccagggagaaaaagaagaacgcgatgaggaagcaacggcgaccggcagcaccaagaag gagcgtgctggggcgtccacgccggctgactcctgcgggggcgacggctggtttccaggc gagggcgcggcgcccgaggacctccaccccggagagctgccctccctgcggtcgccccgg ctccccggctccagagcgcccgcattcccgagagaggcgatgtgggggcccgggccctgc ccagctacgcagaaagcagccggggccgcggcggcggcagaaaggggacgcccagggccg cctcccgggagcccgaggggtgccctgcgtgctcgtctagctcccgccgcccggcgaagg acctcgcgggacggggacccctag >gi568815595f:186800198_187000554|GENSCAN_predicted_peptide_3|251_aa MWIPGLRMLLLGAVLLLLALPGHDQETTTQGPGVLLPLPKGACTGWMAGIPGHPGHNGAP GRDGRDGTPGEKGEKGDPGLIGPKGDIGETGVPGAEGPRGFPGIQGRKGEPGEGAYVYRS AFSVGLETYVTIPNMPIRFTKIFYNQQNHYDGSTGKFHCNIPGLYYFAYHITVYMKDVKV SLFKKDKAMLFTYDQYQENNVDQASGSVLLHLEVGDQVWLQVYGEGERNGLYADNDNDST FTGFLLYHDTN >gi568815595f:186800198_187000554|GENSCAN_predicted_CDS_3|756_bp atgtggattccagggctcaggatgctgttgctgggagctgttctactgctattagctctg cccggtcatgaccaggaaaccacgactcaagggcccggagtcctgcttcccctgcccaag ggggcctgcacaggttggatggcgggcatcccagggcatccgggccataatggggcccca ggccgtgatggcagagatggcacccctggtgagaagggtgagaaaggagatccaggtctt attggtcctaagggagacatcggtgaaaccggagtacccggggctgaaggtccccgaggc tttccgggaatccaaggcaggaaaggagaacctggagaaggtgcctatgtataccgctca gcattcagtgtgggattggagacttacgttactatccccaacatgcccattcgctttacc aagatcttctacaatcagcaaaaccactatgatggctccactggtaaattccactgcaac attcctgggctgtactactttgcctaccacatcacagtctatatgaaggatgtgaaggtc agcctcttcaagaaggacaaggctatgctcttcacctatgatcagtaccaggaaaataat gtggaccaggcctccggctctgtgctcctgcatctggaggtgggcgaccaagtctggctc caggtgtatggggaaggagagcgtaatggactctatgctgataatgacaatgactccacc ttcacaggctttcttctctaccatgacaccaactga >gi568815595f:186800198_187000554|GENSCAN_predicted_peptide_4|186_aa MCQGHKTIVGRGLADKHVNKGLCIIDKPGRQSETVSQKKKKKEREKEKRKRKRRNKKKKE EKEKERKRKRRKKKKKEEKEKEKEKRKKKERERERKRKRKKRKKKKKRKKKKKKKKKKII PHIFLNLKPPRFVLCRVGSISLLVSDNGDVNEMIIDMATIRFLNLFPTFPPFPFHKTAIV IMARSQ >gi568815595f:186800198_187000554|GENSCAN_predicted_CDS_4|561_bp atgtgtcagggtcacaagacaatagtggggagagggttagcagacaaacacgtgaacaaa ggtctttgcatcatagacaagcctgggcgacagagcgagactgtgtctcaaaaaaaaaaa aagaaggagagggagaaggagaagaggaagaggaagaggaggaacaagaagaagaaggaa gagaaggagaaggagaggaagaggaagaggaggaagaagaagaagaaggaggagaaggaa aaggagaaggagaagaggaagaagaaggagagggagagggagaggaagaggaagaggaag aagaggaagaagaagaagaagaggaagaagaagaagaagaagaagaagaagaagattatt cctcacatcttcctcaacctcaagcctcccagatttgtcctatgcagagtgggcagcatc tccctcttggtgagtgacaatggagatgtcaatgagatgatcattgacatggcaaccatc cgatttctcaatcttttccccacctttcccccctttccattccacaaaaccgccattgtc atcatggcccgttctcaatga >gi568815595f:186800198_187000554|GENSCAN_predicted_peptide_5|119_aa MAFKDTGKAPVEPEVAIHRIRITLTSRSVKSLEKVGADLIRGAKAKNLKVKGGVRMPSKT LRITTRKTPCGEGSKTWDRFQMRIHKRLFDLHSPSEIVKQITSISTEPGVEVEVTIADA >gi568815595f:186800198_187000554|GENSCAN_predicted_CDS_5|360_bp atggcttttaaggataccggaaaagcacccgtggagccggaggtggcaattcaccgaatt cgaatcaccctaacaagccgcagcgtaaaatccttggaaaaggtgggtgctgacttgatc agaggcgcaaaagcaaagaatctcaaagtgaaaggaggagttcgaatgccttccaagact ttgagaatcactacaagaaaaactccttgtggtgaaggttctaagacgtgggatcgtttc cagatgagaattcacaagcgactctttgacttgcacagtccttctgagattgttaagcag attacttccatcagtactgagccaggagttgaggtggaagtcaccattgcagatgcttaa >gi568815595f:186800198_187000554|GENSCAN_predicted_peptide_6|248_aa MCFAKKHNKKSLKKMQANSAEAMSAHAEAIRALVKPKEAKPKIPKDVSHKLNRLAYIPHP KLGKRARARVAKGLRLCRPKAKAKDQTKVQAAAPASTPAQAPKDYKAEALEEETTCTRKL LDKDGKMTLDDIISLLPTFNTMNSKSKSNRKKVPSISAFEIQATIREYYKHHCANKLENL EEMDKFLDTYTLSRLNQEEVESLNRPITGSEIEAVINNLPTKKNPESDGFTAEFYQRYNE ELVPFLLK >gi568815595f:186800198_187000554|GENSCAN_predicted_CDS_6|747_bp atgtgctttgccaagaagcacaacaagaagagcctaaagaagatgcaggccaacagtgcc gaggccatgagtgcacatgccgaggctatcagggcccttgtgaagcccaaggaggctaag cccaagatcccaaaggatgtcagccacaagctcaatcgacttgcctacattccccacccc aagcttgggaagcgtgctcgtgcccgcgttgccaaggggctcaggctgtgccggccaaag gccaaggccaaggaccaaaccaaggtccaggctgcagctccagcttcaactccagctcag gctcccaaagattataaagctgaggctctagaagaagaaaccacttgcacaagaaaactt ctggataaagatggcaaaatgactctagatgacataatctctctcctcccaacatttaac acaatgaacagcaaaagtaaaagcaacagaaaaaaagtaccatcaatttccgcttttgaa atacaagctaccatcagagaatactataaacaccactgtgcaaataaactagaaaatcta gaagaaatggataaattcctggacacatacaccctttcaagactaaaccaggaagaagtc gagtccctgaatagaccaataacaggctctgaaattgaggcagtaattaataatctacca accaaaaaaaatccagaatcagacggattcacagccgaattctaccagaggtacaacgag gagctggtaccattccttttgaaatga >gi568815595f:186800198_187000554|GENSCAN_predicted_peptide_7|85_aa MAKNYGLAQWLMPVIPALWEADVGGLPEERKGSALNTVGALDSSTEDHREVVKGTAFLKL KLEARHCTKENSINQQTDADLQRLQ >gi568815595f:186800198_187000554|GENSCAN_predicted_CDS_7|258_bp atggccaagaattacggcctggcgcagtggctcatgcctgtaatcccagcactttgggag gccgatgtgggcggattgcctgaggaaagaaagggatcagccttaaacacggtgggtgcc ctcgattccagcacagaggaccatcgagaggttgtaaagggtacagcattcctaaaatta aagctggaagctcggcactgtactaaggaaaactccatcaaccaacagacggatgcagat ttacaaagactgcaatga >gi568815595f:186800198_187000554|GENSCAN_predicted_peptide_8|177_aa MNKLKVEMALETNWQYKVESAGYQCSRDPSREGQEAGDTYVVANRKDSRVQTFAKQESRK VLQLTHDPLKQVCRFSHLRKILRTFNSTSAKNIFRFPQCSVSLHLFRWILSFLLGKRSRT RGKKWMGEYKEAMEEHTVGKFSCGQAEGDWLNSKRQMARQTQKHLDDTEDDSKEEGG >gi568815595f:186800198_187000554|GENSCAN_predicted_CDS_8|534_bp atgaacaagctcaaagtggagatggccttggaaacaaactggcaatacaaggtggaaagt gcaggataccagtgtagcagggacccaagcagagaggggcaagaagcgggggacacgtat gtagtggccaacaggaaggactctagagtgcagacatttgcaaaacaggagagcagaaag gttctccagctcacccatgatccactcaagcaggtgtgccgtttctcccatctccgtaag atcttgaggaccttcaacagcacttcagcaaaaaacattttcaggtttcctcagtgttct gtttctctccatctctttcgctggatcctctccttcctcctgggcaagagaagtagaaca agagggaagaagtggatgggagagtacaaagaggccatggaagaacacactgtggggaag ttcagttgtggacaggcagagggagactggttgaacagcaaaaggcagatggccaggcag acccagaagcatctagatgacacagaagatgattctaaggaagagggtggctaa