GENSCAN 1.0    Date run: 8-Nov-116    Time: 15:04:16

Sequence gi568815593f:121751964_121952689 : 200726 bp : 35.74% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   1376   1426   51  1  0   82   55    53 0.763   2.61
 1.02 Intr +   6700   6870  171  0  0   69   99   117 0.560  10.12
 1.03 Intr +  12103  12229  127  0  1   70   98    32 0.079   1.73
 1.04 Term +  19975  20078  104  2  2  110   44    42 0.397  -0.54
 1.05 PlyA +  20471  20476    6                               1.05

 2.00 Prom +  30099  30138   40                              -3.05
 2.01 Init +  34308  34337   30  2  0   76  116    21 0.899   3.49
 2.02 Term +  40994  41116  123  1  0   66   49   147 0.988   6.00
 2.03 PlyA +  41345  41350    6                               1.05

 3.00 Prom +  53344  53383   40                              -3.65
 3.01 Init +  53463  53549   87  2  0   78   26    67 0.264   0.19
 3.02 Term +  61336  61512  177  0  0   79   46   130 0.411   4.60
 3.03 PlyA +  62304  62309    6                               1.05

 4.07 PlyA -  62549  62544    6                               1.05
 4.06 Term -  66459  66253  207  0  0   62   39   117 0.299   0.56
 4.05 Intr -  68623  68512  112  0  1   86   11   101 0.328   1.66
 4.04 Intr -  76524  76356  169  1  1   79   88   124 0.293   9.58
 4.03 Intr -  83898  83805   94  0  1   69   85    62 0.195   2.62
 4.02 Intr -  90932  90871   62  0  2   37  119    27 0.679  -1.77
 4.01 Init -  92297  92222   76  2  1   71   94    65 0.905   6.70
 4.00 Prom -  96046  96007   40                              -7.05

 5.00 Prom +  98693  98732   40                              -7.75
 5.01 Sngl + 100001 100729  729  1  0   71   46   748 0.997  62.87
 5.02 PlyA + 100847 100852    6                               1.05

 6.05 PlyA - 102597 102592    6                               1.05
 6.04 Term - 106190 105843  348  2  0   73   55   229 0.164  11.60
 6.03 Intr - 108288 108194   95  1  2   69   12   154 0.240   4.66
 6.02 Intr - 115891 115779  113  0  2   15  100   141 0.296   7.10
 6.01 Init - 120415 120342   74  1  2   10   78   112 0.303   2.99
 6.00 Prom - 122045 122006   40                              -8.35

 7.04 PlyA - 122207 122202    6                               1.05
 7.03 Term - 124308 124183  126  0  0   94   47   121 0.931   5.90
 7.02 Intr - 138543 138404  140  1  2  107   42   163 0.904  12.86
 7.01 Init - 138665 138587   79  2  1   66   40    64 0.623   0.68
 7.00 Prom - 144389 144350   40                              -3.05

 8.00 Prom + 151417 151456   40                              -5.75
 8.01 Sngl + 155390 156091  702  1  0   77   41   224 0.507  12.56
 8.02 PlyA + 156600 156605    6                               1.05

 9.03 PlyA - 158601 158596    6                               1.05
 9.02 Term - 164955 164827  129  0  0   74   48   124 0.779   4.20
 9.01 Init - 166452 166354   99  0  0   49  116    58 0.470   5.01
 9.00 Prom - 173172 173133   40                              -3.95

10.03 PlyA - 173926 173921    6                               1.05
10.02 Term - 198352 198290   63  1  0  101   48    37 0.618  -2.09
10.01 Intr - 199754 199420  335  0  2   61   86   197 0.316  11.17

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl +  45845  46102  258  1  0   99   41   131 0.808   4.41

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593f:121751964_121952689|GENSCAN_predicted_peptide_1|150_aa
MASKRSSEYVNEDVSKERRQKNVFILAANSFAETWASYDWGVKGELTLEQQSPIPEQWTG
SVPWPMKNWAAQQELNRGIRTSAPMGLTDAGEDFFPEYKGVGGSLNPHYVPEAQGSVSGV
VQNSTSYLLHSVIYHPLVFEKEKGNIPQLS

>gi568815593f:121751964_121952689|GENSCAN_predicted_CDS_1|453_bp
atggccagtaaaaggtcttctgagtatgtcaatgaggatgtttcaaaagagcgaaggcag
aaaaatgtttttattctggcagccaacagctttgctgaaacatgggcatcctatgactgg
ggggtgaagggagaattgacattagaacagcagtccccaatccctgagcaatggactggt
agcgttccatggcctatgaagaattgggctgcacagcaggagctaaataggggcatccgt
acgtctgcccctatgggacttacagatgcgggggaggatttttttccagaatataaaggt
gtagggggatctcttaatccccactatgttccagaagcacagggctcagtttctggtgtt
gtccagaacagtaccagttacttattacattcagtgatttatcacccattggtatttgaa
aaggaaaaggggaacatcccccagctttcttag

>gi568815593f:121751964_121952689|GENSCAN_predicted_peptide_2|50_aa
MTSQYTADTKPYRESPNQLTKEEKFELGSDMGQLSMRMQITNGQQLHYPH

>gi568815593f:121751964_121952689|GENSCAN_predicted_CDS_2|153_bp
atgacttcacagtacaccgcagacaccaagccatatagggaatctcctaaccagctgact
aaagaagaaaagtttgagcttggttcagacatgggtcagcttagcatgcgaatgcaaatc
acaaacggacagcagttacactaccctcactaa

>gi568815593f:121751964_121952689|GENSCAN_predicted_peptide_3|87_aa
MEYYAAVKKDEFMSFAGTWMKQEPIILSKGCCCLLGVHFRPYSSDSLLHLEMSLKEAEEQ
QGWMSASSSGIPELEGPQPDANRIILV

>gi568815593f:121751964_121952689|GENSCAN_predicted_CDS_3|264_bp
atggaatactatgcagccgtaaaaaaggatgagttcatgtcctttgcagggacatggatg
aagcaggaacccatcattctcagcaaaggctgctgttgtttgctgggggttcacttcagg
ccctattcatctgattcgcttttacacctggagatgtcactcaaggaggccgaagaacag
caaggatggatgtctgcttcttcttctgggatccctgaacttgaggggccccaacctgat
gccaacaggattattcttgtatag

>gi568815593f:121751964_121952689|GENSCAN_predicted_peptide_4|239_aa
MWIDFEEKHCPTGTSLVGASALSLTGHEGPFKFHYGEISDFRIIKLNTTDRVIYNKQKLI
DFTVVEAGKSNARVLATEPTQMRRNQKTNPGDMTKQGSLTPPPNPPNTQNHTSSTAMDQS
QEEIPDLPEKEFRRFVEAKYRNEQQQKEKKVKMMLSNGKKILNIFDNDRSQLNDTFGQTL
QQTVSHPHPLSPPEVHLQEECQWLPVSLLETLSASRITLYPQMCQTSSTEELTTLEQSL

>gi568815593f:121751964_121952689|GENSCAN_predicted_CDS_4|720_bp
atgtggatcgattttgaggaaaagcattgtcctactggcactagccttgttggtgcctct
gcattgtctctgacaggacatgagggtccctttaaatttcactatggggaaatttctgat
tttcgtatcattaagctgaacaccacagaccgggtaatttataataaacagaagttgatt
gactttacagttgtggaggctgggaagtccaatgccagggtgctagcaactgagcctacc
caaatgagaaggaaccagaaaaccaaccctggtgatatgacaaaacaaggctctttaaca
ccgccccccaacccccccaacacacaaaatcatactagctcaacagcaatggatcaaagc
caagaagaaatccctgatttacctgaaaaagaattcaggagatttgtggaagctaaatat
agaaatgaacaacaacagaaagagaaaaaagttaaaatgatgcttagcaatggcaagaag
attttgaatatatttgataatgacagatcacaactcaatgacacattcggtcagacacta
cagcagacagtgtcacacccacatcccctcagcccacctgaagttcacctgcaggaagaa
tgccaatggctacctgtgtctctgcttgaaacattatctgcttctaggattacgctctac
ccacaaatgtgccagaccagcagtactgaagagttaacaactctggagcaatctctttaa

>gi568815593f:121751964_121952689|GENSCAN_predicted_peptide_5|242_aa
MLSCFRLLSRHISPSLASLRPVRCCFALPLRWAPGRPLDPRQIAPRRPLAAAASSRDPTG
PAAGPSRVRQNFHPDSEAAINRQINLELYASYVYLSMAYYFSRDDVALNNFSRYFLHQSR
EETEHAEKLMRLQNQRGGRIRLQDIKKPEQDDWESGLHAMECALLLEKNVNQSLLELHAL
ASDKGDPHLCDFLETYYLNEQVKSIKELGDHVHNLVKMGAPDAGLAEYLFDTHTLGNENK
QN

>gi568815593f:121751964_121952689|GENSCAN_predicted_CDS_5|729_bp
atgctgtcctgcttcaggctcctctccaggcacatcagcccttcgctggcgtctctgcgc
ccggtgcgctgctgcttcgcgctcccgctgcgttgggccccggggcgccccttggacccc
aggcagatcgccccccgccgccccctggccgcagccgcctcctcccgggaccctaccggg
cccgccgccggcccctctcgggtgcgccagaacttccaccccgactccgaggctgccatc
aaccgccagatcaacctcgagctctatgcgtcctacgtgtacttgtccatggcctattac
ttctcccgggatgacgtggccttgaacaacttctccaggtatttccttcaccagtcccgg
gaggagaccgagcacgcggagaagctgatgaggctgcagaaccagcgaggaggccggatc
cgcctgcaggacatcaagaagccggaacaggacgactgggaaagcgggctgcatgccatg
gagtgtgctctactcttggaaaagaacgtgaaccagtcgttgctggaattgcacgctcta
gcctcagataaaggtgacccccatttgtgcgatttcctggaaacctactacctgaatgag
caggtgaagtctatcaaagaactaggtgaccacgtgcacaacttagtgaagatgggggcc
ccggatgctggcctggcggagtacctttttgacacacatacccttggaaatgaaaacaag
cagaactaa

>gi568815593f:121751964_121952689|GENSCAN_predicted_peptide_6|209_aa
MLQNLDSPRLLEEPRIHDEEGKLPLLSQSDSLESGSETEIGIQGVNGGVAEGSLCQPLKG
NEDNKEEQEYSEVAEEVTEHVYLLAKAKAAKEGELIENAQLRFLTDEQLVTLFTQLQTAV
RSRMHPFYITHIRAHTPLPGPLTKGNQTADRLVATAISNARHFRNVTHVNASGFKRRYSI
TWKEAKAIIQRCPTCQMVHSSSFTGGVNP

>gi568815593f:121751964_121952689|GENSCAN_predicted_CDS_6|630_bp
atgctgcagaaccttgactcaccacgtttactggaggagccacgaatccatgatgaggaa
ggcaagttgcctttactctcacagtcagactccctggaaagcggttctgagacagagatt
ggcattcaaggagtgaatgggggagtggcagagggctccttgtgtcaaccactgaaggga
aatgaagataacaaggaggaacaagagtatagcgaagtagcagaagaggttacagagcat
gtttatttgctggctaaagctaaagcggcaaaggaaggagagttaattgaaaatgctcag
ttacgatttcttacagatgaacaactggtgactttatttacccaattgcaaacagcagtt
aggagtagaatgcatcctttttacatcactcacattagagctcatacacctcttccagga
cctttaactaaagggaatcaaacggctgatcgcctagttgctactgcaatatctaatgct
agacactttcgcaatgtaactcatgttaatgcctctggtttcaaacgtagatacagcatt
acctggaaagaagctaaagctattatccagcgatgcccaacttgccaaatggtacattcc
tcatcttttacaggaggagttaatccttga

>gi568815593f:121751964_121952689|GENSCAN_predicted_peptide_7|114_aa
MLPSIGLWTVELKELISALTSPLGLPGDTQLGLAAGPAQTLLLCRWSGQPDPALTCSCAP
SHKRLSMAGRVDRCEFFPGVNLLDVEHDRTNTGEQQSSKQMIVLHTKEVGMDIG

>gi568815593f:121751964_121952689|GENSCAN_predicted_CDS_7|345_bp
atgctaccatccatcgggctgtggacagtggagctaaaagagctaattagcgcactaaca
tcccctctggggcttccaggtgacacgcagcttggtctggctgcaggccccgcacagacc
ttgctgctatgtcgatggagtggccagccagatcccgcactcacttgttcctgtgctcct
tcccacaagaggctaagcatggcaggccgagtagacaggtgtgaattcttccctggagtg
aacctcttggatgtggaacatgacagaaccaatactggtgaacaacagtcctccaagcaa
atgatagtgctacatacaaaggaagttggaatggatattggttaa

>gi568815593f:121751964_121952689|GENSCAN_predicted_peptide_8|233_aa
MTILPKAIYKFNAIPVKISSSFFTELQKTILKFKWNKKGAHMAKVTLSRKNKSGGITLPN
FKLYYKAIVTKTAWYWYKTGHTDQWTRIEKLEKKPSTYTQLIFNKANKNIKQGKDTLFNK
WCWDNWQATCRRMKLDPHLSHHTKINSKWIKDLNLRPETIKILEDSIGKTLLDIGLGKDF
MTKNPKANATKAKINRWDLIKLKSFCTAKEIITRVNRQPTEWVKIFANYASNK

>gi568815593f:121751964_121952689|GENSCAN_predicted_CDS_8|702_bp
atgaccatactgccaaaagcaatctataaattcaatgcaattcctgtcaaaatatcatca
tcattcttcacagaactacaaaaaacaattctaaaattcaaatggaataaaaaaggagcc
cacatggccaaagtaacattaagcagaaagaacaaatctggaggcatcacattacccaac
ttcaaactatactacaaggctatagttaccaaaacagcatggtactggtataaaaccgga
cacacagaccaatggaccagaatagagaaacttgaaaaaaagccaagtacttatacacaa
ctgatctttaacaaagcaaacaaaaacataaagcagggaaaggacaccctattcaacaaa
tggtgctgggataattggcaagccacatgtagaagaatgaagttggatcctcatctctca
catcatacaaaaatcaactcaaaatggatcaaagacttaaatctaagaccagaaaccata
aaaattctagaagatagcattggaaaaactcttttagacattggcttaggcaaagacttc
atgaccaagaatccgaaagcaaatgcaacaaaagcaaagataaatagatgggacttaatt
aagctaaaaagcttctgcacagcaaaagaaataatcaccagagtaaacagacaacccaca
gagtgggtgaaaatatttgcaaactatgcatccaacaaataa

>gi568815593f:121751964_121952689|GENSCAN_predicted_peptide_9|75_aa
MVYSSAKTTRRRMVFQEKCGVLLLKAGGVHTGKAFLRRNLRRFLIWHMKRLTIEEATQAV
DTADKARYEAFAGLV

>gi568815593f:121751964_121952689|GENSCAN_predicted_CDS_9|228_bp
atggtttatagttctgctaaaaccacaaggaggagaatggttttccaagagaagtgtggg
gtgctgttactaaaagctggtggggtgcacactgggaaggctttcctgagacggaatctg
agaagattcctcatatggcatatgaagaggctgaccatagaagaagcaactcaggcagtg
gatacagcagacaaggcaagatacgaagcgtttgcaggactggtgtga

>gi568815593f:121751964_121952689|GENSCAN_predicted_peptide_10|132_aa
XNPIVSAQNLLKLISNFSKVSGYKISVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKY
LGIQLTRDVKDLFEENYKPLLNEIKEDTNKWKNIPCSWVGRINIVKMAILPKAFSPYVFL
LYRSCQKLMGHY

>gi568815593f:121751964_121952689|GENSCAN_predicted_CDS_10|399_bp
naaaaccccatcgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtc
tcaggatacaaaatcagtgtacaaaaatcacaagcattcttatacaccaacaacagacaa
acagagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatac
ctaggaatccaacttacaagggatgtgaaggacctcttcgaggagaactacaaaccattg
ctcaatgaaataaaagaggatacaaacaaatggaagaacattccatgctcatgggtagga
agaatcaatatcgtgaaaatggccatactgcccaaggcattttccccatatgtgttctta
ttgtatcgcagctgtcagaaacttatgggccattactaa