GENSCAN 1.0 Date run: 4-Nov-116 Time: 12:18:22

Sequence gi568815593f:156750563_156951177 : 200615 bp : 39.99% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   7019   7142  124  2  1   71   78   150 0.598  11.97
 1.02 Term +   8655   8828  174  2  0  114   48    66 0.916   1.98
 1.03 PlyA +   9015   9020    6                               1.05

 2.00 Prom +  17141  17180   40                              -4.75
 2.01 Init +  17699  17812  114  1  0   96   82    50 0.818   5.46
 2.02 Intr +  27303  27377   75  2  0   94   67   122 0.325   9.49
 2.03 Intr +  29131  29235  105  0  0   72   83    32 0.019   0.59
 2.04 Intr +  33351  33509  159  2  0   54   72   123 0.024   6.56
 2.05 Term +  34213  34356  144  0  0   48   39   115 0.920  -0.47
 2.06 PlyA +  34569  34574    6                              -0.45

 3.00 Prom +  34697  34736   40                              -6.15
 3.01 Sngl +  36613  37569  957  0  0   60   43   327 0.962  21.78
 3.02 PlyA +  37629  37634    6                              -0.45

 4.00 Prom +  37913  37952   40                              -3.65
 4.01 Init +  39282  39481  200  2  2   62   86    85 0.058   4.12
 4.02 Intr +  52163  52269  107  2  2   72   36   143 0.130   6.44
 4.03 Intr +  56400  56737  338  1  2   33  116   333 0.140  24.91
 4.04 Intr +  63169  63279  111  0  0   56   97    44 0.066   1.76
 4.05 Term +  66741  66920  180  2  0   20   38   125 0.008  -2.67
 4.06 PlyA +  67393  67398    6                               1.05

 5.04 PlyA -  68871  68866    6                               1.05
 5.03 Term -  69883  69653  231  1  0   68   55   130 0.084   3.19
 5.02 Intr -  79546  79461   86  2  2   73   78    65 0.049   2.62
 5.01 Init -  81561  81498   64  0  1   54   98    34 0.393   2.37
 5.00 Prom -  85862  85823   40                              -4.85

 6.00 Prom +  99335  99374   40                              -3.85
 6.01 Sngl + 100001 100618  618  1  0   91   43   917 0.816  83.44
 6.02 PlyA + 101241 101246    6                               1.05

 7.00 Prom + 102238 102277   40                              -5.75
 7.01 Init + 110430 110518   89  2  2   83   41   108 0.505   5.76
 7.02 Intr + 127961 128158  198  2  0  -40   42   247 0.271   4.84
 7.03 Term + 128282 128444  163  2  1   30   47   185 0.378   5.03
 7.04 PlyA + 128997 129002    6                               1.05

 8.00 Prom + 133630 133669   40                              -5.15
 8.01 Init + 139166 139307  142  1  1   52   98   137 0.822  11.44
 8.02 Term + 159939 160090  152  1  2   93   49    62 0.040  -0.11
 8.03 PlyA + 160580 160585    6                              -0.45

 9.06 PlyA - 161484 161479    6                               1.05
 9.05 Term - 163854 163637  218  1  2   69   54   117 0.278   2.72
 9.04 Intr - 171654 171537  118  0  1   78   94    77 0.716   6.42
 9.03 Intr - 175750 175701   50  2  2  115   98    49 0.983   6.08
 9.02 Intr - 177225 177178   48  1  0   97   81    55 0.253   3.43
 9.01 Intr - 199169 199089   81  0  0  124   75    81 0.857   9.09

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  33285  33509  225  2  0   88   72   169 0.920  13.82

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593f:156750563_156951177|GENSCAN_predicted_peptide_1|99_aa
XLESPTRSLVMEAPKGVEINAEAGNMEATCRTELRLESKDGEIKLDAAKIRLPRLPHGSY
TPTGTRQKVFEICVCANGRLFLSQAGAGSTCQINTSVCL
>gi568815593f:156750563_156951177|GENSCAN_predicted_CDS_1|300_bp
nngttggagtccccaacccggtctctagtgatggaggccccaaaaggagtggaaatcaat
gcagaagctggcaatatggaagccacctgcaggacagagctgagactggaatccaaagat
ggagagattaagttagatgctgcgaaaatcaggctacctagactgcctcatggatcctac
acgcctacaggaacgaggcagaaggtcttcgagatctgcgtctgcgccaatgggagatta
ttcctgtctcaggcaggagctgggtccacttgtcagataaacacaagtgtctgcctctga
>gi568815593f:156750563_156951177|GENSCAN_predicted_peptide_2|198_aa
MGETIDERRHCGRTTWIHSPQLILHFEHLISPGLPQRQTVLINKKALTRRSPLTALDFSA
PVTVKRSSPPTAEIRHSLITCSWKDTEFPPHCSGERALEHNSFPAREQNWIEHDFEELTE
VGFRRSVITNFSELKEHVLTHRKEAKNLEKKACLTRGPEGSTKHGKKQLVPATAKTWQIV
KTMDVMKKLHKLTGKISS
>gi568815593f:156750563_156951177|GENSCAN_predicted_CDS_2|597_bp
atgggtgagacaatcgatgagagaaggcattgtggcagaaccacttggatccactcacca
caattaattcttcattttgagcatctcatctctcctggtcttccacaacgacagacagtc
ctcatcaacaagaaggctctcaccagacgcagccccttgaccgccttggacttctcagca
cccgtaactgtaaagagaagctccccgcctacagcagagattagacacagcctcattacc
tgctcctggaaggatacagaatttcctccacattgctcaggagaaagagccttagaacac
aactccttcccagcaagagaacaaaactggatagagcatgactttgaagagttgacagaa
gtaggcttcagaaggtcagtaataacaaacttctctgagctaaaggagcatgttctaacc
catcgcaaggaagctaaaaaccttgaaaaaaaggcctgccttacaagaggtcctgaagga
agcactaaacatggaaagaaacaactggtaccagcgactgcaaaaacatggcaaattgta
aagactatggacgttatgaagaaactgcataaattaacgggcaaaataagcagctag
>gi568815593f:156750563_156951177|GENSCAN_predicted_peptide_3|318_aa
MSELTFTIATKRIKYLGIQLTRDVKDLFKENYKPLLNKIKEDKNKWKNIPCSWAGRINIV
KMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIGNQKRAHIAKTILSKKNKAGGITLP
DFKLYYKATVTKTAWYWYQNRYIDQWNRTEASEITPHIYNLLIFDKPDRNKQWEKDSLFN
KWCWENWLATGRKMKVDPFLIPFTKINSRWIKDLNVRNKTIKALEENLGYTIQDVGMGKD
LMTKTPKAMATKAKIDKWDLIKLKSFCTAKESIIRVDRQPTEWEKMFAIYPSDKGLISRI
YKGHKEIYKKKTNNLIKK
>gi568815593f:156750563_156951177|GENSCAN_predicted_CDS_3|957_bp
atgagtgaactcacattcacaattgctacaaagagaataaaatacctaggaatccaactt
acaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaacaagataaaa
gaagacaaaaacaaatggaagaacattccatgctcatgggcaggaagaatcaatatagtg
aaaatggccatactgcccaaggtaatttatagattcaacgccatccccatcaagctacca
atgactttcttcacagaattggaaaaaacaactttaaagttcatagggaaccaaaaaaga
gcccacattgccaagacaatcctaagcaaaaagaacaaagctggaggcatcacgctacct
gacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaac
agatatatagaccaatggaacagaacagaggcctcagaaataacaccacacatctacaac
cttctgatctttgacaaacctgacagaaacaagcaatgggaaaaggattccctatttaat
aaatggtgctgggaaaactggctagcaacaggtagaaagatgaaagttgatcccttcctt
atacctttcacaaaaattaactcaagatggattaaagacttaaatgtaagaaataaaacc
ataaaagccctagaagaaaacctaggctataccattcaggatgtaggcatgggcaaggac
ctcatgactaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatcta
attaaattaaagagcttctgcacagcaaaagaaagtatcatcagagtggacaggcaacct
acagaatgggagaaaatgtttgcaatctatccatctgacaaagggctaatatccagaatc
tacaaaggacacaaagaaatttacaagaaaaaaacaaacaacctcatcaaaaagtag
>gi568815593f:156750563_156951177|GENSCAN_predicted_peptide_4|311_aa
MNELSFTITTKRIKYLGMQLKRNVKDFFKENYKPLLNEIKEDKNKWKNIPCSWIKRINIV
KVATLPSWLTSLVEEQSSGQHPYNVMTISYQKCAGTDNRIQKDCSSSPATEQSWMENDFD
ELTQVDFRRLVITNFSELNEDVQTHRGEAKNLKKRLDERLTRINSVKTLNDLMELKNTAR
ELHDACTRFNSRFDQVEERVSVIEDQINEIQQENKESLQTLWASINGLKSVGVSNAKNLF
FLPAEFSPMKLLPPESPKIQRDSSAATWWKIYGQQKESDIQKMEVRYRNNWFGYSSVFAL
FEHGSNNVGYI
>gi568815593f:156750563_156951177|GENSCAN_predicted_CDS_4|936_bp
atgaatgaactctcattcacaattaccacaaagagaataaaatacctaggaatgcaactt
aaaaggaatgtgaaggacttcttcaaggagaactacaaaccactgctcaatgaaataaaa
gaggacaaaaacaagtggaagaacattccatgctcatggataaaaagaatcaatatagtg
aaggtggccacactgccaagctggctgacatctctggttgaggagcaatcatccgggcag
catccttacaatgtaatgacaatcagttaccaaaaatgtgcaggcactgataacaggatt
cagaaggattgcagctcctcgccagcaacggaacaaagctggatggagaatgactttgat
gagctgacacaagtagacttcagaaggttggtaataacaaacttctccgagctaaatgag
gatgttcaaacccatcgtggggaagctaaaaaccttaaaaaaagattagatgaaaggcta
actagaataaacagcgtaaagaccttaaatgacctgatggagctgaaaaacacagcacga
gaacttcatgacgcatgcacaagattcaatagccgattcgatcaagtagaagaaagggta
tcagtgattgaagatcaaattaatgaaatacagcaagaaaacaaggaaagcctgcagacc
ttgtgggctagtataaatggattgaagtctgtaggagtttcaaatgcaaagaacttgttc
ttcttacctgctgagttttcccccatgaaacttttgcccccagaatcaccgaagattcag
agagactccagtgcagccacatggtggaagatttatggacagcaaaaggaaagtgatata
cagaaaatggaagtgagatacagaaacaactggtttggttacagctccgtgtttgccttg
tttgaacatggttcaaataacgttggctacatttga
>gi568815593f:156750563_156951177|GENSCAN_predicted_peptide_5|126_aa
MPVFALCPGDPRPGVACGFWEEWSSIVMMEKGVPVIIFTLQGPGGWVWCEFSAHIFREAP
VLLTFPSEHPETSKQMAAGGLGGTEGSSRGNQWQEAPATTAGHQASSCGKLWERHKMLPL
GYPYEC
>gi568815593f:156750563_156951177|GENSCAN_predicted_CDS_5|381_bp
atgccagtttttgccctgtgtcctggagacccacgacctggcgtggcatgtgggttttgg
gaagaatggagctccattgtcatgatggaaaagggtgttccagtcatcatattcacgttg
caaggaccaggtggatgggtctggtgcgagttctcagctcacatcttcagagaggcccct
gtcttactcacgttcccctcggagcaccctgagacaagcaagcagatggcagcaggtggc
ttgggaggcacagaaggcagtagcaggggaaatcagtggcaagaggcaccagccaccacc
gcaggccaccaagcttcatcctgtgggaagctctgggaaaggcataaaatgctgcctctg
ggttacccctatgagtgctga
>gi568815593f:156750563_156951177|GENSCAN_predicted_peptide_6|205_aa
MAASTASHRPIKGILKNKTSTTSSMVASAEQPRRSVDEELSKKSQKWDEINILATYHPAD
KGYGLMKIDEPSPPYHSMMGDDEDACRDTETTEAMAPDILAKKLAAAEGLEPKYRIQEQE
SSGEEDSDLSPEEREKKRQFEMRRKLHYNEGLNIKLARQLISKDLHDDDEDEEMLETADG
ESMNTEESNQGSTPSDQQQNKLRSS
>gi568815593f:156750563_156951177|GENSCAN_predicted_CDS_6|618_bp
atggcggcctcgacggcctcccaccggcccatcaaggggatcttgaagaacaagacctct
acgacttcctctatggtggcgtcggccgaacagccccgcaggagtgtcgacgaggagctg
agcaaaaaatcccagaagtgggatgaaattaacatcttggcgacctatcatccagcagac
aaaggctatggtttaatgaaaatagatgaaccaagccctccttaccatagtatgatgggt
gatgatgaagatgcgtgtagggacaccgagaccactgaagccatggcgccagacatccta
gccaagaaattagctgctgctgaaggcttggagccaaagtaccggattcaggaacaagaa
agcagtggagaggaggatagtgacctctcacctgaagaacgagaaaaaaagcgacaattt
gaaatgagaaggaagcttcactacaatgaaggactcaatatcaaactagccagacaatta
atttcaaaagacctacatgatgatgatgaagatgaagaaatgttagagactgcagatgga
gaaagcatgaatacggaagaatcaaatcaaggatctactccaagtgaccaacagcaaaac
aaattacgaagttcatag
>gi568815593f:156750563_156951177|GENSCAN_predicted_peptide_7|149_aa
MATQNVMCSQQQQQQQQNPRDCSKCKILAGNETAQCCGELQGQWCGHDGQSPPRPPVIEA
RLSKTDSKPGERSKKLDGAVNLPQDMQQRVSDLHEDLIVFRDVKFSPSTRAASQRRASCG
SPPLVNLKRLKGDAPGDKKLSLQELTNSG
>gi568815593f:156750563_156951177|GENSCAN_predicted_CDS_7|450_bp
atggctactcagaatgtaatgtgtagccagcagcagcagcagcagcagcaaaatccaaga
gattgttcaaaatgcaaaatcttagccggaaatgaaacagcccagtgttgtggagaattg
cagggccagtggtgtggtcatgatggacagtctcctccaaggccacctgtgattgaagcc
cgattgtcaaaaacagacagtaaaccaggagagagaagcaagaagttagatggtgctgtc
aacttgccacaggatatgcagcaaagggtatcggacttgcatgaagatctcattgtcttc
cgggatgtgaagttctctcctagtaccagggctgcaagccagagaagagccagctgtggg
tccccacctctagtgaacctcaagagactcaaaggggatgctccaggagacaagaagctc
tctcttcaggagctgaccaacagtgggtag
>gi568815593f:156750563_156951177|GENSCAN_predicted_peptide_8|97_aa
MWEVGYEKAWKETLDNWTLSLKHAGFQVPVSVRVEMASRLLEILREPGTSEIIQIIGIMK
WVIKIEMALQTVSCLGSGTGTLLFLTMPQLPTGPFGT
>gi568815593f:156750563_156951177|GENSCAN_predicted_CDS_8|294_bp
atgtgggaggtcggatatgagaaagcctggaaggagacgctagacaactggacgctgtct
ttgaaacatgctggcttccaagtgccagtgagtgtgagggtggaaatggctagcaggctg
ttggaaatactcagagaaccaggcacatcagaaattatccaaataattggaatcatgaaa
tgggtgataaagatagaaatggctttgcagactgtaagctgcttaggatcagggactggg
actctcttgttcctcacaatgccccagctccccacagggccctttggcacatag
>gi568815593f:156750563_156951177|GENSCAN_predicted_peptide_9|171_aa
XSETVLPSDSWSSVESTSADTVLLTSKVLVDSSGHGQVSQVGTASDTAVPEQNKTTKTGQ
MDGIPMSMKNEMPISQLLMIIAPSLGFVLFALFVAFLLREEAVKGEPGRKLNEARWDTFK
WECVELTHEELKPAVSRLNERLTGIQQWVAGETLCLSKENRKLRSGLPGWR
>gi568815593f:156750563_156951177|GENSCAN_predicted_CDS_9|516_bp
naatcagaaactgtcctccccagtgattcctggagtagtgttgagtctacttctgctgac
actgtcctgctgacatccaaagtcttagtggacagctcaggccatggccaggtgtctcaa
gtaggaactgcatctgatacagcagttcctgagcagaacaaaacaacaaaaacaggacag
atggatggaatacccatgtcaatgaagaatgaaatgcccatctcccaactactgatgatc
atcgccccctccttgggatttgtgctcttcgcattgtttgtggcgtttctcctgagagag
gaagctgtaaagggagaacctggcaggaaacttaatgaagccaggtgggacacatttaaa
tgggaatgtgttgaactaactcatgaagagctaaagccagctgtcagcaggcttaatgaa
aggttaacaggaatccagcaatgggtggctggggagactctttgtctctctaaagaaaac
aggaagttgagaagtgggctccctgggtggaggtga