GENSCAN 1.0	Date run: 5-Nov-116	Time: 18:28:11

Sequence gi568815596f:62952991_63156313 : 203323 bp : 37.78% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  15876  15996  121  2  1   35   54   140 0.499   5.70
 1.02 Intr +  16797  16834   38  1  2   84   64    25 0.391  -3.24
 1.03 Intr +  26201  26345  145  1  1   21  105   214 0.576  15.33
 1.04 Intr +  34941  35048  108  1  0   80   48   145 0.995   9.04
 1.05 Intr +  37726  37850  125  2  2   61  106   151 0.998  13.58
 1.06 Intr +  40540  40678  139  0  1   68   75   142 0.975  10.02
 1.07 Intr +  40881  40978   98  1  2   90  -27    88 0.674  -3.69
 1.08 Intr +  43653  43776  124  2  1   87   96   158 0.758  15.74
 1.09 Intr +  84545  84644  100  0  1   14   84    82 0.043  -1.35
 1.10 Intr +  85753  85826   74  1  2   42  116   136 0.411   9.93
 1.11 Intr +  92076  92190  115  1  1   90   73   243 0.901  21.59
 1.12 Term +  92420  92510   91  2  1   62   42   118 0.860   0.81
 1.13 PlyA +  93469  93474    6                               1.05

 2.02 PlyA -  94309  94304    6                               1.05
 2.01 Sngl -  95154  94675  480  0  0   91   55   333 0.815  26.13
 2.00 Prom -  96425  96386   40                              -9.15

 3.00 Prom +  96709  96748   40                             -12.72
 3.01 Init +  96851  96935   85  1  1   64    9   103 0.482   1.03
 3.02 Intr +  97040  97162  123  0  0   -3  111   110 0.546   3.84
 3.03 Intr +  97756  97837   82  2  1   49   53   108 0.632   1.18
 3.04 Intr +  99224  99281   58  2  1   64   64    68 0.460  -0.03
 3.05 Intr +  99890 100097  208  1  1   91   88    98 0.825   7.83
 3.06 Intr + 101057 101208  152  0  2   80   85   222 0.994  20.06
 3.07 Intr + 101551 101649   99  0  0  119    7    63 0.616   0.69
 3.08 Intr + 102511 102733  223  0  1  107   47   358 0.990  30.38
 3.09 Intr + 104261 104392  132  0  0  106   59    90 0.905   7.60
 3.10 Intr + 106308 106469  162  1  0   63   75   113 0.479   6.53
 3.11 Term + 107080 107228  149  2  2   69   45   138 0.985   4.68
 3.12 PlyA + 107862 107867    6                               1.05

 4.05 PlyA - 109276 109271    6                               1.05
 4.04 Term - 117307 117194  114  1  0   89   38   147 0.592   7.39
 4.03 Intr - 119352 119205  148  2  1   40   92    38 0.058  -1.28
 4.02 Intr - 137169 135343 1827  2  0  -13   53   441 0.077  16.87
 4.01 Init - 137442 137279  164  0  2   68   72   123 0.215   7.95
 4.00 Prom - 138393 138354   40                              -6.15

 5.03 PlyA - 138563 138558    6                               1.05
 5.02 Term - 153556 153398  159  1  0   37   53   148 0.046   3.16
 5.01 Init - 180569 180492   78  2  0   79  100    40 0.465   5.41
 5.00 Prom - 184695 184656   40                              -3.05

 6.04 PlyA - 184812 184807    6                               1.05
 6.03 Term - 188067 187649  419  1  2   16   36   279 0.040   9.95
 6.02 Intr - 190341 190116  226  0  1   77    3    98 0.043  -3.16
 6.01 Intr - 191896 191745  152  2  2   64   88   129 0.346   9.46

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl - 188083 187649  435  1  0   25   36   303 0.953  14.82

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:62952991_63156313|GENSCAN_predicted_peptide_1|425_aa
MCMTGSAFVGPDLRSQRAFSSFLPTKEQPDNQQEALDVTQEKQRSGHPQRNEQQDEERRR
QLRERARQLIAEARSGVKMSELPSYGEMAAEKLKERSKASGDENDNIEIDTNEEIPEGFV
VGGGDELTNLENDLDTPEQNSKLVDLKLKKLLEVQPQVANSPSSAAQKAVTESSEQDMKS
GTEDLRTERLQKTTERFRNPVVFSKDSTVRKTQLQSFSQYIENRPEMKRQRSIQEDTKKG
NEEKAAITETQRKPSEDEKGFKDTSQYVVGELAALENEQKQIDTRAALVEKRLRYLMDTG
RNTEEEEAMMQEWFMLVNKKNALIRRMNQLSLLEKEHDLERRYELLNRELRAMLAIEDWQ
KTEAQKRREQLLLDELVALVNKRDALVRDLDAQEKQAEEEDEHLERTLEQNKGKMAKKEE
KCVLQ
>gi568815596f:62952991_63156313|GENSCAN_predicted_CDS_1|1278_bp
atgtgcatgactggcagtgcatttgttgggccagatctaaggagccagagagcgttcagc
tcctttttgcctacaaaagagcagccagacaaccagcaagaggccttggatgtaacacaa
gagaagcagagatcaggccatccccagagaaatgagcagcaagatgaagagcgacgtcgg
cagctgagagagagagctcgtcagctaatagcagaagctcgatctggagtgaagatgtca
gaacttcccagctatggtgaaatggctgcagaaaagttgaaagaaaggtcaaaggcatct
ggagatgaaaatgataatattgagatagatactaacgaggagatccctgaaggctttgtt
gtaggaggtggagatgaacttactaacttagaaaatgaccttgatactcccgaacaaaac
agtaagttggtggacttgaagctgaagaagctcctagaagttcagccacaggtggcaaat
tcaccctccagtgctgcccagaaagctgtaactgagagctcagagcaggacatgaaaagt
ggcacagaagatctccggactgaacgattacaaaaaacaacagaacgttttagaaatcct
gttgtgttcagcaaagattctacagtcagaaaaactcaacttcagtctttcagccaatat
attgagaatagaccagagatgaaaaggcagagatcaatacaggaagatacaaagaaagga
aatgaggagaaggcagcgataactgaaactcagaggaagccatcagaagatgaaaaaggg
ttcaaagacaccagtcagtatgtagtaggagaattggcagcactagagaatgagcaaaag
caaattgacacccgtgccgcgctggtggagaagcgccttcgctatctcatggacacagga
aggaacacagaagaagaagaagctatgatgcaggaatggtttatgttagttaataagaaa
aatgccttaataaggagaatgaatcagctctctcttctggaaaaagaacatgatttagaa
cgacggtatgagctgctgaaccgggaattgagggcaatgctagccattgaagactggcag
aagaccgaggcccagaagcgacgcgaacagcttctgctagatgagctggtggccctggtg
aacaagcgcgatgcgctcgtcagggacctggacgcgcaggagaagcaggccgaagaagaa
gatgagcatttggagcgaactctggagcaaaacaaaggcaagatggccaagaaagaggag
aaatgtgttcttcagtag
>gi568815596f:62952991_63156313|GENSCAN_predicted_peptide_2|159_aa
MTVPASAAVREGEVGTPLEEVKPSSSRPSSREGARELGVETRIQHPGDLQARPGSGGHLG
QGEGFGSEAEGSDQGRKFCSGPQTVPKQARCQRPCSSATLGPIVSFPAARAFEFRVLRRA
GLVPVLGSQRLGTTQRCPDAAPLPRALPRQAPESTLHRR
>gi568815596f:62952991_63156313|GENSCAN_predicted_CDS_2|480_bp
atgaccgttccagcatcggccgcggtgcgggaaggggaagtggggacaccactggaggag
gtgaagccgtccagctcaaggccctcgagccgggaaggggcgagggagctgggtgtggag
acaaggatccagcatccaggtgacctccaggcccggccaggctctggaggacacttggga
caaggggagggctttggatctgaggccgagggatctgaccaggggcggaagttctgctcc
ggcccgcagactgtgcccaagcaagcccgctgccaacgcccctgctcctctgcgaccctc
ggaccgatagtctcttttcccgcagcgcgggctttcgagttccgggtcctccggcgtgct
gggctcgtgcccgttctcgggagccagcgtctgggcaccacgcagcgatgtccggatgca
gcgcccttgccgcgggccttaccccgccaggcgccggagtccacgctccaccgccgctga
>gi568815596f:62952991_63156313|GENSCAN_predicted_peptide_3|490_aa
MGFARTNNPGKSPQGGSLRSMWKKVEGREVGCTCEGKRTLGSNVSVEALKTCKHPWWGPG
TPGDADEQAARAGPPIGGLAPLRTRLAFTYPGRAVESGPFGVLWPSAKPGPVTAVEARPP
DASDPEGLRGGSPAPLLAPGPLDPSGRLHPAVSMMSYLKQPPYGMNGLGLAGPAMDLLHP
SVGYPATPRKQRRERTTFTRSQLDVLEALFAKTRYPDIFMREEVALKINLPESRVQDLAV
SLRGLNSASALVFHPGNPRNGRAEIGWSLVWFKNRRAKCRQQQQSGSGTKSRPAKKKSSP
VRESSGSESSGQFTPPAVSSSASSSSSASSSSANPAAAAAAGLVVAKLPCPLHIFSLCVF
IEENRLVSGSWARDIRSVEETDKSGYRGPLWLRQQLPMPLEARGSLRRALRGPAVLLAAK
WNETRRPGFALVWTTVWPPEQGSLFGLGLGVRDTIGAIRIEATSGDLATRENLATFRRQQ
FSFFEWSTKT
>gi568815596f:62952991_63156313|GENSCAN_predicted_CDS_3|1473_bp
atgggcttcgcccgtacgaacaatccggggaaatcgcctcaaggaggatccttacgcagc
atgtggaaaaaagttgagggcagggaagtagggtgtacgtgtgaggggaagaggacgctg
ggctccaacgtttcagtagaagcgcttaagacttgcaaacacccttggtggggacctgga
accccgggagatgccgacgagcaagcagcccgcgcggggcccccgattggcggcctagcc
ccgttacgcactcgcctcgcgttcacatacccggggagggcagtagaaagtggccccttc
ggcgtcctctggccctctgcgaagcctgggcccgttactgcggttgaggccaggccccca
gacgcatcagaccctgaaggactgcgtggtgggagccctgcaccgctcctggccccgggc
cccctggatccgtcggggcgcctccacccagctgttagcatgatgtcttacctcaaacaa
cccccatacggcatgaacgggctgggcctggccgggcccgccatggacctcctgcaccca
tccgtgggctatccggccactccgcggaagcagcggcgggagcgcaccaccttcacgcgt
tcacagctggacgtgctcgaggcgctcttcgccaagactcgctaccctgacatcttcatg
cgggaggaggtggcgctcaagatcaacctgccggagtctagagtccaggacttggcagtt
tcccttagaggcctgaattctgcctctgcccttgtcttccaccccgggaatcctagaaat
ggaagagcggagattggctggagcctggtctggttcaagaaccgccgcgccaaatgccgc
cagcagcagcagagcgggagcggaaccaagagccgcccagccaagaagaagtcctctcca
gtgcgggagagctcgggctccgaaagcagtggccaattcacgccgccagctgtgtccagc
tctgcctcgtcctctagctcggcgtccagctcttccgccaacccagcggctgcagcggct
gcgggactagttgttgcaaagctgccctgccctcttcacatcttctccctctgtgtattt
attgaagagaaccgcttggtttcaggaagctgggcgcgggatatccgaagtgtggaggaa
acagacaagtcagggtacagagggccgctttggctcaggcagcagctgccaatgccgctc
gaggcccgcggctccctgaggcgagctctgcggggcccagcagtcctcttggctgctaaa
tggaacgaaaccaggagacccggctttgcactcgtttggacaactgtgtggcctccggag
caaggaagtttgttcggtttgggtttgggtgttcgtgacacaatcggagcaattagaatt
gaagcaacttcaggagatttagcaacccgcgaaaatttagcaacgttcagacgtcaacag
ttttctttctttgagtggtcaacgaaaacttga
>gi568815596f:62952991_63156313|GENSCAN_predicted_peptide_4|750_aa
MGKFLDTHTLARLNQEEIESLNRPITGSEIVAIINTLPTKKSPGPDGFTAKFYQSRAETQ
PKKNFRPISLMNIDAKILNKILANQIQQHIKKLIHHDQVGFIPGMQGWFNICKSINVIQH
INRTKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGTDGTYLKIIRAIYDKPTANIILNG
QKLEAFPLKTGTRQGCPLSPLLFNIMLEVLARAISQEKEIKGIQLGKEEVKLSLFADDMI
VYLDNPVVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMRELPFTIASKR
IKYLGIQLTRDVKDLFKENYEPLLNEIKEDTNKWKNVPCSWVGRINIVKMAIVPKVIYRF
NVIPILLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKATVTK
TAWYWYQNRDIYQWNRTQPSEITPHIYTYLIFDKPEKNKQWGKDSLFNKWCWENWLAICR
KLKLDPFLTPYTKINSRWIKDLNVIPKTIKTLEENLGITIQDIGMGQDFMSKTPKAMATK
AKIDKWDLIKLKSFCTAKETTIRVNRQPTKWEKIFATYSSDKGLISRMYNELKQIYKKKT
NNSIKKWAKDMNRHFSKEDIYAAKKHMKKCSPSLAIREMQIKTTMRYHLTPVRMVIIKKS
GNNSLGDKARPCLKKRKEKKMFLTVGTKSLEEKAEGIRGTSSSLVKEYLKVSKILPALEY
WTPSSSVLELGLALLAPQPADGRLWDLVIL
>gi568815596f:62952991_63156313|GENSCAN_predicted_CDS_4|2253_bp
atgggtaaattcctcgacacacacaccctcgcaagactaaaccaggaagaaattgaatct
ctgaatagaccaataacaggctctgaaattgtggcaataatcaataccttaccaaccaaa
aagagtccaggaccagatggattcacagccaaattctaccagagccgggcagagacacaa
ccaaaaaagaactttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaa
atactggcaaaccaaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggc
ttcatccctgggatgcaaggctggttcaatatatgcaaatcaataaatgtaatccagcat
ataaacagaaccaaagacaaaaaccacatgattatctcaatagatgcagaaaaggccttt
gacaaaattcaacaacccttcatgctaaaaactctcaataaattaggtactgatgggaca
tatctcaaaataataagagctatctatgacaaacccacagccaatatcatactgaatggg
caaaaactggaagcattccctttgaaaactggcacaagacagggatgccctctctcacca
ctcctattcaacataatgttggaagttctggccagggcaattagtcaggagaaggaaata
aagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgatt
gtatatctagacaaccctgttgtctcagcccaaaatctccttaagctaataagcaacttc
agcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttatataccaat
aacagacaaacagagagccaaatcatgagggaactcccattcacaattgcttcaaagaga
ataaaatacctaggaatccaacttacaagggacgtgaaggacctcttcaaggagaactac
gaaccactgctcaatgaaataaaagaggatacaaacaaatggaagaatgttccgtgctca
tgggtaggaagaatcaatatcgtgaaaatggccatagtgcccaaggtaatttatagattc
aatgtcatccccatcctgctaccaatgactttcttcacagaattggaaaaaactacttta
aagttcatatggaaccaaaaaagagcccgcatcgccaagtcaatcctaagccaaaagaac
aaagctggaggcatcacgctacctgacttcaaactatactacaaggctacagtaacaaaa
acagcatggtactggtaccaaaacagagatatatatcaatggaacagaacacagccctca
gaaataacgccacatatctacacctatctgatctttgacaaacctgagaaaaacaagcaa
tggggaaaggattctctatttaataaatggtgctgggaaaactggctagccatatgtaga
aagctgaaactggatcccttccttacaccttatacaaaaattaattcaagatggattaaa
gacttaaacgttattcctaaaaccataaaaaccctagaagaaaacctaggcatcaccatt
caggacataggcatgggccaggacttcatgtctaaaacaccaaaagcaatggcaacaaaa
gccaaaattgacaaatgggatctcattaaactaaagagcttctgcacagcaaaagaaact
accatcagagtgaacaggcaacctacaaaatgggagaaaatttttgcaacctactcatct
gacaaagggctaatatccagaatgtacaatgaacttaaacaaatttacaagaaaaaaaca
aacaactccatcaaaaagtgggcgaaggatatgaacagacacttctcaaaagaagacatt
tatgcagccaaaaaacacatgaaaaaatgctcaccatcactggccatcagagaaatgcaa
atcaaaaccacaatgagataccatctcacaccagttagaatggtgatcattaaaaagtca
ggaaacaacagcctgggtgacaaggcgagaccctgtctgaaaaaaagaaaagaaaagaaa
atgtttctaacagtgggcacaaaaagcctagaagaaaaggcagaggggattcggggaact
tcaagcagcttagtgaaggaatatcttaaagtgtcaaagatacttcctgccctcgaatat
tggactccaagttcttcagttttggaacttggactggctctccttgctcctcagcctgca
gatggtcgattgtgggaccttgtgatcctgtaa
>gi568815596f:62952991_63156313|GENSCAN_predicted_peptide_5|78_aa
MGKTAPMIQLPPPGPTLDTWELLQFKTNPMTAAAASCCRTEVQENHTLQNCLRLLLPLRV
APLSPVAGFCSTVATVPI
>gi568815596f:62952991_63156313|GENSCAN_predicted_CDS_5|237_bp
atggggaaaactgcccccatgattcagttacctccacctggtcccacccttgacacgtgg
gaattattacaattcaagacaaatccgatgactgctgctgctgcaagctgttgcaggact
gaagtgcaagaaaatcatacactccagaactgtctgcgtttgctgctcccactgagagtg
gccccactttccccagtggcaggcttctgcagcacagttgccactgtccctatctga
>gi568815596f:62952991_63156313|GENSCAN_predicted_peptide_6|265_aa
XPTQMRRNQKTNPGNMTKQGSSTPPKNHTSSAAVDPNQEEIPDLPEKEFREKTHLTHKDS
HKLKVKGQKKAFHANGHQKRAGAAILIPDKTNFKVTAVKRDQEGHYIMVKTLSNRKYHNP
KNRGTCDAEKAFDKIQHPFMIKTLSKISIQRTYLNVINSIYDKPTANVILNGEKLKAFPL
GTGTRQECPLSPLLFNIVLEVLARAIRQEKEIKGIQISKQEVKLSLFADDTIVYLENPKD
SSRKLLELIKEFSKVSGYKINIQNQ
>gi568815596f:62952991_63156313|GENSCAN_predicted_CDS_6|798_bp
nagcctacccaaatgagaaggaaccagaaaacaaaccctggtaatatgacaaaacaaggc
tcttcaacacccccaaaaaatcacactagttcagcagcagtggatccaaatcaagaagaa
atccctgatttacctgaaaaagaatttagggagaagactcacctaacacataaggactca
cataaacttaaagtaaaggggcagaaaaaggcatttcatgcaaatggacaccaaaagcga
gcaggggcagctattcttataccagacaaaacaaattttaaagtaacagcagttaaaaga
gaccaagagggacattatataatggtaaaaaccttgtccaacaggaaatatcacaatcct
aaaaacagaggcacctgcgatgcagaaaaagcattcgacaaaatccagcatccctttatg
attaaaactctcagcaaaatcagcattcaaaggacataccttaatgtaataaactccatc
tatgacaaaccaacagcaaacgtgatactgaatggggaaaagttgaaagcattccctctg
ggaactggaacaagacaagaatgcccgctgtcaccactcctgttcaacatagtactggaa
gtcctagccagagcaatcagacaagagaaagaaataaagggcatccaaatcagtaaacag
gaagtcaaactgtcactgtttgctgatgatacaatcgtttaccttgaaaaccctaaggac
tcctccagaaagctcctagaactgataaaagaattcagcaaagtttctggatacaagatt
aatatacaaaatcagtag