GENSCAN 1.0    Date run: 4-Nov-116    Time: 17:08:18

Sequence gi568815576f:35163076_35393648 : 230573 bp : 47.11% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   3689   3925  237  1  0   62   47   178 0.821   7.27
 1.02 PlyA +   4500   4505    6                               1.05

 2.00 Prom +   5034   5073   40                              -4.96
 2.01 Sngl +   5496   6179  684  2  0   66   47   262 0.703  16.09
 2.02 PlyA +   6329   6334    6                               1.05

 3.08 PlyA -   7346   7341    6                               1.05
 3.07 Term -  15514  15349  166  0  1   80   53   112 0.431   4.29
 3.06 Intr -  15938  15874   65  2  2   41  107    51 0.434  -0.18
 3.05 Intr -  18642  18481  162  0  0  101   60   135 0.609  12.17
 3.04 Intr -  20319  20165  155  1  2   13   90    95 0.383   1.99
 3.03 Intr -  29472  29337  136  0  1   40   47    86 0.291  -0.16
 3.02 Intr -  31944  31782  163  2  1   72   81    78 0.730   5.48
 3.01 Init -  37854  37715  140  0  2   86    8   155 0.516   6.91
 3.00 Prom -  55526  55487   40                              -2.06

 4.04 PlyA -  55859  55854    6                               1.05
 4.03 Term -  73896  73484  413  1  2   39   36   243 0.493   9.60
 4.02 Intr -  82772  82606  167  1  2   74   88    77 0.162   5.90
 4.01 Init -  87459  87410   50  0  2   76   92    16 0.343   1.38
 4.00 Prom -  90423  90384   40                              -5.16

 5.02 PlyA -  90807  90802    6                               1.05
 5.01 Sngl -  94582  94316  267  1  0   58   41   258 0.700  11.33
 5.00 Prom -  96396  96357   40                             -10.55

 6.00 Prom +  96473  96512   40                              -4.96
 6.01 Init +  99316  99346   31  0  1  110   86    64 0.999   8.30
 6.02 Intr + 100003 100151  149  2  2   57  115   135 0.979  13.05
 6.03 Intr + 100721 100799   79  1  1   49   78    56 0.927  -0.18
 6.04 Intr + 101573 102528  956  0  2   85   98   486 0.736  40.11
 6.05 Intr + 107270 107327   58  1  1  106  101   -22 0.318  -0.64
 6.06 Term + 107443 107546  104  2  2   81   47    50 0.318  -1.36
 6.07 PlyA + 107724 107729    6                              -0.45

 7.00 Prom + 107887 107926   40                              -0.76
 7.01 Init + 120902 120962   61  1  1   68   80    92 0.504   7.81
 7.02 Intr + 122922 122986   65  1  2   55  119    51 0.980   3.34
 7.03 Intr + 124272 124487  216  2  0   76   67   156 0.650  11.00
 7.04 Intr + 129917 130039  123  1  0  119   97    23 0.983   7.08
 7.05 Term + 130532 130576   45  1  0  101   48    37 0.796  -1.79
 7.06 PlyA + 131901 131906    6                               1.05

 8.00 Prom + 135712 135751   40                              -5.86
 8.01 Init + 136854 136905   52  2  1   92  109    27 0.304   6.76
 8.02 Intr + 154802 154886   85  0  1  113   86   200 0.401  21.18
 8.03 Intr + 158884 158962   79  1  1  100   79    85 0.961   8.35
 8.04 Intr + 159953 160102  150  1  0   89  109   321 0.781  34.66
 8.05 Intr + 160421 160555  135  1  0   93   76   184 0.999  18.46
 8.06 Intr + 160693 160839  147  0  0  116   97    89 0.921  13.03
 8.07 Intr + 164196 164312  117  2  0   86   99   221 0.968  23.66
 8.08 Intr + 167272 167405  134  0  2   96   20   345 0.992  27.94
 8.09 Intr + 169906 169939   34  1  1  129   93    48 0.997   7.63
 8.10 Intr + 170329 170422   94  0  1   78   80   110 0.999   8.74
 8.11 Intr + 171253 171373  121  2  1  117   70   141 0.984  14.75
 8.12 Intr + 175530 175713  184  0  1   66  103    85 0.453   7.59
 8.13 Intr + 181126 181179   54  0  0   89   89    22 0.551   1.58
 8.14 Intr + 182650 182709   60  0  0  110   68    93 0.992   8.43
 8.15 Intr + 183855 183894   40  2  1  123   62    70 0.999   5.70
 8.16 Term + 183980 184134  155  0  2  123   48   180 0.987  15.58
 8.17 PlyA + 193465 193470    6                               1.05

 9.00 Prom + 195713 195752   40                              -6.26
 9.01 Init + 200650 200697   48  0  0   61   31   110 0.231   3.55
 9.02 Intr + 205436 205572  137  1  2   91   94    -8 0.050  -0.53
 9.03 Intr + 220031 220151  121  2  1  101   95   142 0.280  16.70
 9.04 Intr + 223610 224101  492  1  0   93   94   902 0.996  84.50
 9.05 Intr + 226789 226888  100  0  1   83  100   158 0.434  16.18
 9.06 Term + 230393 230523  131  0  2  144   42    50 0.306   4.34

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 148832 148667 166 2 1 66 68 105 0.806 5.83 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576f:35163076_35393648|GENSCAN_predicted_peptide_1|78_aa ERSSSPAMEQSWTENDFVELREEGFRRSNFSKLKAEVRTHRKEAKNLEKRLDKWLTRITS VKNSLNDLMELKTMAGEL >gi568815576f:35163076_35393648|GENSCAN_predicted_CDS_1|237_bp gaacgcagctcctcaccagcaatggaacaaagctggacggagaatgactttgttgagttg agagaagaaggcttcagacgatcaaacttctccaagctaaaggcggaagttcgaacccat cgcaaagaagctaaaaaccttgaaaaaagattagacaaatggctaactagaataaccagt gtaaagaactccttaaatgacctgatggagctgaaaaccatggcaggagaactataa >gi568815576f:35163076_35393648|GENSCAN_predicted_peptide_2|227_aa MKAEIKMFFETNENKDTTYQNLWDTFKAVYRGKFIALNAHKRKKERSKIDTVTSQLKELE KQEQTQSKASRRQEITKIKAELKEIDTQKTLQKINESRSWFFEKINKIDRLLARLIKKKR EKNQIDAIKNDKGDITTNPTEVQTTIREYYKHLYTNKLENLEEMDKFLDTYTLPRLNQKE VESLNRPITGFEIEAIINSLPTKKSPGPDGFTAKFYQRYRRSWYHSF >gi568815576f:35163076_35393648|GENSCAN_predicted_CDS_2|684_bp atgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagacacaacataccag aatctctgggatacatttaaagcagtgtatagagggaaatttatagcactaaatgcccac aagagaaagaaggaaagatctaaaattgacaccgtaacatcacaattaaaagaactagag aagcaagagcaaacacagtcaaaagctagcagaaggcaagaaataactaagatcaaagca gaactgaaggagatagacacacaaaaaacccttcaaaaaatcaatgaatccaggagctgg ttttttgaaaagatcaacaaaattgatagactgctagcaagactaataaagaagaaaaga gagaagaatcaaatagacgcaataaaaaatgataaaggggatatcaccaccaatcccaca gaagtccaaactaccatcagagaatactacaaacacctctacacaaataaactagaaaat ctagaagaaatggataaattccttgacacatacaccctcccaagactaaaccagaaagaa gttgaatctctgaatagaccaataacaggctttgaaattgaggcaataattaatagccta ccaaccaaaaaaagtccaggaccagacggattcacagccaaattctaccagaggtacagg aggagctggtaccattccttctga >gi568815576f:35163076_35393648|GENSCAN_predicted_peptide_3|328_aa MVEGKEEQVPSYMDGSSRQREHEEDAKVETPDKTIRSPETYSLPREDALQGYPKRICTKA AAALLQFYLHLHDVSLGQKHTQVVSKFEAFSKQWQGRATRKVWLSEEDCWIVPRGARQKP YTAGCYSKWSRPSVRLQKRCLIEGDSIVALMSRKALVILRELDEGPEWKLAVTKLGGFAV 
GTKTHSSTDSLKMSSGQKRVGYFHSIPRGNACAEIAGDKARAVIWNWIHGTWIMEDLNAK LMCLDKILQASGEHNHDSEERNSKRRKVDEAQESCVLFMAAPYEEDRESPPVIWNHLELP RKCRTTLPAILPLCSVVHFDVPVWMPQP >gi568815576f:35163076_35393648|GENSCAN_predicted_CDS_3|987_bp atggtggagggcaaggaggagcaagtcccatcttacatggatggcagcagcaggcaaaga gaacatgaagaagacgcaaaagtggagacccctgataaaaccatcagatctcctgagact tactcactaccaagagaagacgctcttcaagggtacccgaagcgaatttgcaccaaagca gcagctgcgttgctgcagttctatcttcaccttcacgatgtttctcttggtcaaaaacac actcaagtcgtctccaagttcgaagcattcagcaaacaatggcaaggcagagccaccaga aaggtgtggctgtcagaggaggactgctggattgtgccacgtggagccaggcagaagcca tacacagctggttgctacagcaagtggagcagaccaagtgtccggctgcagaagaggtgt ctaatagaaggagacagcattgtagcactgatgtcaagaaaagccttagtcatcctcaga gagcttgatgaaggacctgagtggaagttagctgtgacaaagcttggaggctttgctgtt gggacaaagacacacagtagtacagactctttgaagatgtcaagtggccagaagcgagta ggttacttccacagtatacctagaggaaatgcatgtgcagaaatagcaggagataaggcc agagcagtcatttggaactggatccatggaacatggatcatggaagacctcaatgccaag ctgatgtgtttggacaagatcctgcaggcaagtggggagcacaaccatgatagtgaagaa agaaatagtaaaaggaggaaggttgatgaggcacaagagagctgtgtgttgtttatggca gctccctatgaggaagacagggagtcacctcctgtcatctggaatcacctggagcttccc cgaaagtgccgaaccactctgccagccattctaccattgtgcagcgtggtacacttcgat gtgcccgtgtggatgccccaaccctga >gi568815576f:35163076_35393648|GENSCAN_predicted_peptide_4|209_aa MGGRSLMNGLVLCSCNGREQQKPSRGPKLPLLGSSNEEPPSGVNGGNLDFYPQLAVMRRQ TPLCPARVVSEKDAEKAFNKIQHPFMIKTLSKIGMQRLNVIKAIYDKPTANIILNREKLK AFPLRTGTRQGCPPSPLLFNIVLEVLARAIGQEKEIKGIQIGKEEVKLLLFANDIISYLE NPKGSSRKLLELMKEFSKVSNTRLMYTNQ >gi568815576f:35163076_35393648|GENSCAN_predicted_CDS_4|630_bp atggggggcagatccctcatgaatggcttggtgctgtgctcctgtaatgggagagagcag cagaagcctagcagagggccgaagctcccactcttgggcagcagtaatgaggaacccccc tctggtgtcaatggtggcaacctggacttctacccccagctagcagtaatgaggcggcaa acccctctttgccctgctagagtggtatcagaaaaagatgcagaaaaagcattcaacaaa atccagcatccctttatgattaaaaccctcagcaaaatcggcatgcaaaggcttaatgta ataaaagccatctatgacaaacccacagccaacataatactgaatagggaaaagttgaaa gcattccctctgagaacaggaacaagacaaggatgcccaccttcaccactcctcttcaac 
atagtactggaagtcctagccagagcaattggacaagagaaagaaataaagggcatccag attggtaaggaggaagtcaaactgttgctgtttgctaacgatatcatcagttaccttgaa aaccctaagggctcctccagaaagctcctagaactgatgaaagaattcagcaaagtttcc aatacaagattaatgtacacaaatcagtag >gi568815576f:35163076_35393648|GENSCAN_predicted_peptide_5|88_aa MSRAQGGGWRRGGAAAASGWGPSPVSWGLRSPLPSLGSPGSRLNAADRRHLGEGEKSASG QGASVEPRAQTRAPEGESRSTKSAERIE >gi568815576f:35163076_35393648|GENSCAN_predicted_CDS_5|267_bp atgtcgcgggcccagggcggcgggtggcgacggggcggggctgctgccgcctcagggtgg gggcccagcccggtctcctggggtctccgctcgccgctaccttcactcggctcgcccgga tcccgcctcaacgccgccgatcgccgccatcttggagaaggagagaaaagcgcgagcggc cagggagcctcagtcgagcccagagcgcagactcgggctccggagggggaatcccgctcg accaagagcgccgagcgcatcgaataa >gi568815576f:35163076_35393648|GENSCAN_predicted_peptide_6|458_aa MAYDDSVKKEDCFDGDHTFEDIGLAAGRSQREKKRSYKDFLREEEEIAAQVRNSSKKKLK DSELYFLGTDTHKKKRKHSSDDYYYGDISSLESSQKKKKKSSPQSTDTAMDLLKAITSPL AAGSKPSKKTGEKSSGSSSHSESKKEHHRKKVSGSSGELPLEDGGSHKSKKMKPLYVNTE TLTLREPDGLKMKLILSPKEKGSSSVDEESFQYPSQQATVKKSSKKSARDEQGALLLGHE LQSFLKTARKKHKSSSDAHSSPGPEGCGSDASQFAESHSANLDLSGLEPILVESDSSSGG ELEAGELVIDDSYREIKKKKKSKKSKKKKDKEKHKEKRHSKSKRSLGLSAVPVGEVTVTS GPPPSIPYAGAAAPPLPLPGLHTDGHSEKKKKKEEKDKERERGEKMGPSSWKKISSGLPL ILHDGPWKNCFPRNRYPVPKRLRTADLVGLYLGSQKHK >gi568815576f:35163076_35393648|GENSCAN_predicted_CDS_6|1377_bp atggcttatgatgactccgtgaagaaagaagattgttttgatggtgatcatacctttgag gacataggacttgcagctggccgaagccaacgagagaaaaaacgttcttacaaagatttt ttaagggaagaggaagaaattgctgctcaggtcaggaattcttccaagaagaagttgaag gatagtgaactttacttcttggggacggacacacacaagaagaagaggaagcactcctct gatgattactactatggagatatttcgtctttggaatcgtcacagaagaaaaagaaaaag tccagcccacagtctactgatacagctatggacctgttgaaagctatcacttccccactg gcagcaggctccaagccctccaaaaagactggggagaaatcctctggctcttcaagccat tcggagagtaaaaaggagcaccacaggaagaaagtcagtggaagcagtggggaactaccc ctagaggatggtggctcccacaaatcgaaaaaaatgaaacctctctatgtgaacacagag acactgacccttcgggagcctgatggtttaaaaatgaaacttattctgtcaccaaaggag aagggaagcagctctgttgatgaggagtcttttcaatatccctcccaacaagcgactgtg 
aaaaaatcctcaaagaaatcagctcgggatgagcagggtgctttactcctaggacatgag ttacagagctttctgaaaacagcccggaaaaagcacaagtcatcctcagacgcacattca tctcctggccctgaaggctgtgggtctgacgcctcccagttcgcagagtcccacagtgct aaccttgatctttcagggcttgaacctattctggtagaatcagactcatcctctggtggg gaactagaggctggggagttagtgatagatgattcttaccgagaaatcaagaagaaaaag aagtcaaagaagagcaaaaagaagaaagacaaggagaagcataaagagaagcgacactcc aagtccaagagaagtttaggactttctgccgtgccagtgggagaggtcacagtgacatct ggccctcctcccagcatcccatacgctggagcagcagcacctcccctgccacttcctggc ctccacacagatgggcatagtgaaaaaaaaaagaaaaaagaagagaaggacaaagagaga gagagaggagaaaagatgggaccatctagttggaagaaaataagctcagggctcccactg attctacatgatggtccatggaaaaattgttttccacgaaaccggtacccggtgccaaaa aggttgaggaccgctgacctagtgggcttgtatctgggaagccagaaacataaatga >gi568815576f:35163076_35393648|GENSCAN_predicted_peptide_7|169_aa MSAYQVFCKEYRVTIVADHPDFGELSKKLAEVWKQLPEKDKLIWKQKAQYLQHKQNKAEA TTVKRKASSSEGSMKVKGSDHIPPLLFSKACEFCFLAKKLGKKLAVLEVDLDSQGMVAVS GSLSVLLDSIICALGPLACLTTQLPELNGCPKQVLSNTLDNIAYIMPGL >gi568815576f:35163076_35393648|GENSCAN_predicted_CDS_7|510_bp atgtcggcctaccaggtgttctgtaaagagtatcgcgtgaccattgtggctgaccatcca gattttggggaacttagtaaaaaactggctgaggtgtggaagcaattaccagaaaaagac aaactgatttggaagcaaaaagctcagtatctgcagcacaaacagaacaaagcagaagcc acaactgtgaaaaggaaagcatccagctcagaaggttccatgaaagtcaaaggtagtgac cacatcccgcccctgcttttctctaaagcatgtgaattttgcttccttgctaagaaactg ggaaaaaagcttgcagtattagaagtggaccttgattcccagggtatggtggctgtgtct ggcagtttgtcagtgcttctggattccattatctgtgcccttggccccttggcatgtctc accacacaactacctgaattgaatggctgtcccaaacaggtcttgtcaaacacattagac aacattgcttacatcatgccgggactgtga >gi568815576f:35163076_35393648|GENSCAN_predicted_peptide_8|546_aa MDFLLGNPFSSPVGQRIEKATDGSLQSEDWALNMEICDIINETEEGPKDALRAVKKRIVG NKNFHEVMLALTVLETCVKNCGHRFHVLVASQDFVESVLVRTILPKNNPPTIVHDKVLNL IQSWADAFRSSPDLTGVVTIYEDLRRKGLEFPMTDLDMLSPIHTPQRTVFNSETQSGQDS VGTDSSQQEDSGQHAAPLPAPPILSGDTPIAPTPEQIGKLRSELEMVSGNVRVMSEMLTE LVPTQAEPADLELLQELNRTCRAMQQRVLELIPQIANEQLTEELLIVNDNLNNVFLRHER FERFRTGQTTKAPSEAEPAADLIDMGPDPAATGNLSSQLAGMNLGSSSVRAGLQSLEASG 
RLEDEFDMFALTRGSSLADQRKEQAWRMGAVNLCPPARSVLASAIGKGSIQWLGGLSFRV KYEAPQATDGLAGALDARQQSTGAMGGGCGKSLSDWMVRQGMIPVTQACLMEDIEQWLST DVGNDAEEPKGVTSEEFDKFLEERAKAADRLPNLSSPSAEGPPGPPSGPAPRKKTQEKDD DMLFAL >gi568815576f:35163076_35393648|GENSCAN_predicted_CDS_8|1641_bp atggactttctcctggggaacccgttcagctctccagtgggacagcgcatcgagaaagcc acagatggctccctgcagagcgaggactgggccctcaacatggagatctgcgacatcatc aacgagacggaggaaggtcccaaagatgccctccgagcagtaaagaagagaatcgtgggg aataagaacttccacgaggtgatgctggctctcacagtcttagaaacctgtgtcaagaac tgcgggcaccgcttccacgtgctggtggccagccaggacttcgtggagagtgtgctggtg aggaccatcctgcccaagaacaacccacccaccatcgtgcatgacaaagtgctcaacctc atccagtcctgggctgacgcgttccgcagctcgcccgatctgacaggtgtggtcaccatc tatgaggacctgcggaggaaaggcctggagttccccatgactgacctggacatgctgtca cccatccacacaccccagaggaccgtgttcaactcagagacacaatcaggacaggattct gtgggcactgactccagccagcaagaggactctggccagcatgctgcccctctgcccgcc ccgcccatactctccggtgacacgcccatagcaccaaccccggaacagattgggaagctg cgcagtgagctggagatggtgagtgggaacgtgagggtgatgtcggagatgctgacggag ctggtgcccacccaggccgagcccgcagacctggagctgctgcaggagctcaaccgcacg tgccgagccatgcagcagcgggtcctggagctcatccctcagatcgccaatgagcagctg acagaggagctgctcatcgtcaatgacaatctcaacaatgtgttcctgcgccatgaacgg tttgaacggttccgaacaggccagaccaccaaggccccaagtgaggccgagccggcagct gacctgatcgacatgggccctgacccagcagccaccggcaacctctcatcccagctggca ggaatgaacctgggctccagcagtgtgagagctggcctgcagtctctggaggcctctggt cgactggaagatgagtttgacatgtttgcgctgacacggggcagctcactggctgaccaa cggaaagagcaggcctggaggatgggagccgtcaatctgtgccccccagcccgctccgtt cttgcatcagccatcggtaagggcagcatccaatggcttggtggtctctctttcagggta aaatacgaagccccccaagcaacagacggcctggctggagccctggacgcccggcagcag agcactggcgcgatgggaggcggctgcgggaaaagcctcagtgactggatggtgagacaa ggcatgatcccagtcacccaggcctgcctcatggaggacatcgagcagtggctgtccact gacgtgggtaatgatgcggaagagcctaagggggtcaccagcgaagaatttgacaaattc ctggaagaacgggccaaagccgcggaccgattgcccaacctctccagcccctcagctgag gggcccccgggtcccccatctggcccagcgccccggaagaagacccaggagaaagatgat gacatgctgtttgccttatga >gi568815576f:35163076_35393648|GENSCAN_predicted_peptide_9|342_aa 
MQTASIQCYVTFGYEEPTPPTTPSCPGAPSCQLSCKEASPCPGLQGTGFFLEPRVLPAFL GSMPQDLSEALKEATKEVHTQAENAEFMRNFQKGQVTRDGFKLVMASLYHIYVALEEEIE RNKESPVFAPVYFPEELHRKAALEQDLAFWYGPRWQEVIPYTPAMQRYVKRLHEVGRTEP ELLVAHAYTRYLGDLSGGQVLKKIAQKALDLPSSGEGLAFFTFPNIASATKFKQLYRSRM NSLEMTPAVRQRVIEEAKTAFLLNIQLFEELQELLTHDTKDQSPSRAPGLRQRASNKVQD SAPVETPRGKPPLNTRSQAPLLRWVLTLSFLVATVAVGLYAM >gi568815576f:35163076_35393648|GENSCAN_predicted_CDS_9|1029_bp atgcaaacagccagtatccagtgctacgtgactttcggttacgaggaaccaactcccccg accacgcccagctgcccaggggccccgtcctgccagctctcctgcaaggaggcttctccc tgccctggtctccaaggaacaggcttcttcctggagcccagggtccttcctgccttcctt ggcagcatgccccaggatttgtcagaggccctgaaggaggccaccaaggaggtgcacacc caggcagagaatgctgagttcatgaggaactttcagaagggccaggtgacccgagacggc ttcaagctggtgatggcctccctgtaccacatctatgtggccctggaggaggagattgag cgcaacaaggagagcccagtcttcgcccctgtctacttcccagaagagctgcaccgcaag gctgccctggagcaggacctggccttctggtacgggccccgctggcaggaggtcatcccc tacacaccagccatgcagcgctatgtgaagcggctccacgaggtggggcgcacagagccc gagctgctggtggcccacgcctacacccgctacctgggtgacctgtctgggggccaggtg ctcaaaaagattgcccagaaagccctggacctgcccagctctggcgagggcctggccttc ttcaccttccccaacattgccagtgccaccaagttcaagcagctctaccgctcccgcatg aactccctggagatgactcccgcagtcaggcagagggtgatagaagaggccaagactgcg ttcctgctcaacatccagctctttgaggagttgcaggagctgctgacccatgacaccaag gaccagagcccctcacgggcaccagggcttcgccagcgggccagcaacaaagtgcaagat tctgcccccgtggagactcccagagggaagcccccactcaacacccgctcccaggctccg cttctccgatgggtccttacactcagctttctggtggcgacagttgctgtagggctttat gccatgtga