GENSCAN 1.0 Date run: 4-Nov-116 Time: 08:21:49 Sequence gi568815576f:34967088_35185690 : 218603 bp : 45.93% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5792 5895 104 0 2 43 121 79 0.521 6.59 1.02 Intr + 21876 21987 112 2 1 12 87 93 0.050 1.55 1.03 Intr + 41827 41863 37 2 1 95 72 37 0.045 0.12 1.04 Term + 51489 51636 148 0 1 101 38 91 0.200 2.77 1.05 PlyA + 54944 54949 6 1.05 2.07 PlyA - 55003 54998 6 1.05 2.06 Term - 59401 58786 616 0 1 73 48 165 0.495 4.94 2.05 Intr - 63275 63004 272 2 2 27 71 165 0.209 5.04 2.04 Intr - 82006 81672 335 2 2 84 68 118 0.156 4.69 2.03 Intr - 82510 82091 420 2 0 81 66 147 0.044 5.72 2.02 Intr - 83421 82678 744 1 0 70 63 195 0.016 6.62 2.01 Init - 89525 89444 82 2 1 55 91 21 0.493 0.24 2.00 Prom - 93345 93306 40 -4.56 3.00 Prom + 95135 95174 40 -1.06 3.01 Init + 99747 99772 26 2 2 90 95 -5 0.157 -0.47 3.02 Intr + 100033 100229 197 1 2 92 110 56 0.499 7.26 3.03 Intr + 115431 115582 152 1 2 80 92 209 0.899 20.38 3.04 Intr + 117296 117412 117 1 0 77 87 167 0.924 16.16 3.05 Term + 130231 130293 63 0 0 67 43 88 0.074 0.09 3.06 PlyA + 131334 131339 6 1.05 4.06 PlyA - 131800 131795 6 1.05 4.05 Term - 133201 133163 39 1 0 83 42 43 0.020 -3.61 4.04 Intr - 136723 136697 27 1 0 103 62 72 0.084 4.41 4.03 Intr - 150119 150075 45 2 0 57 95 56 0.088 1.81 4.02 Intr - 151727 151602 126 2 0 96 55 38 0.068 2.18 4.01 Init - 160651 160607 45 1 0 60 111 30 0.106 3.18 4.00 Prom - 165677 165638 40 -4.26 5.00 Prom + 174858 174897 40 -3.26 5.01 Init + 175150 175286 137 0 2 57 79 33 0.259 -1.05 5.02 Term + 179475 179619 145 0 1 30 47 196 0.645 7.08 5.03 PlyA + 180612 180617 6 1.05 6.00 Prom + 183391 183430 40 -5.66 6.01 Init + 184030 184109 80 0 2 74 37 59 0.053 -0.06 6.02 Intr + 187510 187681 172 1 1 34 115 71 0.096 4.35 6.03 Term + 199677 199913 237 2 0 62 47 178 0.739 7.27 6.04 PlyA + 200488 200493 6 1.05 7.00 Prom + 201022 201061 40 -4.96 7.01 Sngl + 201484 202167 684 0 0 66 47 262 0.703 16.09 7.02 PlyA + 202317 202322 6 1.05 8.05 PlyA - 203334 203329 6 1.05 8.04 Term - 211502 211337 166 1 1 80 53 112 0.431 4.29 8.03 Intr - 211926 211862 65 0 2 41 107 51 0.434 -0.18 8.02 Intr - 214630 214469 162 1 0 101 60 135 0.610 12.17 8.01 Init - 216296 216153 144 2 0 32 90 87 0.577 3.42 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 27186 27251 66 2 0 67 95 73 0.843 6.97 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576f:34967088_35185690|GENSCAN_predicted_peptide_1|133_aa XVPLFLDDQEAGVYWQATQRWLLQQAQRKKHAEWKNGSDRNKTEVDHLFGRHNFVMFGVT TVYKRKMMETYEEPSLPINPQTSTHFPVPLGSNPMSLLWSIRPYISYGVCSHATMVLTLR PRMMESHYPECSR >gi568815576f:34967088_35185690|GENSCAN_predicted_CDS_1|402_bp naggtacccctgttcttggatgatcaagaggcaggggtctactggcaggcaactcagagg tggttactgcagcaagcccagaggaaaaagcatgcagaatggaagaatggatcagacaga aataagactgaagtcgaccacctgtttggtcgtcacaattttgtgatgtttggtgtcaca actgtctataagagaaagatgatggagacctatgaagaaccatctttgcccatcaaccct caaacaagcacacacttcccagtacccctgggatcaaatcccatgtccttgctgtggtcc atcaggccctacatcagctatggggtctgctcccatgccacgatggtcctcactctaaga cccaggatgatggagagtcactatccagaatgtagcagatga >gi568815576f:34967088_35185690|GENSCAN_predicted_peptide_2|822_aa MQWDSFPYHSLELSSWEKLGFVSQKKTGIPTGVLTGILTGILTGIPTCILVGILTGNPTG ILTSILSGIPTGILTGILTGIPTCILISILTGNHTGILTGILSGIPTGILTGIPTGILTD ILTGNHTGILTGILSGIPTGILTGIPTGILTGIHTGILTGIPTGISTGIPTDISIGIPIG ISKGISIGFPTSIPTGILTGILTGIPTCILIGILTGNHTGILTSILSGIPTGILTGIHTG ILTGNRTGILTGIPTDISTDIPTAISIGIPIGISKGILTDISIGIPIGISKGISAGFPTS IRTSIPTGILTGIHTGILTGIHTGILSSIPTGIPTGILTGIHTGILSSIPTGILTGIYTG IPTGIPTGIPTRIPIGIPTDISIGIPVGISKGISIGFPTSIPTGIPTGILTGIHTGIPTD ISIGIPIGISKGTSTGFPTSIPTGILTGIPTGFLTDISTGIPTGISRGIPIGLSKRISTG ISIGIPKGILTGIPTGILIGIHRGILMGFFTGIPMGIHTGILTGSPQESSGWHLAGAPLG QSFQKKEQAAVFAVLQPLLVIPRQTVWSGPPATPADLQQRGLTVRRKTNKQKEIASTSTK RMSTQKPHQKVTNIKDQRIRKSYFKFHMEPKKSLYSQDNLKQKNKAGGITLPDFKLYYKA TVTKTAWYWYQNRYIDQWNRTEALEIMPHIYNHLIFDKPDKNKQWGKDSLFNKWCWENWL AICRKLKLDPFLTPYKKINSRWIKDFNVRPKTIKTLEENLGNTIQDIGIGKDFVTKTPKA IATKAKIDKWDLIKELPHSKRNYHQSEQTTYRMGENFCYLSI >gi568815576f:34967088_35185690|GENSCAN_predicted_CDS_2|2469_bp atgcagtgggacagctttccttaccactctctggagcttagttcctgggaaaaattgggg tttgtcagccagaagaagacaggtatccccacaggggttctcacaggcattctcacaggt attctcactggtatccccacatgcattctcgtaggtatactcacgggtaatcccacgggc attctcacaagcattctctcaggtatccccacaggcattctcacaggtattctcactggt atccccacatgcattctcataagtattctcacgggtaatcacacaggcattctcacaggc attctctcaggtattcctacaggcattctcacaggtatccccacaggcattctcacagat attctcactggtaaccacacgggcattctcacaggcattctctcaggtattcctacaggc attctcacaggtatccccacaggcattctcactggtattcacacaggtattctcactggc attcccacaggtatttccacaggtattcctacagatatttccataggtattcccataggt atttccaaaggtatttctataggctttcccacaagtatccccacaggcattctcacaggt attctcactggtatccccacatgcattctcataggtattctcactggtaatcacacgggc attctcacaagcattctctcaggtatccccacaggcattctcactggtattcacacaggt attctcactggtaatcgcacaggtattctcacaggcattcccacagatatttccacagat attcctacagctatttccataggtattcccataggtatttccaaaggtattcttacagat atttccataggtattcccataggtatttccaaaggtatttctgcaggctttcccacaagt atccgcacaagtattcccacaggtattcttactggtattcacacaggtattctcactggt attcacacaggcattctctcaagtattcccacaggcatccccacaggtattcttactggt attcacacagggattctctcaagtattcccacagggattctcaccggtatttatacaggc attcccacaggtatcccaacaggcattcccacacgcattcccataggcattcccacagat atttccataggtattcctgtaggtatttccaaaggtatttctataggctttcccacaagt atccccacaggtattcccacaggtattcttactggtattcacacaggtattcctacagat atttccataggtattcccataggtatttccaaaggtacttctacaggctttcccacaagt atccccacaggtattctcactggtatccccacaggctttctcacagatatttccacaggc attcctacaggtatttccagaggtattcccataggtctttccaaacgtatttctacaggc atttccataggcattcccaaaggcattctcacaggcatccccacaggtattctcattggt attcacagaggtattctcatgggttttttcactggtatccccatgggtattcacacaggt attctcaccggatctccacaggagagctctggctggcatctggcgggtgcccctttggga caaagcttccagaagaaggaacaggcagcagtctttgctgtactgcagcctctgctggtg atacccaggcaaacggtctggagtggacctccagcaactccagcagacctgcagcagagg ggcctgactgttagaaggaaaactaacaagcagaaagaaattgcatcaacatcaacaaaa aggatgtccacacaaaaaccccaccagaaggtcaccaacatcaaagaccaaagaattaga aaaagctactttaaatttcatatggaaccaaaaaagagcctgtatagccaagacaacctt aagcaaaagaacaaagctggaggcatcacgctacctgacttcaaactatattacaaggct acagtaaccaaaacagcatggtactggtaccaaaacaggtatatagaccaatggaacaga acagaggccttagaaataatgccacacatctacaaccatctgatctttgacaaacctgac aaaaacaaacaatggggaaaggattccctatttaataagtggtgttgggaaaactggcta gccatatgcagaaaactgaaactggaccccttccttacaccttataaaaaaattaactca agatggattaaagacttcaatgtaagacctaaaaccataaaaacccttgaagaaaaccta ggcaataccattcaggacataggcattggcaaagactttgtgactaaaacaccaaaagca attgcaacaaaagccaaaattgacaaatgggatctaattaaagagcttccgcatagcaaa agaaactatcatcagagtgaacagacaacctacagaatgggagaaaatttctgctatcta tccatctga >gi568815576f:34967088_35185690|GENSCAN_predicted_peptide_3|184_aa MVTPLMSNWGMERNSLGCCEAPKKLSLSFSIEAILKRPARRSDMDRPEGPGEEGPGEAAA SGSGLEKPPKDQPQEGRKSKRRVRTTFTTEQLHELEKIFHFTHYPDVHIRSQLAARINLP EARVQIWFQNQRAKWRKQEKIGNLGAPQQLSEASVALPTNLDVAAVTDLPPTFSNSVSSS VRYD >gi568815576f:34967088_35185690|GENSCAN_predicted_CDS_3|555_bp atggtgactcctttgatgtcaaactggggtatggagagaaatagcttggggtgctgtgag gccccgaagaagctgagcctgtccttctccattgaggcgatcctaaagaggcctgccagg aggagtgatatggacagaccagaagggccaggtgaagagggccccggagaagctgcggcc tcaggctctgggctagaaaagcctccaaaggaccagccccaggaaggaaggaagagcaag cggagggttcgtaccaccttcaccactgagcagctgcatgagctggagaagatcttccac tttacccactacccagacgttcacatccgcagccagctggcagccaggatcaacctccca gaagctcgggtgcagatctggttccagaatcagcgagccaagtggcggaagcaggagaag attggcaacctgggggctccacagcagctgagtgaagccagtgtggccctgcccacaaat ctggatgtggctgcagtgaccgatttgccacctactttcagcaactcggtttcctcatct gtccgatacgactga >gi568815576f:34967088_35185690|GENSCAN_predicted_peptide_4|93_aa MGGIATQGEKERTFQGSLSMDGFREGQGAVPTQPVPRVGCQTLASRFKPHSPCQVLLGYP EDAVKKVTLPSLPSIDLLVTLMRKMKYGEIVLN >gi568815576f:34967088_35185690|GENSCAN_predicted_CDS_4|282_bp atgggtggtattgctacacagggagaaaaggagaggactttccagggatccctgtccatg gatggattcagggaaggacaaggggccgtgcccacccagccagtaccccgggtgggctgt cagaccctggcctccagatttaagcctcattctccatgccaggtgctactgggttacccc gaagatgcagtgaagaaagtcacgctgccatccctgccctccatcgacctcctggtgacc ttgatgaggaaaatgaagtacggagagattgttttgaattaa >gi568815576f:34967088_35185690|GENSCAN_predicted_peptide_5|93_aa MWTFWLHIQAPYAPNQLPLGTLWAGSVPTNSGVHLVKNKERKEGSSTWKELLEILKRCSN FLQACENHQLQYFRPRFLAVSMSLDFGSSSSNL >gi568815576f:34967088_35185690|GENSCAN_predicted_CDS_5|282_bp atgtggaccttctggctgcacatccaagctccatatgcccccaaccaacttccccttggt actctctgggctgggagtgtacccaccaacagtggggtccaccttgtaaagaataaggaa aggaaagagggctccagcacttggaaggagcttctagagatcctcaagcgttgcagcaac ttcctgcaggcttgtgagaaccaccagcttcaatacttcaggcccaggttcctggccgtg tcaatgtccctggattttggcagcagctcctctaacctctga >gi568815576f:34967088_35185690|GENSCAN_predicted_peptide_6|162_aa MGTRGEDSLLQPREVSEESCPDDLPELSPQYKLDTESNKVLHAQDISGAKVQRQNGHRLC SQRSSERIPSQSPPTWTLCLLNNKERSSSPAMEQSWTENDFVELREEGFRRSNFSKLKAE VRTHRKEAKNLEKRLDKWLTRITSVKNSLNDLMELKTMAGEL >gi568815576f:34967088_35185690|GENSCAN_predicted_CDS_6|489_bp atggggacccggggagaagacagccttctacagccaagagaggtctcagaagaaagctgc cctgatgatcttccagaactaagtccgcagtacaaactagacactgaatctaacaaggtg ctccatgctcaggacatttctggagccaaggtgcagaggcagaatggacacaggctttgt agtcaaagatcctcagagaggattccaagtcagtcacctcccacctggaccctctgtctc ctcaacaacaaggaacgcagctcctcaccagcaatggaacaaagctggacggagaatgac tttgttgagttgagagaagaaggcttcagacgatcaaacttctccaagctaaaggcggaa gttcgaacccatcgcaaagaagctaaaaaccttgaaaaaagattagacaaatggctaact agaataaccagtgtaaagaactccttaaatgacctgatggagctgaaaaccatggcagga gaactataa >gi568815576f:34967088_35185690|GENSCAN_predicted_peptide_7|227_aa MKAEIKMFFETNENKDTTYQNLWDTFKAVYRGKFIALNAHKRKKERSKIDTVTSQLKELE KQEQTQSKASRRQEITKIKAELKEIDTQKTLQKINESRSWFFEKINKIDRLLARLIKKKR EKNQIDAIKNDKGDITTNPTEVQTTIREYYKHLYTNKLENLEEMDKFLDTYTLPRLNQKE VESLNRPITGFEIEAIINSLPTKKSPGPDGFTAKFYQRYRRSWYHSF >gi568815576f:34967088_35185690|GENSCAN_predicted_CDS_7|684_bp atgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagacacaacataccag aatctctgggatacatttaaagcagtgtatagagggaaatttatagcactaaatgcccac aagagaaagaaggaaagatctaaaattgacaccgtaacatcacaattaaaagaactagag aagcaagagcaaacacagtcaaaagctagcagaaggcaagaaataactaagatcaaagca gaactgaaggagatagacacacaaaaaacccttcaaaaaatcaatgaatccaggagctgg ttttttgaaaagatcaacaaaattgatagactgctagcaagactaataaagaagaaaaga gagaagaatcaaatagacgcaataaaaaatgataaaggggatatcaccaccaatcccaca gaagtccaaactaccatcagagaatactacaaacacctctacacaaataaactagaaaat ctagaagaaatggataaattccttgacacatacaccctcccaagactaaaccagaaagaa gttgaatctctgaatagaccaataacaggctttgaaattgaggcaataattaatagccta ccaaccaaaaaaagtccaggaccagacggattcacagccaaattctaccagaggtacagg aggagctggtaccattccttctga >gi568815576f:34967088_35185690|GENSCAN_predicted_peptide_8|178_aa MSRKALVILRELDEGPEWKLAVTKLGGFAVGTKTHSSTDSLKMSSGQKRVGYFHSIPRGN ACAEIAGDKARAVIWNWIHGTWIMEDLNAKLMCLDKILQASGEHNHDSEERNSKRRKVDE AQESCVLFMAAPYEEDRESPPVIWNHLELPRKCRTTLPAILPLCSVVHFDVPVWMPQP >gi568815576f:34967088_35185690|GENSCAN_predicted_CDS_8|537_bp atgtcaagaaaagccttagtcatcctcagagagcttgatgaaggacctgagtggaagtta gctgtgacaaagcttggaggctttgctgttgggacaaagacacacagtagtacagactct ttgaagatgtcaagtggccagaagcgagtaggttacttccacagtatacctagaggaaat gcatgtgcagaaatagcaggagataaggccagagcagtcatttggaactggatccatgga acatggatcatggaagacctcaatgccaagctgatgtgtttggacaagatcctgcaggca agtggggagcacaaccatgatagtgaagaaagaaatagtaaaaggaggaaggttgatgag gcacaagagagctgtgtgttgtttatggcagctccctatgaggaagacagggagtcacct cctgtcatctggaatcacctggagcttccccgaaagtgccgaaccactctgccagccatt ctaccattgtgcagcgtggtacacttcgatgtgcccgtgtggatgccccaaccctga