GENSCAN 1.0 Date run: 8-Nov-116 Time: 09:25:32 Sequence gi568815595f:98164285_98365259 : 200975 bp : 37.41% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4416 4680 265 2 1 78 52 112 0.309 3.82 1.02 Intr + 5026 5188 163 2 1 102 48 57 0.230 1.21 1.03 Intr + 13242 13367 126 0 0 31 72 105 0.158 2.07 1.04 Intr + 14063 15501 1439 2 2 44 60 281 0.294 9.19 1.05 Intr + 22606 22716 111 2 0 69 68 73 0.378 2.83 1.06 Intr + 27169 27326 158 2 2 66 106 22 0.758 0.61 1.07 Intr + 29145 29214 70 2 1 31 110 100 0.664 4.34 1.08 Intr + 29655 29774 120 1 0 79 92 57 0.892 4.75 1.09 Intr + 31372 31448 77 2 2 63 65 64 0.193 -0.08 1.10 Term + 31776 31886 111 2 0 78 35 55 0.300 -3.22 1.11 PlyA + 32445 32450 6 1.05 2.00 Prom + 35710 35749 40 -2.55 2.01 Init + 36386 36471 86 1 2 59 108 41 0.907 3.54 2.02 Term + 37543 37726 184 1 1 81 54 143 0.956 6.23 2.03 PlyA + 40009 40014 6 1.05 3.00 Prom + 46072 46111 40 -5.15 3.01 Init + 64542 64629 88 2 1 85 71 118 0.818 10.65 3.02 Intr + 80036 80116 81 0 0 55 56 91 0.399 1.29 3.03 Intr + 80300 80520 221 0 2 78 30 121 0.602 2.40 3.04 Intr + 81019 81165 147 0 0 106 47 48 0.713 2.01 3.05 Intr + 81280 81493 214 0 1 27 80 130 0.443 3.47 3.06 Term + 95613 95875 263 1 2 86 45 214 0.851 11.60 3.07 PlyA + 97395 97400 6 1.05 4.03 PlyA - 98453 98448 6 1.05 4.02 Term - 106028 105782 247 1 1 -4 53 203 0.195 1.88 4.01 Init - 126651 126338 314 0 2 60 64 220 0.150 13.64 4.00 Prom - 144409 144370 40 -3.75 5.07 PlyA - 144530 144525 6 1.05 5.06 Term - 152683 152475 209 2 2 98 50 116 0.334 5.32 5.05 Intr - 165111 164965 147 1 0 31 64 110 0.001 2.19 5.04 Intr - 165347 165203 145 2 1 30 91 102 0.010 3.63 5.03 Intr - 170081 169968 114 2 0 70 83 41 0.028 1.52 5.02 Intr - 196659 196566 94 2 1 13 86 111 0.076 2.45 5.01 Init - 198902 198835 68 2 2 65 30 124 0.394 4.90 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 162573 162687 115 2 1 73 66 94 0.955 6.12 S.002 Intr + 163348 163568 221 2 2 58 93 108 0.931 5.40 S.003 Intr + 165219 165325 107 0 2 64 81 90 0.860 4.84 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:98164285_98365259|GENSCAN_predicted_peptide_1|879_aa MEEENATLLTEFVLTGFLYQPQWKIPLFLAFLVIYLITIMGNLGLIAVIWKDPHLHIPMY LLLGNLAFVDAWISSTVTPKMLNNFLAKSSIQVFSIVTILISYTFVLFTVLEKKSDKGVR KAFSTCGAHLFSVCLYYGPLLLILNQEEVEFLNRPTGFEIVAIINSLLTKKSPGPDGFTA EFYQRPKSPSVYKQLQQSLRIQNHVQKSQAFLYTNNRQTESQIMSDLPFTIASKRIKYLG IQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRTNIMKMAILPKVIYRFNAIPN KLPMAFFTELEKTTLKFIWSQKRAHITKSVLSQKNKAGGSTLPDFKLYYKATVTKTAWYW YQNRDIDQWNRTQPSEIMPHIYNHLIFDKPDKNKKWGKDSLFNKWCWENWLAICRKLKLD PFLTPYTKVNSRWIKDLHVRHKTIKTLEENLGNTIQDIGMGKDFMSKTPKATATKAKIDK WDLIKLKSFCTAKETTIRVNRQPTKWEKIFATYSSDKGLISRIYNELKQIYKKKTNNPIK KWAKDMNRHFSKEDIYAAKKHMKKCSSSLAIREMQIKTTMRYHLTPVRMTIIKMSGNNRC WRRCGEIGTLLHCWLDCKLVQPLWKSVWRFLRDLELEIPFDPAIPLLGIYPKDYKSCCYK DTCTRTDLAQEDSHHCKMAETKTQYCHAVTGHVPKDMKQDGDPCEIWCRDSDRGTSLGRS IPGPPALCYMRKIHLRPLVLRPTSPRNISPILNWQLKTEAARTPQKPPGPSQMLTVEAYF NRIKACYHSPATAWASKTYKLSLNSCILPVQNRTSLTGSWSHSLQRLSKASDPIALGNAY ADKGLFRPPPSPTHQDGGFAPAQDWQIDFTHVPRVKKLK >gi568815595f:98164285_98365259|GENSCAN_predicted_CDS_1|2640_bp atggaagaggaaaatgcaacattgctgacagagtttgttctcacaggatttttatatcaa ccacagtggaaaatacccctgttcttggcattcttggtaatatatctcatcaccatcatg gggaatcttggtctgattgctgtcatctggaaagaccctcaccttcatatcccaatgtac ttactccttgggaatttagcttttgtggatgcttggatatcatccacagtgaccccaaag atgctgaataacttcttagctaagagttcaattcaggtattcagcattgtgactattctt atatcttacacatttgttctcttcacagtcttagaaaagaaatctgataagggtgtaagg aaagccttttccacctgtggagcccatctcttctctgtctgtttatactatggccccctt ctcttaatactaaaccaggaagaagttgaatttctgaatagaccaacaggctttgaaatt gtggcaataattaatagcttgctaaccaaaaaaagtccaggaccagatggattcacagct gaattctaccagaggccaaaatctccttcagtttataagcaacttcagcaaagtctcagg atacaaaatcatgtacaaaaatcacaagcattcttatacaccaacaacagacaaacagag agccaaatcatgagtgacctcccattcacaattgcttcaaagagaataaaatacctagga atccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaat gaaataaaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaacc aatatcatgaaaatggccatactgcccaaggtaatttatagattcaatgccatccccaac aagctaccaatggctttcttcacagaattggaaaaaactactttaaagttcatatggagc caaaaaagagcccacatcaccaagtctgtcctaagccaaaagaacaaagctggaggcagc acgctacctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtactgg taccaaaacagagatatagatcaatggaacagaacacagccctcagaaataatgccgcat atctacaaccatctgatctttgacaaacctgacaaaaacaagaaatggggaaaggattcc ctatttaataaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggat cccttccttacaccttatacaaaagttaattcaagatggattaaagacttacatgttaga cataaaaccataaaaaccctagaagaaaacctagggaataccattcaggacataggcatg ggcaaggacttcatgtctaaaacaccaaaagcaacggcaacaaaagccaaaattgacaaa tgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaac aggcaacctacaaaatgggagaaaatttttgcaacctactcatctgacaaagggctaata tccagaatctacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaa aagtgggcaaaggatatgaacagacacttctcaaaagaagacatttatgcagcaaaaaaa cacatgaaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccacaatg agataccatctcacaccagttagaatgacgatcattaaaatgtcaggaaacaataggtgc tggagaagatgtggagaaataggaacacttttacactgttggttggactgtaaactagtt caaccattgtggaagtcagtgtggcgattccttagggatctagaactggaaataccattt gacccagccatcccattactgggaatatacccaaaggattataaatcatgctgctataaa gacacatgcacacgaactgacttagcacaagaagacagccaccattgtaaaatggcggag actaaaacacagtattgccatgcggttacaggtcatgttcccaaagacatgaaacaagac ggagacccatgtgaaatttggtgccgtgactcagatcgggggacctcccttgggagatca atccctggtcctcctgctctttgctacatgagaaagatccacctacgacctctggtcctc agaccaaccagtccaaggaatatctcaccaattttaaattggcagctgaagactgaggct gcccgaacacctcagaagcctcctggaccatcacagatgcttacagtggaggcatacttt aataggattaaagcctgttatcactcgcctgctacagcatgggcttctaaaacctataaa ctttccttaaattcctgcattttacctgtccaaaaccggacaagtcttacaggaagctgg agtcattcactacaaaggctatcaaaggcatcagatcccattgctctaggcaacgcttat gctgataagggattgttcaggccccctccctcccctacacatcaagatgggggatttgcc cctgcccaggactggcaaattgactttactcacgtgccccgagtcaagaaactaaaataa >gi568815595f:98164285_98365259|GENSCAN_predicted_peptide_2|89_aa MTVKKEFYRSMDGSFNRRIVHREDISVSKSYGLQSSACSSQDATVASGYTGLRSPGSSVS IDLRKIAPFGSLCGGSDPVTRLCLGVEII >gi568815595f:98164285_98365259|GENSCAN_predicted_CDS_2|270_bp atgacagtcaagaaggaattctataggtccatggatggtagcttcaacagaagaattgtg catagggaagacatttctgtatccaagtcctatggattgcagtccagtgcctgtagttct caggatgccactgtagctagtggctataccggtctgaggtctcctggcagctctgtttcc atagatctacggaaaattgccccatttgggagtctctgtggtggctctgaccctgtgaca cgtctctgcttgggtgttgagattatctga >gi568815595f:98164285_98365259|GENSCAN_predicted_peptide_3|337_aa MQPEATGFLLCPVTPMDIITIVKSEFGVQDTGNIWCRRPVTGELLRESSPLSSCSHQTKG DTFYPCIQKLRRRSRTWEDSLPLVSDHRGDACCGHSPTFPWWQVNRKDACFGCSPTLQPR TQSGVPTGSLAESGANSSAASTPPSYNPSITSPPHTESGLQFHSTTSSPQPAQQFPLREE FQYLTQSYSLTWSDLNVILTSTLSPYERERVHFLAQSYSDTCWLHEPGLQEGTRAVPRED LHWQYQTDSPGRGASTCCHCHPWSQGVLQDYPGCSLKAQGLLNQLVVKLPSLRLTLQAVG SPQAQGKSRNIVEESTARIEDLKSQLGALPNCGDVRT >gi568815595f:98164285_98365259|GENSCAN_predicted_CDS_3|1014_bp atgcagccagaagccacaggattcctactctgcccagttactcctatggatatcattact attgtcaaatccgagtttggtgttcaagacacgggtaacatttggtgtcgaagacccgtg acaggggaactcctacgggagagcagtcccctgtcctcgtgctcccaccagacaaaagga gacacattttatccgtgcattcaaaaactccgacgtcggtcacggacatgggaagacagt cttcccttggtgtctgatcaccgcggtgacgcctgctgtggtcattcacccaccttccct tggtggcaggtcaatcgcaaggacgcctgctttggctgctcacccacattacagcccagg actcagtcaggggtgcctactggaagcctggctgaatcaggtgccaattcttccgcagcc tccactccaccatcctataacccttctattacctcccctcctcacaccgagtctggctta cagtttcattccacgactagctccccgcaacctgcccaacaatttcctcttagagaggaa ttccaatatctaactcagtcctacagtttaacctggagtgacttaaatgtcatcctgacc tctaccctctccccgtatgaacgagaaagagttcattttctagcccagtcctactcagac acctgctggcttcatgagccaggcctccaagagggcaccagggcagttccccgagaggat ctccattggcaataccagacggactcaccaggcagaggagcctccacctgttgccactgc cacccctggtcacaaggagtactgcaagactaccctggatgttcccttaaggcccaagga ctcttaaatcagcttgtggtaaagctgcctagcctgagactcacccttcaggcagtgggc tcccctcaagcccagggaaagtctagaaatattgttgaagagtcaactgctagaattgag gacctaaagagccagcttggtgctctacccaactgtggtgatgttcgtacctaa >gi568815595f:98164285_98365259|GENSCAN_predicted_peptide_4|186_aa MNFPELKEHIIAQCKKAKNHDKMMQVVTGKIASTKRNITDLIQLKNTLQELHNAITSIDS RIDQVEKKISELEDYLSEIKQADRNREKRMKRNKQNLKELRDGIMHEGTNPAEIEIWKLS DREFKITVLKKLRETQNNTKKEFRMISMKFNKEIERSKNNQTENLELKNAIGILKNALEP FNSKMD >gi568815595f:98164285_98365259|GENSCAN_predicted_CDS_4|561_bp atgaacttccctgagctaaaggagcacattatagcccaatgcaagaaagctaagaatcat gataaaatgatgcaggtggtgacaggcaaaatagccagtacaaagagaaatataactgac ctgatacagctgaaaaacacactacaagaacttcacaatgcaatcacaagtatcgatagc agaatagaccaagtggagaaaaaaatctcagagctcgaagactatctttctgaaataaaa caggcagacaggaatagagaaaaaagaatgaaaaggaataaacaaaacctcaaagaatta cgggatgggattatgcacgagggaaccaatcctgcagaaatagaaatatggaagctctca gacagagaattcaaaataactgtgttgaagaaactcagggaaactcaaaataacacaaag aaggaattcagaatgatatcaatgaaatttaacaaagagattgaaagaagtaaaaataat cagacagaaaatctggagctgaaaaatgcaattggcatactgaagaatgcattagagccc tttaatagcaaaatggattga >gi568815595f:98164285_98365259|GENSCAN_predicted_peptide_5|258_aa MASWDEKDLTVPQPNTRKVSALRKDFKGMLPAWAAFIINKLKRIWALNASSDTQSSTEQF YSSTGGIILTPPHLQTKKEAERVSDLVKVIWLGRAVIPSPGLLVGLGKSPPIAGTLPKIF VVVIVVAKSFRPWGSLRERPALSVFRESVGSSSFQVQSTDSTLALLSPHPWGARTCSQLP PSDAQGWVPETCIVWNQGLRDLGLCRMCLVNSTFTGSMLGKSHRHSPFSINQGHNALQKA TGTSAQESRVLSKVSKIA >gi568815595f:98164285_98365259|GENSCAN_predicted_CDS_5|777_bp atggcctcgtgggatgagaaagacctgaccgtcccccagcccaatacccgtaaagtgtct gcgctgaggaaggattttaaaggaatgctgccagcctgggcagcattcatcatcaacaag ctgaagcgcatttgggccctaaatgccagcagcgatacacagagttctacagaacagttc tacagttctacagggggtattattctcacccctccccacttacaaacgaagaaagaggca gagagagtttctgatttagtcaaggttatatggttaggaagagctgtgatcccctcacca ggtctgctcgtgggccttgggaagtcacctccgattgctggcactctgcctaagattttt gttgttgtcattgttgttgctaagtctttcaggccatggggctcccttagggagaggcca gccctctcagtcttccgagagagtgtgggatcctcctcctttcaagtgcaaagcacagat tccaccttggcactcctgagcccacatccttggggcgccaggacctgttctcagctccct ccttcggatgctcagggttgggttccagaaacatgtattgtatggaatcaaggtttaagg gacctagggctgtgcaggatgtgccttgttaatagtacgtttacaggcagtatgcttggt aaaagtcatcgccattctccattctcaattaaccaggggcacaatgcactgcagaaagcc acagggacctctgctcaggaaagtcgggtattgtccaaggtttctaagatagcctga