GENSCAN 1.0 Date run: 3-Nov-116 Time: 07:01:38 Sequence gi568815594f:174199122_174417380 : 218259 bp : 36.37% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 318 313 6 1.05 1.03 Term - 5376 5243 134 1 2 73 47 110 0.484 2.67 1.02 Intr - 12628 12510 119 0 2 21 26 141 0.252 0.39 1.01 Init - 14904 14612 293 0 2 59 99 292 0.970 24.07 1.00 Prom - 33086 33047 40 -5.85 2.12 PlyA - 33123 33118 6 1.05 2.11 Term - 38478 38291 188 1 2 125 42 49 0.935 0.67 2.10 Intr - 40069 39873 197 0 2 91 99 80 0.964 7.54 2.09 Intr - 42097 41979 119 1 2 72 93 70 0.939 4.24 2.08 Intr - 57065 56934 132 2 0 21 23 140 0.481 0.72 2.07 Intr - 60704 60578 127 1 1 90 88 4 0.823 0.36 2.06 Intr - 63979 63643 337 2 1 93 110 195 0.755 15.85 2.05 Intr - 75393 75350 44 2 2 72 91 42 0.052 -0.13 2.04 Intr - 84495 84417 79 1 1 45 42 136 0.174 2.39 2.03 Intr - 84780 84659 122 2 2 74 41 156 0.565 8.72 2.02 Intr - 88091 87881 211 0 1 46 61 146 0.969 4.75 2.01 Init - 89917 89794 124 1 1 38 71 114 0.454 5.08 2.00 Prom - 90458 90419 40 -8.05 3.00 Prom + 92232 92271 40 -8.85 3.01 Init + 93990 94096 107 2 2 30 109 102 0.469 4.28 3.02 Intr + 102918 103065 148 0 1 98 63 105 0.997 8.32 3.03 Intr + 104582 104728 147 1 0 75 86 68 0.963 4.81 3.04 Intr + 105126 105248 123 2 0 71 91 65 0.957 4.96 3.05 Intr + 109568 109738 171 1 0 63 92 187 0.988 15.82 3.06 Intr + 110729 110935 207 1 0 97 63 117 0.988 8.45 3.07 Intr + 111662 111737 76 1 1 62 99 71 0.887 3.77 3.08 Intr + 117045 117169 125 1 2 47 94 45 0.228 0.38 3.09 Term + 121362 121445 84 2 0 76 43 75 0.305 -1.53 3.10 PlyA + 123837 123842 6 1.05 4.03 PlyA - 124215 124210 6 1.05 4.02 Term - 125446 125329 118 0 1 32 37 108 0.330 -2.87 4.01 Init - 132725 132502 224 2 2 55 121 89 0.549 6.98 4.00 Prom - 133928 133889 40 -2.55 5.06 PlyA - 134271 134266 6 1.05 5.05 Term - 148413 147520 894 0 0 -30 42 459 0.018 20.91 5.04 Intr - 181821 181580 242 1 2 96 98 106 0.864 8.75 5.03 Intr - 182115 181881 235 0 1 86 97 69 0.720 3.94 5.02 Intr - 182812 182711 102 1 0 68 37 98 0.777 2.15 5.01 Init - 184777 184589 189 1 0 73 98 69 0.759 5.47 5.00 Prom - 193901 193862 40 -3.35 6.03 PlyA - 194186 194181 6 1.05 6.02 Term - 194846 194658 189 2 0 58 42 93 0.245 -1.83 6.01 Init - 202507 202319 189 1 0 73 98 171 0.713 15.66 6.00 Prom - 213260 213221 40 -3.65 7.05 PlyA - 214024 214019 6 1.05 7.04 Term - 214810 214200 611 2 2 48 38 219 0.803 6.57 7.03 Intr - 215603 214959 645 0 0 44 40 228 0.158 4.62 7.02 Intr - 217431 217184 248 2 2 -41 58 178 0.086 -1.92 7.01 Init - 217895 217510 386 2 2 88 44 336 0.510 25.56 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 148335 147520 816 0 0 58 42 375 0.946 25.40 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:174199122_174417380|GENSCAN_predicted_peptide_1|181_aa MHVQTPKRPGKRGGSAKSIPRREETNARERFFYTHYGSCTWLPSVTGARGCEDGDREPRR GPREPNTGVPMVLRCGSPMSSPGASESALVVLWDLPFRKVRLFLCLDRARSGRHRVSAID CPATCGAESVVVVTSLRDKIDASGRPALMSALGWPQREKEAQVANSGVSDDRGASFTATL S >gi568815594f:174199122_174417380|GENSCAN_predicted_CDS_1|546_bp atgcatgttcagactcccaagagacccggaaaacgcggaggatctgccaaaagcattcca agaagagaagaaaccaacgccagggagagattcttttatacacattatggcagctgcaca tggcttccgtcagtaactggggccagggggtgtgaggacggtgaccgggagccaaggaga gggccacgggaaccgaacaccggagtccctatggtgttgcgctgtgggagtccaatgtct agtcctggagcctccgagagcgcgctggtggtcctttgggaccttccctttcggaaagta cggctgttcctgtgtcttgaccgtgccagatccggtagacaccgtgtgagcgctattgac tgtccagcgacttgcggagctgagagcgtagtggtggtcacttccctgagagacaagatt gatgcatctggaaggccagcactgatgtctgctctggggtggccacagagggaaaaggaa gcacaagttgccaactctggggtgtctgatgaccgtggggcctcattcacagctactctc tcctga >gi568815594f:174199122_174417380|GENSCAN_predicted_peptide_2|559_aa MKDKTHMGICTDAEKALDKIRYSFVIKTPNKLGMEEPYSKLALFQNELGYYLGSHNDLYP NGLEIEGIHRFPLLMLILHFNKPPPLSVKMFCTVIGPKLLLVSQLSTSKKKRIPRCRFHT AFKTPVKSDSAESLLNGQSTSNLQLRSFSDLDAGGTLSSEGRSAVSGILIAVTSTGVDKS QINEIQDQDYDYSGLQMGQGLWRVVRNQQLQQEGYSEQGYLTREQSRRMAASNISNTNHR KQVQGGIDIYHLLKARKSKEQEGFINLEMLPPELSFTILSYLNATDLCLASCVWQDLAND ELLWQGLCKSTWGHCSIYNKNPPLGFSFRKLYMQLDEGSLTFNANPDEQTSLKYYKNKNT YSHREQDSPKDTVHVTDMGNQAPQQLLAVAHGGVNYFMSKGILDDSPKEIAKFIFCTRTL NWKKLRIYLDERRDVLDDLVTLHNFRNQFLPNALREFFRHIHAPEERGEYLETLITKFSH RFCACNPDLMRELGLSPDAVYVLCYSLILLSIDLTSPHVKNKMSKREFIRNTRRAAQNIS EDFVGHLYDNIYLIGHVAA >gi568815594f:174199122_174417380|GENSCAN_predicted_CDS_2|1680_bp atgaaggataaaactcacatgggcatctgcacagatgcagaaaaagcacttgacaaaatt cgatactcttttgtgataaaaactcccaataaactaggtatggaagaaccttatagcaaa ttagctctatttcagaatgagctgggttactacctggggtcccataacgacctatatccc aatggcctagaaatagaaggaatacaccgattccccctcttgatgctgattcttcacttt aataagccaccaccattgtcagtcaagatgttttgcactgtcatcggtcccaaactcctg ctggttagccagctgtctacttccaagaagaaaaggattcctcgctgccgattccacacc gcgttcaagactcctgtgaagagcgactccgctgagagcctcctcaatggccagtccact tctaacttgcagctgagaagtttttctgatcttgatgctggaggtaccctgagttctgag ggtcgtagtgctgtttctggtattctcatcgcggtcacctctaccggtgtggacaaatcc caaattaatgaaattcaggaccaggattatgactactcaggcttgcagatgggtcaaggg ttgtggagagtggtcagaaaccagcagctgcaacaagaaggctacagtgagcaaggctac ctcaccagagagcagagcaggagaatggctgcgagcaacatttctaacaccaatcatcgt aaacaagtccaaggaggcattgacatatatcatcttttgaaggcaaggaaatcgaaagaa caggaaggattcattaatttggaaatgttgcctcctgagctaagctttaccatcttgtcc tacctgaatgcaactgacctttgcttggcttcatgtgtttggcaggaccttgcgaatgat gaacttctctggcaagggttgtgcaaatccacttggggtcactgttccatatacaataag aacccacctttaggattttcttttagaaaattgtatatgcagctggatgaaggcagcctc acctttaatgccaacccagatgagcaaacatcattaaaatattataaaaataagaatact tacagccacagggaacaagattctccaaaggacaccgttcatgtcacggatatgggtaac caagctccacagcagctgctggcagtggctcatgggggagtgaactactttatgtccaag ggtatcctggatgattcgccaaaggaaatagcaaagtttatcttctgtacaagaacacta aattggaaaaaactgagaatctatcttgatgaaaggagagatgtcttggatgaccttgta acattgcataattttagaaatcagttcttgccaaatgcactgagagaattttttcgtcat atccatgcccctgaagagcgtggagagtatcttgaaactcttataacaaagttctcacat agattctgtgcttgcaaccctgatttaatgcgagaacttggccttagtcctgatgctgtc tatgtactgtgctactctttgattctactttccattgacctcactagccctcatgtgaag aataaaatgtcaaaaagggaatttattcgaaatacccgtcgcgctgctcaaaatattagt gaagattttgtagggcatctttatgacaatatctaccttattggccatgtggctgcataa >gi568815594f:174199122_174417380|GENSCAN_predicted_peptide_3|395_aa MFKIRSAVVGLLLGLGWVWFLLGSQTNKSSCETLDSLIKGDPAASLPIISYSFTSYSPYV TELIMESNVELIAKNDLRFIDAVYKLLRDQFNYKPILTKKQFIQCGFAEWKIQIVCDILN CVMKKHKELSSLQKIPSQQRKKISSGKSEPPLGNEKISAEAVGVDISGRFMTSGKKKAVV IRHLYNEDNVDISEDTLSPITDVNEAVDVSDLNATEIKMPEVKVPEIKAEQQDVNVNPEI TALQTMLAECQENLKKLTSIEKRLDCLEQKMKGKVMVDENTWTNLLSRVTLLETEMLLSK KNDEFIEFNEVSEDYASCSDMDLLNPHRKSEVERPASIPLSSGYSTASSDSTPRASTVNY CGLNEISESPGLSPFFETEGIKLISVEGSDVYLKS >gi568815594f:174199122_174417380|GENSCAN_predicted_CDS_3|1188_bp atgttcaaaatcaggtctgcggttgtgggcctgttactggggcttgggtgggtgtggttc cttctgggttcccagacaaacaagtcttcctgtgagaccttggacagtttgataaaggga gacccagcagcatctttgcccatcatcagctattcttttacctcatactcaccttatgta acagaacttataatggaatccaatgtagagctcatagcaaaaaatgacttgcgctttata gatgctgtctataagcttcttcgtgatcaatttaattataaaccaattttgacaaaaaag cagtttatccaatgtgggtttgcagaatggaaaatccaaattgtttgtgatattttgaat tgtgtgatgaaaaagcacaaggaattaagcagtcttcagaagattccatcacaacaaaga aagaaaatcagttctggtaagtcagaacctcctttgggcaatgagaaaatatctgcagag gctgttggcgttgatatcagtggcaggtttatgacctcaggaaagaagaaagctgtggtg attcgtcacttgtataatgaagataatgttgacatttctgaggatacattaagtccaata acagatgttaatgaagcagttgatgtgtctgacttaaatgctactgaaataaagatgcct gaagtaaaggttcctgaaatcaaggctgagcaacaggatgtaaatgttaatcctgagatt actgcactacaaactatgcttgctgaatgccaagaaaatcttaagaaactgacttcgata gagaaaaggttagactgtttggaacaaaaaatgaaaggaaaagtgatggtagatgaaaac acctggactaatcttcttagtcgtgtcactcttcttgaaacagaaatgcttttgtctaaa aagaatgatgaatttatagagtttaatgaagttagtgaagactacgcttcttgtagtgac atggaccttctgaatcctcacagaaaaagcgaagtagagaggccagcaagtattcctctg tcctctggctatagtacagcatcatcagattcaactcccagagcctctactgttaattac tgtggtttgaatgagatttcagagtccccaggtttgtctcctttctttgaaactgaaggc atcaaactgatcagcgtggaaggaagtgatgtttatcttaagtcctaa >gi568815594f:174199122_174417380|GENSCAN_predicted_peptide_4|113_aa MKNMSEGLSADILRNFWKIEIRQTTKCYPCSRKEKVISFDMPIKRAGAEEEAGFQAEFWT SWMSFTETSEYSRKRRLTKSSVGMDAEQLKLSYIASGKSKWASQFGKEFGSFS >gi568815594f:174199122_174417380|GENSCAN_predicted_CDS_4|342_bp atgaaaaatatgagtgaggggctatcagcagacattttgagaaatttctggaaaatagaa attagacaaacaactaagtgttacccatgtagcaggaaggaaaaagtcatcagttttgat atgcccattaaaagagctggagctgaggaggaagctggtttccaggcagaattctggact tcgtggatgagcttcactgaaacatcagagtacagcaggaaaagaagactgacaaaatca agtgttggtatggatgctgagcaactgaaactctcatacattgctagtgggaaaagcaaa tgggccagccagtttggaaaagagtttggcagtttttcataa >gi568815594f:174199122_174417380|GENSCAN_predicted_peptide_5|553_aa MGPGDQHSAYGGPTLTPASEFPYYLLIIIGHFPERGMWQDNRIIVERRSAGKHVNKCLCI INKKMQQQRTILEAESNPRQTTKPASALILDFSASRTSLPVCLLFSWTWQTTAEERTIEK ENPEICRGSPLSIQPNIDQDMHVSKLSKAGKRNTQAAGETTLRVHPELRIAYVPTGGTRD LRKERDTETKYRERKVGPGVLSMRRTPAGTVSEFPQYVLIIIGRFSDRRMWQDNRVTVER GSAGKHMNKCHCIINKEAKNLNKRLQELLTGTLSLEKNINDLMELKNTARELHEAYTSIN SRINQVEERVSGIEDQLNKIKHEDKIREKKNERKEQSLQEILDYMKRPNLRLIGVPESDE ENGTKLENTLQDIIQENFPNLVRQANIQFPEIQRTQLRYSSRRGTPRHIIIRFTKVEMKE QMLRAAIEKGQVTHKGKPIRLIADLSGETLQARREWGPIFNIFKEKNFQPRISYSAQLSF ISEGEIKPFTDKQMLRDFCYHQACLIRAPEGSTKYGKEKLVPATPKHIPKYKEKQHYGET ASTNEQNNKLASK >gi568815594f:174199122_174417380|GENSCAN_predicted_CDS_5|1662_bp atgggcccaggagaccagcattcagcatacggaggacccacgctgacaccggcctctgag ttcccttactatttattgatcattatcgggcatttcccagagagggggatgtggcaggac aataggataatagtggagagaagatcagcaggtaaacacgtgaacaaatgtctctgcatc ataaacaagaaaatgcagcaacaaagaactatcttggaagcagagagcaaccctcgccag acaaccaaacctgctagtgccttgatcttggatttttcagcttccagaactagtctccct gtctgtttgttatttagttggacatggcagactaccgcagaggagagaactatagagaaa gaaaacccagagatctgcagagggtcccccttgagtattcagccaaatattgatcaggat atgcatgtgagcaaattatccaaagctggaaaaagaaacacccaagcagcaggggaaaca acccttagggttcacccagagctgagaatagcttatgtgcccaccggtggaaccagagac ttgagaaaagaaagagacacagagacaaagtacagagaaagaaaagtcggcccaggggtg ctcagcatgcggaggacccctgccggcacagtctctgagttccctcagtatgtattgatc attatcgggcgtttctcggacaggcggatgtggcaggacaatagggtaacagtggagaga gggtcagcaggaaaacatatgaacaaatgtcactgcatcataaacaaggaagctaagaac cttaataaaaggttacaggaactgctaactggaacacttagtttagagaagaacataaat gacctgatggagctgaaaaacacagcacgtgaacttcatgaagcatacacaagtatcaat agccgaatcaatcaagtggaagaaagggtatcagggattgaagatcaacttaataaaata aagcatgaagacaagattagagaaaaaaagaatgaaaggaaggaacaaagcctccaagaa atattggactatatgaaaagaccaaacctacgattgattggtgtacctgaaagtgatgag gagaatggaaccaagttggaaaacacacttcaggatattatccaggagaacttccccaac ctagtaagacaggccaacatccaatttccggaaatacagagaacacaactaagatactcc tcaagaagaggaaccccaagacacataatcatcagattcaccaaggttgaaatgaaggaa caaatgttaagggcagccatagagaaaggtcaggttacccacaaagggaagcccatcaga ctaatagcggatctctctggagaaaccctacaagccagaagagagtgggggccaatattc aacatttttaaagaaaagaattttcaacccagaatttcatattcagcccaactaagcttc ataagtgaaggagaaataaaaccctttacagacaagcaaatgctgagggatttttgttac caccaggcctgccttataagagctcctgaaggaagcactaaatatggaaaggaaaaactg gtaccagccactccaaaacacataccaaaatataaagaaaaacaacactatggagaaact gcatcaactaatgagcaaaataacaagctagcatcaaaataa >gi568815594f:174199122_174417380|GENSCAN_predicted_peptide_6|125_aa MGPGDQRSAYGGPTLTPASEFPYYLLIIIGHFPERGMWQDNRIIVERRSAGKQVNECLCI INKKQKNNCKTHMGPQETPIAKAFITKNNKAKGIIQLDFKIYCKAVVIKTECTGIKTDTL TNRMV >gi568815594f:174199122_174417380|GENSCAN_predicted_CDS_6|378_bp atgggcccaggggaccagcgttcagcatacggaggacccacgctgacaccggcctctgaa ttcccttactatttattgatcattatcgggcatttcccggagagggggatgtggcaggac aataggataatagtggagagaagatcagcaggtaaacaggtgaacgaatgtctctgcatc ataaacaagaaacagaaaaacaattgtaaaactcatatgggtccacaagagaccccaata gccaaagcattcataaccaaaaataacaaagctaaaggcatcatacaactggatttcaaa atatattgcaaagctgtagtaatcaaaacagaatgtactggcataaaaacagacacactg accaaccgaatggtatag >gi568815594f:174199122_174417380|GENSCAN_predicted_peptide_7|629_aa MGKKQSRKAENSKNQSTSPTPKERSSSPAMEQSWTENDFDELREEGFRRSNFSQLKEEVQ THHKEAKNLEKRLDKWLTRITSVEKSLNDLMELKTTARELRDEYTSVSSQFDQLEERVSV IEDQMNAVKPNLHPIGVPESDGENGTKLENTLQDIIQENFPNLARQANIQIQEIQRTPQS YSSRRATPRHIIVRFTKFEMKEKMLRAARKKEIQTTIREYYKHLYANKLENLEEMDKFLD TYILPRLNQEEVESLNRRITGSEVKAIINSLPTKKSPGPEVFTAKFYQRYKEELVPFLLK LFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRTISLMNNDAKILNKILANQIQQH IKKLIHHDQVGFIPGMQGWFNISKSINIIQHINRTKDKNHMIISIDAKKAFDKIQQPFML KTLNKLVLEVLARAIRQDKEIKGIQLGKEEVKLSLFADDMTVYLENPIVSAQNLLKLISN FSKVSGYKINVQISQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDMKDLFKEN YKPLLNEIKEDTNKWKNIPFSWIGRISIMKMTILPKVIYRFNAILHQATNDFLHRIGKNY FKVHMEPKKSPHCQNNPKPKEQSWRHHAT >gi568815594f:174199122_174417380|GENSCAN_predicted_CDS_7|1890_bp atggggaaaaaacagagcagaaaagctgaaaattctaaaaatcagagcacctctcccact ccaaaggaacgcagctcctcgccagcaatggaacaaagctggacggagaatgactttgac gagttgagagaagaaggcttcagacgatcaaacttctcccagctaaaggaggaagttcaa acccatcacaaagaagctaaaaaccttgaaaaaagattagacaaatggctaactagaata accagtgtagagaagtccttaaatgacctgatggagctgaaaaccacggcacgagaacta cgtgatgaatacacaagcgtcagtagccaatttgatcaactggaagaaagggtatcagtg attgaagatcaaatgaatgcagtgaaaccaaatctacatccgattggtgtacctgaaagt gatggagagaatggaaccaagttggaaaacactctgcaggatattatccaggagaacttc cccaacctagcaaggcaggcaaacattcagattcaggaaatacagagaacgccacaaagt tactcctcgagaagagcaactccaagacacataattgtcagattcaccaaatttgaaatg aaggaaaaaatgttaagggcagccagaaagaaagaaatacaaactaccatcagagaatac tacaaacacctctatgcaaataaactagaaaatctagaagaaatggataaattcctggac acatacatcctcccaagattaaaccaggaagaagttgaatccctgaatagacgaataaca ggctctgaagttaaggcaataattaatagcctaccaaccaaaaaaagtccaggaccagaa gtattcacagccaaattctaccagaggtacaaggaggagctggtaccattccttctgaaa ctattccaatcaatagaaaaagagggaatcctccctaactcattttatgaggccagcatc attctgataccaaagccgggcagagacacaacaaaaaaagagaattttagaacaatatcc ctgatgaacaacgatgcaaaaatcctcaataaaatactggcaaaccaaatccagcagcac atcaaaaagcttatccaccatgaccaagtgggcttcatccctgggatgcaaggctggttc aacataagcaaatcaataaacataatccagcatataaacagaaccaaagacaaaaaccac atgattatctcaatagatgcaaaaaaggcgtttgacaaaattcaacaacccttcatgcta aaaactctcaataaactagtgttggaagttctggccagggcaatcaggcaggacaaagaa ataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatg actgtatatttagaaaaccccatcgtctcagcccaaaatctccttaagctgataagcaac ttcagcaaagtctcaggatacaaaatcaatgtgcaaatatcacaagcattcttatacacc aataacagacaaacagagagtcaaatcatgagtgaactcccattcacaattgcttcaaag agaataaaatacctaggaatccaacttacaagggatatgaaggacctcttcaaggagaac tacaaaccactgctcaacgaaataaaagaggatacaaacaaatggaagaacattccattc tcatggataggaagaatcagtatcatgaaaatgaccatactgcccaaggtaatttataga ttcaatgccatcctccatcaagctaccaatgactttcttcacagaattggaaaaaactac tttaaagttcatatggaaccaaaaaagagcccgcattgccaaaacaatcctaagccaaaa gaacaaagctggaggcatcatgctacctga