GENSCAN 1.0 Date run: 3-Nov-116 Time: 15:24:37 Sequence gi568815591r:112983612_113184721 : 201110 bp : 37.03% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 583 578 6 1.05 1.01 Sngl - 24583 24191 393 1 0 72 43 199 0.986 9.79 1.00 Prom - 24801 24762 40 -7.95 2.03 PlyA - 24854 24849 6 1.05 2.02 Term - 30747 30572 176 1 2 67 46 199 0.320 10.54 2.01 Init - 34076 33926 151 2 1 115 56 102 0.286 8.68 2.00 Prom - 36754 36715 40 -4.55 3.14 PlyA - 37152 37147 6 1.05 3.13 Term - 40496 40281 216 2 0 11 43 127 0.538 -3.34 3.12 Intr - 43527 43443 85 2 1 79 92 66 0.674 5.00 3.11 Intr - 50427 50307 121 1 1 43 70 99 0.687 2.23 3.10 Intr - 50763 50651 113 2 2 10 78 91 0.224 -0.60 3.09 Intr - 56233 56148 86 1 2 43 76 117 0.301 3.70 3.08 Intr - 68053 67037 1017 1 0 -26 25 281 0.056 0.31 3.07 Intr - 70474 69802 673 0 1 18 79 459 0.190 28.74 3.06 Intr - 70826 70577 250 0 1 30 94 189 0.278 9.27 3.05 Intr - 79038 78711 328 0 1 89 49 138 0.266 4.35 3.04 Intr - 82964 82834 131 0 2 81 82 29 0.346 1.09 3.03 Intr - 84727 84609 119 0 2 85 59 51 0.099 1.09 3.02 Intr - 101016 100029 988 1 1 78 30 464 0.001 28.59 3.01 Init - 102284 102191 94 2 1 52 44 98 0.272 2.39 3.00 Prom - 106117 106078 40 -5.65 4.09 PlyA - 106157 106152 6 -1.75 4.08 Term - 107019 106211 809 1 2 67 47 226 0.557 8.77 4.07 Intr - 127262 127082 181 2 1 37 59 171 0.557 7.62 4.06 Intr - 130513 130437 77 2 2 68 81 54 0.278 1.02 4.05 Intr - 133996 133862 135 2 0 98 36 142 0.426 9.62 4.04 Intr - 150906 150744 163 0 1 79 21 111 0.100 2.13 4.03 Intr - 164002 163898 105 1 0 58 76 64 0.091 1.69 4.02 Intr - 168433 168382 52 0 1 90 84 -1 0.237 -2.31 4.01 Init - 168878 168754 125 2 2 98 94 97 0.669 10.99 4.00 Prom - 175450 175411 40 -5.55 5.03 PlyA - 175642 175637 6 1.05 5.02 Term - 182730 182619 112 2 1 74 42 105 0.954 1.55 5.01 Intr - 185996 185817 180 1 0 56 115 157 0.745 13.26 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:112983612_113184721|GENSCAN_predicted_peptide_1|130_aa MVLEVLASAIRQEKEIKGIQTGIEEFKLSLFADDMILYLENPIVSAQEKLFKLISNFNKV SGYKINVQKSQAFINTNNRQAESQITNEFLLTIATKRIKYLGIHLTREVKELFKESYKLL LKEIIEDTNK >gi568815591r:112983612_113184721|GENSCAN_predicted_CDS_1|393_bp atggtattggaagttctggcaagcgcaatcaggcaagagaaagaaataaagggtattcaa acaggaatagaggaattcaaactgtctctgtttgcagatgacatgatcctatatctagaa aaccccatcgtctcagcccaagaaaagctttttaagctgataagcaacttcaacaaagtc tcaggatacaaaatcaatgtgcaaaaatcacaagcattcataaacaccaacaacagacaa gcagagagccaaatcacaaatgaattcctattaacaattgctacaaagagaataaaatac ctaggaatacatctaacaagggaagtaaaggaactcttcaaagagagctacaaactacta ctcaaggaaatcatagaggacacaaacaaatag >gi568815591r:112983612_113184721|GENSCAN_predicted_peptide_2|108_aa MEQGVALIREAPAAQEPTEWVGGSGMGGCRSRALPHGKAAKVQREIECSPGIQASDSSCG HKLQAHPMGPGDMTTPVDPRTRLAAMDPGARPALVKGSKHTPVDKLQA >gi568815591r:112983612_113184721|GENSCAN_predicted_CDS_2|327_bp atggagcagggggtggcgctcatccgggaggctccggccgcacaggagcccacggagtgg gtgggaggctcaggcatggggggctgcaggtcccgagccctgccccacgggaaggcagct aaggtccagagagaaatcgagtgcagccctgggatccaggcttcagacagctcctgtgga cacaagcttcaggcccaccccatgggcccaggggacatgactaccccagtggaccccaga accaggttagctgctatggacccaggagccagacctgccctagtaaaaggctccaagcac accccagtagacaagctccaggcctga >gi568815591r:112983612_113184721|GENSCAN_predicted_peptide_3|1406_aa MRSVFTRGCTGGEIKIIERSFAWASTSIFRARVSVVGNLLISILLVKDKTLHRAPYYFLL DLCCSDILRSAICFPFVFNSVKNGSTWTYGTLTCKVIAFLGVLSCFHTAFMLFCISVTRY LAIAHHRFYTKRLTFWTCLAVICMVWTLSVAMAFPPVLDVGTYSFIREEDQCTFQHRSFR ANDSLGFMLLLALILLATQLVYLKLIFFVHDRRKMKPVQFVAAVSQNWTFHGPGASGQAA ANWLAGFGRGPTPPTLLGIRQNANTTGRRRLLVLDEFKMEKRISRMFYIMTFLFLTLWGP YLVACYWRVFARGPVVPGGFLTAAVWMSFAQAGINPFVCIFSNRELRRCFSTTLLYCRKS SCNAFSPLPNQASSSSHSSIHGSPVSFTTSIPKVTKNIRAEIIDIQYFISLHLLLFTVRP LETNAASISFITDFHKTCYSAWQLKVNQQNALSIPKKLLSLPLLWTSLLLLCLDHLHLLV DIALIVLYLQNHTGKALFDFLLRFFEERLQALDATCLKFPWKALLLSAADPGTMALAPIK WKVCSTVILQSGLCAPLGQNFQRKEQPAIFAVLQPPLVIPSQTGSGVDCQQTAADLQQRD LTVRRKTNKQKGIASTSTKRMSTQKLHLKVNYIKDQSKEQNWTKNEFDELTEVGFRRWVI TNSSELKEHVLTQCKEAKNLEKRLEELLTRITSLKKNINDLMKPKNTAREICEAHTSINS RIDQAEERIPEIEDQLNDIKHENKIREKRMKRNKQSLQEIWDYVKRPNLRLIAVPESDGE NGTKLENTLWDIIQENFPNLARQANIQIQEIQRTPQRYSSRRVTPRHIIVRFTNVEMKEK MLRAAIEKGRVTHKGKPIRLTRYKEELLPFLLKLFQTIEKEELLPNSFYEASIILIPKPG RDTTTKENFRPISLMNIGVKILNKILANRIQQHVKKLIHHNQVGFIPGMQGWFKICKPRN VIHQINRTNDKNHMFISIDAEKAFNKIQHPFMLKTPNKLGINGTYLKIIRAIYDKPTANI ILNGQKLETFPLKTSTRKGCPLSPLLFNIVLEVLARATGQEKEIKSIQVGREEVKLSLFA DDMILYLENPIILAQEKLLKLISNFSKVSGYKINVQKSQAFLYTNNRQIESQIMSESLFT IATKRIKYLGIQLTRDVKNLFKENYKPLLKEIREDTNKWENIPCSWIGRINIVKMAIMPK TERLCYSRNLDVESLTANVKVFGDGAFRRAKSTYKQKLIYKHFLLSSVDETTLQSMLSNK NVESSGAVEVAPNYFKEESPKSSCPEVGLQNQDEFLIQAPWSLPTPRHSRKPANSVTHHY KHPNSVHMNLGDSEAIGKSRRNGQILRYIQPTKIEPEEIQNLNRPKTSNEIETVIKSLPV KKISGPDGFTTEFYQIFTEELIMNST >gi568815591r:112983612_113184721|GENSCAN_predicted_CDS_3|4221_bp atgcgatcggtatttacccgaggatgtaccggtggggaaattaaaatcatcgaacgcagt tttgcctgggcttccaccagcatattccgggccagagtcagcgtggtgggcaacctcctg atctccattttgctagtgaaagataagaccttgcatagagcaccttactacttcctgttg gatctttgctgttcagatatcctcagatctgcaatttgtttcccatttgtgttcaactct gtcaaaaatggttctacctggacttatgggactctgacttgcaaagtgattgcctttctg ggggttttgtcctgtttccacactgctttcatgctcttctgcatcagtgtcaccagatat ttagctatcgcccatcaccgcttctatacaaagaggctgaccttttggacgtgtctggct gtgatctgtatggtgtggactctgtctgtggccatggcatttcccccggttttagacgtg ggcacttactcattcattagggaggaagatcaatgcaccttccaacaccgctccttcagg gctaatgattccttaggatttatgctgcttcttgctctcatcctcctagccacacagctt gtctacctcaagctgatatttttcgtccacgatcgaagaaaaatgaagccagtccagttt gtagcagcagtcagccagaactggacttttcatggtcctggagccagtggccaggcagct gccaattggctagcaggatttggaaggggtcccacaccacccaccttgctgggcatcagg caaaatgcaaacaccacaggcagaagaaggctattggtcttagacgagttcaaaatggag aaaagaatcagcagaatgttctatataatgacttttctgtttctaaccttgtggggcccc tacctggtggcctgttattggagagtttttgcaagagggcctgtagtaccagggggattt ctaacagctgctgtctggatgagttttgcccaagcaggaatcaatccttttgtctgcatt ttctcaaacagggagctgaggcgctgtttcagcacaacccttctttactgcagaaaatcc agctgtaatgccttctcacctcttcccaatcaagcatcatcttctagtcactcttccatc catggcagccctgtttccttcaccacttctattcctaaggtcaccaagaatatcagggct gaaattattgacatacagtacttcattagtctacatcttctcctcttcactgtaaggcct ctggagacaaatgccgcatccatctcattcatcactgatttccacaaaacctgttacagt gcctggcaactgaaagtcaaccagcaaaatgccttgagtatccccaaaaaactgttgtca ttacctttgctctggaccagcctgcttttgctttgcctggaccaccttcacctcttggta gacattgctttgattgtgctctatcttcagaatcacactggtaaagccctgtttgatttc ctgttacgattctttgaagaaaggcttcaggctcttgatgctacttgtttgaaatttccc tggaaagctctgctcttgtctgcagctgatccaggcaccatggccttggcacccatcaaa tggaaagtttgctcaactgtaattttgcagtcaggattgtgtgccccactgggacaaaac ttccagaggaaggaacagccagcaatctttgctgttctgcagcctccgctggtgataccc agtcaaacagggtctggagtggactgccagcaaacggcagcagatctgcagcagagggac ctgactgttagaaggaaaactaacaaacagaaaggaatagcatcaacatcaacaaaaagg atgtccacacaaaaactccatctgaaggtcaactacatcaaagaccaaagcaaggaacaa aactggacaaagaatgaatttgatgaattgacagaagtaggcttcagaaggtgggtaata acaaactcctccgagctaaaggagcatgttctaacccaatgcaaggaagctaagaacctt gaaaaaaggttagaggaattgctaactagaataaccagtttaaagaagaacataaatgac ctgatgaagccgaaaaacacagcacgagaaatttgtgaagcacacactagtatcaatagc cgaattgatcaagcggaagaaaggataccagagattgaagatcaacttaatgatataaag catgaaaacaagattagagaaaaaagaatgaaaaggaacaaacaaagcctccaagaaata tgggactatgtgaaaagaccaaacctacgtttgattgctgtacctgaaagtgatggggag aatggaaccaagttggaaaacactctttgggatattatccaggagaacttccccaaccta gcaagacaggccaacatccaaattcaggaaatacagagaacaccacaaagatactcttca agaagagtaacaccaagacacataattgtcagattcaccaacgttgaaatgaaggaaaaa atgttaagggcagccatagagaaaggtcgagttacccacaaagggaagcccatcagacta acaaggtacaaagaggagctgctaccattccttctgaaactattccaaacaatagaaaaa gaggaactcctccctaactcattttatgaggccagcatcatcctgataccaaaacctggc agagacacaacgaccaaggaaaatttcaggccaatatccctgatgaacattggtgtgaaa atcctcaataaaatactagcaaaccgaatccagcagcacgtcaaaaagcttatccaccac aatcaagttggcttcatccctgggatgcaaggctggttcaaaatatgcaaaccaagaaat gtaatccatcaaattaacagaaccaatgacaaaaaccacatgtttatctccatagatgca gaaaaggccttcaataaaattcaacaccccttcatgctaaaaactcccaataaattaggc attaatggaacatatctcaaaataataagagctatttatgacaaacccacagccaatatc atactgaatgggcaaaagctggaaacattccctttgaaaaccagcacaagaaaaggatgc cctctctcaccactcctattcaacatagtattggaagttctggccagggcaactgggcaa gagaaagaaataaagagtattcaagtaggaagagaggaagtcaaactgtctctgtttgca gatgacatgatcctatatctagaaaaccccatcatcttagcccaggaaaagcttcttaag ctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagca ttcctatacaccaataatagacaaatagagagccaaatcatgagtgaaagcctattcaca attgctacaaagagaataaaatacctaggaatacaacttacaagggatgtgaagaacctc ttcaaggagaactacaaaccactgctcaaggaaataagagaggacacaaataaatgggaa aacattccatgctcatggataggaagaatcaatattgtgaaaatggccataatgcccaaa actgaacgtttgtgttactcccgaaatttggatgttgaaagcctaactgccaatgtgaag gtatttggagatggggcctttaggagagcaaagagtacatataaacagaaactcatttat aaacactttttattgtcctcagtggatgaaacgactctacagagcatgcttagtaataaa aacgtggaaagttcaggagcagtagaagtagcaccaaactattttaaggaagagtctcca aaaagttcttgccctgaagtgggtctgcaaaatcaagatgaattcttaattcaggctcct tggtcacttcctacaccaagacacagcagaaagccagcaaattcagtgactcaccattat aagcatcctaattcagtgcacatgaatcttggggacagtgaggcaattggaaaatctaga agaaatggacaaattcttagatacatacaacctaccaagattgaaccagaagaaattcaa aacctgaacagaccaaaaacaagtaatgagattgaaactgtgataaaaagccttccagta aagaaaatctcaggacctgatggcttcaccactgaattctatcaaatatttacagaagaa ctaataatgaattctacttaa >gi568815591r:112983612_113184721|GENSCAN_predicted_peptide_4|548_aa MDSETPKQKARDAKFMLVKRLWQDKDGHLCRIVLHDHGWNPSETPVGPQMHGARSLSMEK LYEKFFGPNCVLQKLYVEALTPNETVFRDEAFKELGHTSPMIRHQNSRALDSETCTSGFL GSQAFDFPGSEAFRLELSHATSIPGSLGGTFSPQDSNIMTSVSTQLSLVLMSLLLVLPVV EAVEAGDAIALLLGGSGVQATHMVFMGTSVEEGGSYYQLSCGIHLQIIPTIRPLLSIPVA PTHVDHHEDVDDIRSTPQLVFQPPVLVCSHTAVKNYLRLVLEVLARAIRQEKEIKGIQLG KEEVKLSLSADDMIVYLKNPTVTAQNLLKLISNFSKVSGYKINVQKSQAFLYINNRQTES QIMSELPFTIASKRIKYLGIQLTRDVKDLFKEKYKPLFNEIKEDTKKWKSIPCSWVGRIN IMKMTILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSRKNKVGGIK LPDFKLYYKATVTKTAWYWYQNRDIDQWNRTDASEITPHIYNYLIFDKPKKNKQWEKDSL FNKWCWEN >gi568815591r:112983612_113184721|GENSCAN_predicted_CDS_4|1647_bp atggactcagaaactccaaagcagaaagcaagagatgcaaaattcatgctagtaaagagg ctctggcaagataaagatggacatctgtgcagaattgttctccatgaccatggctggaat ccgagtgagacaccagtaggtccacagatgcatggagcaagatcactatcaatggagaag ctgtatgagaagttttttggaccgaattgtgtcctccaaaagttatatgttgaagcccta actcccaatgaaactgtatttagagatgaagcctttaaagagctgggacacacttctcct atgattagacatcagaactccagggctttggactctgaaacttgcaccagtggcttcctg ggttctcaggcctttgacttccctggttctgaggctttcagacttgaactgagccatgct accagcatccctggatctttaggtggaaccttcagccctcaagattccaacatcatgacc tcagtttcaacacagttgtccttagtcctcatgtcactgcttttggtgctgcctgttgtg gaagcagtagaagccggtgatgcaatcgcccttttgttaggtgggagtggagtccaggct acccacatggtcttcatgggtaccagcgtagaagagggaggttcttactatcaactgtcc tgtggaatccacctccaaattatccctacgattcgcccacttctctccatccctgtggct cctacccatgtagaccatcatgaggacgtggatgacatccgcagcaccccccagctggtc ttccagcctcctgtattagtctgttctcacactgctgtaaagaactacctcagactagtg ttggaagttctggccagggcaattaggcaggagaaggaaataaagggtattcaattagga aaagaggaagtcaaattgtccctgtctgcagatgacatgattgtatatctaaaaaacccc actgtcacagcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatac aaaatcaatgtacaaaaatcacaagcattcttatacataaataacagacaaacagagagc caaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatc caacttacaagggatgtgaaggacctcttcaaggagaagtacaaaccactgttcaatgaa ataaaagaggatacaaagaaatggaagagcattccatgctcatgggtaggaagaatcaat atcatgaaaatgaccatactgcccaaggtaatttatagattcaatgccatccccatcaag ctaccaatgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaa aaaagagcccgcatcgccaagtcaatcctaagccgaaagaacaaagttggaggcatcaag ctacctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtac caaaacagagatatagatcaatggaacagaacagacgcctcagaaataacgccgcatatc tacaactatctgatctttgacaaacctaagaaaaacaagcaatgggaaaaggattctcta tttaataaatggtgctgggaaaactga >gi568815591r:112983612_113184721|GENSCAN_predicted_peptide_5|97_aa XATGEHALAKQGNKIESRRHGTQKMMTRTKRRVKKPHSLTAAWYPESNKSYFKQEEQALQ RDMDEAGNHHTQQANTRTENQTPHVLTHKWELNKENT >gi568815591r:112983612_113184721|GENSCAN_predicted_CDS_5|294_bp nnagcaactggagaacatgctctagcaaaacaagggaacaaaatagaaagtagaaggcat gggactcagaaaatgatgaccagaaccaagagaagggtaaagaaaccccacagcttgaca gcagcttggtacccagaaagtaacaagtcctattttaagcaggaggaacaagctctccaa agggacatggatgaagctggaaaccatcatactcagcaagctaacacaagaacagaaaac caaacacctcatgttctcactcataagtgggagctgaacaaagagaatacatag