GENSCAN 1.0 Date run: 8-Nov-116 Time: 02:03:43 Sequence gi568815587r:5902054_6103148 : 201095 bp : 40.23% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 659 654 6 1.05 1.03 Term - 7978 7671 308 2 2 46 44 219 0.918 7.59 1.02 Intr - 8877 8761 117 1 0 56 47 106 0.684 2.82 1.01 Init - 8936 8930 7 2 1 74 115 0 0.980 2.39 1.00 Prom - 9696 9657 40 -7.05 2.00 Prom + 11277 11316 40 -7.55 2.01 Init + 13327 13677 351 0 0 88 55 412 0.198 35.11 2.02 Intr + 15506 16262 757 1 1 -42 40 402 0.131 12.51 2.03 Term + 17907 18361 455 1 2 30 44 183 0.198 2.43 2.04 PlyA + 18555 18560 6 1.05 3.00 Prom + 18672 18711 40 -5.65 3.01 Init + 22301 22387 87 1 0 99 65 47 0.124 4.19 3.02 Intr + 26741 26891 151 1 1 -9 92 131 0.110 2.61 3.03 Intr + 27140 28361 1222 0 1 74 0 632 0.476 39.36 3.04 Intr + 28528 28941 414 1 0 -3 57 463 0.576 26.39 3.05 Intr + 29022 29872 851 0 2 70 -20 426 0.571 20.08 3.06 Intr + 30032 30569 538 0 1 26 -10 243 0.391 -0.66 3.07 Intr + 30759 31447 689 0 2 30 -11 376 0.339 12.16 3.08 Intr + 33213 33508 296 1 2 47 26 150 0.550 0.50 3.09 Intr + 33673 33872 200 0 2 54 41 181 0.323 7.23 3.10 Intr + 42226 42407 182 1 2 24 15 144 0.011 -0.71 3.11 Term + 45424 46241 818 2 2 116 42 451 0.078 35.61 3.12 PlyA + 47162 47167 6 1.05 4.02 PlyA - 47905 47900 6 1.05 4.01 Sngl - 66318 65500 819 0 0 89 42 553 0.901 44.29 4.00 Prom - 73639 73600 40 -3.95 5.05 PlyA - 74566 74561 6 1.05 5.04 Term - 84777 83888 890 1 2 69 39 626 0.874 47.43 5.03 Intr - 92113 92006 108 2 0 47 48 90 0.160 0.24 5.02 Intr - 92717 92252 466 2 1 -23 58 596 0.459 37.27 5.01 Init - 92909 92817 93 2 0 90 42 57 0.495 1.85 5.00 Prom - 94158 94119 40 -6.55 6.00 Prom + 98088 98127 40 -4.05 6.01 Init + 99473 99577 105 1 0 79 68 104 0.968 7.77 6.02 Term + 100016 100987 972 1 0 61 39 850 0.844 68.46 6.03 PlyA + 101067 101072 6 1.05 7.00 Prom + 106692 106731 40 -7.15 7.01 Init + 109778 110094 317 1 2 72 36 204 0.650 10.46 7.02 Intr + 110888 111031 144 2 0 21 25 166 0.167 2.08 7.03 Term + 113287 113755 469 1 1 62 34 180 0.465 3.46 7.04 PlyA + 114285 114290 6 1.05 8.02 PlyA - 114627 114622 6 1.05 8.01 Sngl - 125516 124695 822 2 0 74 42 725 0.322 59.99 8.00 Prom - 128181 128142 40 -7.55 9.00 Prom + 128353 128392 40 -3.95 9.01 Init + 129410 129430 21 1 0 77 100 34 0.744 3.28 9.02 Intr + 132354 132444 91 2 1 89 91 59 0.564 4.95 9.03 Term + 137190 137329 140 1 2 53 36 119 0.303 0.34 9.04 PlyA + 139332 139337 6 1.05 10.04 PlyA - 140255 140250 6 1.05 10.03 Term - 144494 143909 586 1 1 53 48 421 0.547 27.60 10.02 Intr - 156122 155267 856 0 1 87 81 681 0.431 56.78 10.01 Init - 158132 158123 10 2 1 71 97 2 0.367 0.24 10.00 Prom - 158731 158692 40 -6.15 11.00 Prom + 159220 159259 40 -2.45 11.01 Init + 165360 165704 345 2 0 47 53 313 0.550 20.96 11.02 Intr + 171014 171132 119 1 2 20 109 107 0.419 4.34 11.03 Intr + 175349 175601 253 2 1 28 84 134 0.336 3.51 11.04 Term + 176835 176924 90 2 0 107 43 75 0.558 1.64 11.05 PlyA + 178353 178358 6 1.05 12.04 PlyA - 178957 178952 6 1.05 12.03 Term - 182206 182000 207 1 0 32 45 161 0.743 2.56 12.02 Intr - 182907 182807 101 1 2 71 94 49 0.312 2.71 12.01 Init - 187186 187129 58 1 1 61 37 76 0.482 1.38 12.00 Prom - 188658 188619 40 -5.35 13.02 PlyA - 188776 188771 6 1.05 13.01 Term - 194779 194655 125 2 2 120 43 121 0.132 8.37 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 194503 195045 543 0 0 73 50 172 0.812 7.74 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:5902054_6103148|GENSCAN_predicted_peptide_1|143_aa MPGSLRCQKTLNQLAFSVGWLTVWGKFSELVTSCLETLGVVVYPNEKEQRNNSGNITKQG SLTPPKDQTSSPAMDPNQDEISELPEKEFRRSIIKLIKEAPEKGEIQLKEIKNIIQDMKG KIFSEIDSINKKTNTTSGNEGHT >gi568815587r:5902054_6103148|GENSCAN_predicted_CDS_1|432_bp atgccaggctccttgagatgccaaaaaactctgaatcagcttgctttctcagtggggtgg ctgacggtctggggcaagttctcagaactggtcaccagctgcctggaaacacttggtgtg gtagtctacccaaatgagaaggaacagagaaacaattctggtaatattacaaaacaaggt tctttaacacctccaaaggatcaaaccagctcaccagcaatggatccaaaccaagatgaa atctctgaattaccagaaaaagaattcagaagatcaattattaagctgatcaaagaggca ccagaaaaaggtgaaatccaacttaaagaaatcaaaaacataatacaggatatgaaagga aaaatcttcagtgaaatagacagcataaataaaaaaacaaacacaacttctggaaatgaa ggacacacttag >gi568815587r:5902054_6103148|GENSCAN_predicted_peptide_2|520_aa MGKKQNRKTGNSKKQSTCPPPKERSSSPATEQSWMENDFDELREEGFRRSNYSELREDIQ AKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRSRCDQLEERINK IGRPLARLIKKKREKNQIDTIKNEKGYITTDPTEIQTTIKEYYKHLYANKLENLEEMDKF LDTYTIPRLNQEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQKYKEELVPFL LKLFQSIEKEGILPNSFYEASIILIPKPGKDTTKKKNFRPISLMNIDAKILSKILANRIQ QHIKKLIHDDQVGFIPGMQAWFNIHKSINVIQHINRTKDKNHMIISIDAEKAFDKIQQPF MLKKTLNKLAQNLLNLISNFSKVSRYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKR IKYLGIQLKRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRF NAIPIKLPMTFFTELEKTKVHMEPKKSPHHQVNPKPKEQS >gi568815587r:5902054_6103148|GENSCAN_predicted_CDS_2|1563_bp atggggaaaaaacagaacagaaaaactggaaactctaaaaagcagagcacctgtcctcct ccaaaggaacgcagttcctcaccagcaacggaacaaagctggatggagaatgactttgac gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggatattcaa gccaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaaggatcaacaaa attggtagaccgctagcaagactaataaagaagaaaagagagaagaatcaaatagacaca ataaaaaatgaaaaagggtatatcaccactgatcccacagaaatacaaactaccatcaaa gaatactacaaacacctctatgcaaataaactagaaaatctagaagaaatggataaattc ctcgacacatacactatcccaagactaaaccaggaagaagttgaatctctgaacagacca atcacaggctctgaaattgtggcaataatcaatagcttaccaaccaaaaaaagtccagga ccagatggattcacagccgaattctaccagaagtacaaggaggagctggtaccattcctt ctgaaactattccaatcaatagaaaaagagggaatcctccctaactcattttatgaggcc agcatcatcctgataccaaagcctggcaaagacacaaccaaaaaaaagaattttaggcca atatccttgatgaacattgatgcaaaaatcctcagtaaaatactggcaaaccgaatccag cagcacatcaaaaaacttatccacgatgatcaagtgggcttcatccccgggatgcaagcc tggttcaacatacacaaatcaataaatgtaatccagcatataaacagaaccaaagacaaa aaccacatgattatctcaatagatgcagaaaaggcctttgacaaaattcaacaacccttc atgctaaaaaaaactctcaataaattagcccaaaatctccttaacctgataagcaacttc agtaaagtctcaagatacaaaatcaatgtgcaaaaatcacaagcattcttatacaccaat aacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttcaaagaga ataaaatacctaggaatccaacttaaaagggatgtgaaggacctcttcaaggagaactac aaaccactgctcaatgaaataaaagaggatacaaacaaatggaagaacattccatgctca tgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatttatagattc aatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaaactaaagtt catatggaaccaaaaaagagcccgcatcaccaagtcaatcctaagccaaaagaacaaagc tag >gi568815587r:5902054_6103148|GENSCAN_predicted_peptide_3|1815_aa MASTNELNKAPGSNPGETEICNLSDGEFKGHLLSPPEARGDSPPGQIRLLPHGYPSVSFG YLPPGCSCCSHSSPIRLTLDRGHKGKGMGSTQSKIVQNTPLGCLLRNLPTLQLDQDLKRK RLIFFCTVAWPQYTLDNQSRWPPEGTLNFNILNDLTNFCQKQGKWSEIKYVQGFWDLHSP PDLCAPCSLVQVLLAKTSPKTSPDPDKDDLSPLSDPIDNLSSPPLQTAAHALPPPYASPP PSAPPSGHAPPTSSRVPPSPITPPPPPPPSSPPPPPLPSPAPPKSPVAAHTQARPALLAP LREVAGAEGIVRVHVPFSLVDLSKIERHLGSFSTNPTLFTKEFHYLCQVYDLTWHDIHII LTSTLSPEERERILIAARQHANQLHLTDPNVPVGTQAVPSTDPEWDYQVGQAGRRRRDIM VQCLLAGMQVASNKSVNFDKLKEIVQYPDENPAVFLNRLTDALVHYTRLDPASPAGATIL ATYFISQPWRLPCNRHSLRAQARKVEVQSPGPHLAPASSAATRDTGPAGALANSNRPACL TTVSSAAIQVIGQSSAQTPSRQCARALTADKWGTGGQTAPASERPLCLHMATPPRMVKAP SSSSNWTMTEEAQTREPLSPLPSSGGPSQPSSISVMGIDGTPSTYRQTPSLPCRLDHSFF THSFLIIPSCPVPLLGRDLLTKLGASVVFRPGPSTHLALLLPLLSADTAAQASPTLPLVF PTPVDPKVWDTNTPVVATHHKPVLIKLKDPLKFPARPQFPISLEHRRGLKPIIIRLLQQH ILITTNSPCNTPILPVRKASGTYRLVQDLRLINEAVIPTVPVVPNPYTLLSRIPPNRSHF TVLDLKDAFFSIPLDPACYFLFAFTWEDPDTGVSKQLTWTVLLQGFRDSPHFFGQALAQD LARCPLEASIRLTPNSKGLTSDRISLLQNLQPPQDAEDILSFLGLVGFFRHWVPNFGVLA RPLYQATKQTPLGPLSEPKLVANLFNKLKNCLITAPVLSLPNPLRPFHLFTDEREKVATG LLAQLVGRTHQPVAYLSKQLEPTVQGWQPCLQALAAAAKLTKEALKLTLGHPLTVFSSHR LQDLLSHNLQDNPILDPHQTLSVDGSSISTPQGQRRAAYAVVTSSQVVEAKPLPTGITSQ KAELIALTRALILSKNKKVNIYTDSKYAYLIAHTYSILWQERGFLTTKGTPIVNGPLIEK LIQVLKAPTQVAIIHCKSHQNSKDPISLGNNFANTTARATALLAPSPTPVCFLSPAYTPD YSPEELVHLMGHSGVKTNSNERFHSHTPNKSNQGWIFVDDRVVLPCSQKKLILTDMHLDT FSGWIEAYPTTHETAEVVASTLIEHIIPRFGLPRTIQSENGPAFISKIVKQVTTTLGVNW KLHTPYHPQSSGKVERANGLVKQHLIKLALETRQSWGAAKRARRPQPSKARTTRPRQSGH NNPRRSGTSKRPPGKRPLPPVERPHTVILTTLTAAKLIGLPSWGCSLRVKTRNTIKDTIQ KGNKEGIGAYELWEAGVGVARGRKRENLQTAEKQPLELYVYGEWANTTLLMTIWLEASLH QPLYYLLSLLSLLDIVLCLTVIPKVLTIFWFDLRPISFPACFLQMYIMNCFLAMESCTFM VMAYDRYVAICHPLRYPSIITDHFVVKAAMFILTRNVLMTLPIPILSAQLRYCGRNVIEN CICANMSVSRLSCDDVTINHLYQFAGGWTLLGSDLILIFLSYTFILRAVLRLKAEGAVAK ALSTCGSHFMLILFFSTILLVFVLTHVAKKKVSPDVPVLLNVLHHVIPAALNPIIYGVRT QEIKQGMQRLLKKGC >gi568815587r:5902054_6103148|GENSCAN_predicted_CDS_3|5448_bp atggcctcaacaaatgaactaaataaggcaccagggagcaatcctggagaaacagagata tgtaatctttcagatggagaattcaagggacacctcctaagccccccagaggctcgaggg gactcccctcctggtcagatcaggcttcttccccacggctacccatccgtttcgttcggt tacttgccaccaggttgcagctgctgcagccactccagtccaattcggctgacgctggac cgtggccataaagggaaggggatggggtccacccagtccaaaatcgtgcaaaacaccccc ttagggtgcctcctgcgcaacctcccaactttacaactcgaccaagatttaaaacgaaag cgactaattttcttctgcacagttgcctggccgcaatataccttagacaaccaatctcgc tggccccccgaaggcacactcaacttcaatatcctaaacgaccttaccaatttttgtcag aagcaaggcaaatggtcagaaatcaaatatgttcaagggttctgggaccttcactctcca ccagacctctgcgccccgtgttcactagtgcaagtccttttagctaaaacctctcccaaa acctctcctgatccggacaaagatgatctttctcctctctcagaccccatagataactta tcttcccctcccctccaaactgcagcccatgctctgccgccaccatacgcttccccaccc ccctcggctccaccttcaggtcacgcaccgcccacttcctcacgggtccctccttcgcca atcacgccacctcctcctccacctccatcatctcctcctcctccaccactaccctcacct gcccctcccaagtcccctgtggccgcacacacccaggccaggccggcactccttgccccg ctgcgcgaagtggctggagcagaaggcattgttcgagttcatgtgcctttctccctcgtt gatttgtccaaaatagaaagacacctaggctctttctccaccaaccctactcttttcaca aaagaatttcactatctctgtcaggtttatgacctcacttggcacgatatccatattatt cttacctccaccctttcccctgaggaaagagaacgcattctaattgcagcacggcagcat gccaatcaattacatctaacagaccccaacgtacccgttggaacacaggcggttccatcc accgatccagagtgggattaccaagttgggcaagcagggcgtcgccgccgggacataatg gttcaatgcctcctggcagggatgcaggtggcctccaataaatcagtcaactttgacaaa ctaaaggaaattgtacaatacccggatgaaaatccggcagtcttcctaaatcggctgact gatgccttagtccactatacccgcctagatccggcctcccctgcaggggcaaccatcttg gcaacatattttatttcccagccctggcggctgccgtgcaaccggcattccctaagagcc caggcaagaaaggtagaggtacaatctcccgggccccatctggcgcctgcttcaagtgcg gcaactcgggacactgggccagccggtgccctagccaacagcaaccgtcctgcctgcctt acaactgtttcaagtgcggcaatccaggtcattgggcaaagcagtgcccaaaccccaagc cgccaatgcgcccgtgccctaactgccgacaaatggggcactggaggtcagactgccccg gcctcagagcggccgctgtgtctccacatggcgacccctccccggatggtgaaggcgcct tccagctcctccaactggacgatgactgaagaggcccagactcgggaacccctctcaccc ttgccaagctcaggtgggcccagccaaccctcctcaatctctgttatggggattgatggc actccctccacctaccgccagacaccttcactgccctgccgcctagaccactcgttcttc acacactctttcctcatcatcccttcatgtccagtccctctcctagggagagatctcctg accaagttaggagcctcagtagtcttccggcccggcccatccacccacctagctcttctc ctacccctcctctcagccgatacagcagcccaggcctctccaaccctgccattagtcttc cctacccccgttgaccccaaagtttgggacaccaacactcccgttgttgccacacaccat aagccagtccttattaagttaaaggacccactcaagttccctgcccgacctcagttcccc atttccttagagcaccgtcgaggactaaaacccattattattcggcttctacaacaacac atcctaattaccactaattctccttgcaatactcctattctgccggtacgaaaagcctct gggacttaccgtctagtacaagatcttcgcctcatcaacgaggcagtcatccctacagtc ccggtagttcctaatccatacacactcctctctcgcatcccccccaacaggtctcacttc actgttctggaccttaaagatgcctttttctctatcccactagaccccgcttgttacttc ctctttgctttcacatgggaggacccagacactggcgtctctaaacaactcacctggacg gttctgctacaggggtttagagacagccctcatttctttggacaggcactggcccaagac ctcgctcgctgccctttggaggccagtatacgtctcaccccaaactcaaaaggcctaacc tcagacagaatcagccttttacaaaatcttcagcctccacaagatgctgaagatattctc tccttcttggggctagtcggctttttcagacattgggtccccaatttcggagtcctagca aggcccctgtatcaggccaccaaacagaccccccttggcccactgtctgagccaaaacta gtagccaatcttttcaataagcttaaaaattgcctcataacagccccagtcctctcactc ccaaacccgctacgcccattccatctctttaccgatgagcgggaaaaggttgctacaggg ctcctggcccaactggtaggaagaacacaccagccagtagcctacctttccaagcagctc gagcccactgtccagggctggcagccgtgcctgcaggctttggctgcggcagcgaaactc accaaagaggctcttaagcttaccttagggcaccctctcactgttttctcctcacatagg ctacaagacttactctcacacaacctccaagacaaccccatcctagatccccatcagacc ctgtctgtggatggtagctccatctccacgccgcagggccaacgacgagcggcatacgcc gtggtcacctcatctcaagtagttgaggccaaaccacttcccactggtataacctcccag aaggcagaactcatagctcttactagagctctaattctatctaaaaacaaaaaagtaaac atatacactgattccaagtatgcctacttaatagcgcatacctactccattctctggcag gaacgggggttccttaccactaaaggtactcctattgtcaatggacccctcattgagaaa ctcatccaggtgctaaaagcccccacacaggtagccatcatccattgcaaaagccaccaa aattctaaagaccctatatcattaggcaacaatttcgccaacaccactgcccgggccacg gccctcttggccccttcccccactcctgtgtgcttcctctctcctgcttacactcccgat tactcccctgaagagctagtccacctcatgggtcactcaggagtaaaaaccaactccaat gagcgctttcatagccacacccctaacaagagtaatcagggctggatattcgtggatgat agagtagttctcccgtgcagtcagaagaaactcatcctcaccgacatgcacctagatact ttctcaggttggattgaagcctatcccaccactcatgaaacagcagaggtggtagcttca accctcattgaacacataatcccgagatttggcctccctcggacaatccaatctgaaaat gggcctgcttttatctccaagatagtcaaacaggtgacaaccacacttggcgttaactgg aagctacacactccataccatccacagtcttctggaaaagtggaacgcgccaacggcctt gtcaaacaacacctaatcaaactggctctggagacacgccaatcgtggggagctgctaag agagcacgccgaccgcagccttccaaagcccggaccactcggcccagacagtctggccat aataaccccaggagatcaggtactagtaaaagacctccaggcaagaggcctctccccccg gtggaaaggccccatacggtaattcttacaacactgacggcagctaaacttataggcctt ccttcctggggttgttccttaagagtcaaaaccaggaatacaataaaagacacaatacaa aaggggaataaggagggaattggagcctatgagctgtgggaagcaggagtcggagttgca agaggaaggaagagggagaatctccagactgctgaaaaacagcccttagaattgtatgtg tatggagagtgggccaacaccaccctcctgatgaccatctggctggaggcctctctgcac cagcccctgtactacctgctcagcctcctctccctgctggacatcgtgctctgcctcact gtcatccccaaggtcctgaccatcttctggtttgacctcaggcccatcagcttccctgcc tgcttcctccagatgtacatcatgaattgtttcctagccatggagtcttgcacattcatg gtcatggcctatgatcgttatgtagccatctgccacccactgagatatccatcaatcatc actgatcactttgtagtcaaggctgccatgtttattttgaccagaaatgtgcttatgact ctgcccatccccatcctttcagcacaactccgttattgtggaagaaatgtcattgagaac tgcatctgtgccaatatgtctgtttccagactctcctgcgatgatgtcaccatcaatcac ctttaccaatttgctggaggctggactctgctaggatctgacctcatccttatcttcctc tcctacaccttcattctgcgagctgtgctgagactcaaggcagagggtgccgtggcaaag gccctaagcacatgtggctcccacttcatgctcatcctcttcttcagcaccatccttctg gtttttgtcctcacacatgtggctaagaagaaagtctcccctgatgtgccagtcttgctc aatgttctccaccatgtcattcctgcagcccttaaccccatcatttacggggtgagaacc caagaaattaagcagggaatgcagaggttgttgaagaaagggtgctaa >gi568815587r:5902054_6103148|GENSCAN_predicted_peptide_4|272_aa MGANATLLITIYLEASLHQPLYYLLSLLSLLDIVLCLTVIPKVLAIFWFDLRSISFPACF LQVFIMNSFLTMESCTFMIMAYDRYVAICKPLQYSSIITDQFVARAAIFVVARNGLLTMP IPILSSRLRYCAGHIIKNCICTNVSVSKLSCDDITLNQSYQFVIGWTLLGSDLILIVLSY FFILKTVLRIKGEGDMAKALGTCGSHFILILFFTTVLLVLVITNLARKRIPPDVPILLNI LHHLIPPALNPIVYGVRTKEIKQGIQNLLRRL >gi568815587r:5902054_6103148|GENSCAN_predicted_CDS_4|819_bp atgggggccaatgccacccttctgatcaccatctatctggaagcctctctgcaccagccc ctgtactacctgctcagcctcctctccctgctggacatcgtactctgcctcaccgtcatc cccaaggtcctggccatcttctggtttgacctcagatcaatcagcttccctgcctgcttc cttcaggtgttcatcatgaacagttttctgactatggagtcctgcacattcatgatcatg gcctatgaccgctatgtggccatctgcaagcccctacagtactcatccatcatcactgat caatttgtcgctagggctgccatctttgttgtggccaggaatggccttcttactatgcct atccccatactttcttctcgactcagatactgtgcaggacacatcatcaagaactgcatc tgtactaacgtgtctgtgtctaaactctcttgtgatgacatcaccttgaatcagagctac cagtttgttataggttggaccctgctgggctctgacctcatccttattgttctctcttac ttttttatcttgaaaactgtgctaaggattaagggtgagggagatatggccaaagctcta ggtacttgtggttcccacttcatcctcatcctcttcttcaccacagtcctgctggttctg gtcatcactaacctggccaggaagagaattcctccggatgtccccatcctgctcaacatc ctgcaccaccttattcccccagctctgaaccccattgtttatggtgtgagaaccaaggag atcaagcagggaatccagaacctgctgaggaggttgtaa >gi568815587r:5902054_6103148|GENSCAN_predicted_peptide_5|518_aa MAGGLAGMGGIQNEKETMQSLNDCLASYPDRTFEDQRAQIFANTVDNARIILQIDNARLA ADDFRIKYETELAMCQSVKSEIHGLHKVIDDTNVTRLQLETEIEALKEELLFMKNHEKEV KGLQAQIASSGLTMETDAPKSQDHAKIMADIRNNTTSWLGRTERSWTSTGLSRLRRAPQW SHAVRQDGAAQWDPAAPGVGAGRDLSRQAAPGPRVGSPAEHQGIPGLEESQHWIALPLGI LYLLALVGNVTILFIIWMDPSLHQSMYLFLSMLAAIDLVLASSTAPKALAVLLVHAHEIG YIVCLIQMFFIHAFSSMESGVLVAMALDRYVAICHPLHHSTILHPGVIGRIGMVVLVRGL LLLIPFPILLGTLIFCQATIIGHAYCEHMAVVKLACSETTVNRAYGLTMALLVIGLDVLA IGVSYAHILQAVLKVPGSEARLKAFSTCGSHICVILVFYVPGIFSFLTHRFGHHVPHHVH VLLATRYLLMPPALNPLVYGVKTQQIRQRVLRVFTQKD >gi568815587r:5902054_6103148|GENSCAN_predicted_CDS_5|1557_bp atggccgggggtctggcaggaatgggaggcatccagaatgagaaggagaccatgcaaagc ctgaatgactgcctggcctcctacccggacagaacctttgaggaccagagggctcagatc ttcgcaaatactgtggacaatgcccgcatcattctgcaaatcgacaatgcccgtcttgct gctgatgactttagaatcaagtatgagacagaactggccatgtgccagtctgtgaagagc gaaatccatgggctccacaaggtcattgatgacaccaatgtcactcggctgcagctggag acagagatcgaggctctcaaggaggagctgctcttcatgaagaaccatgaaaaggaagta aaaggcctacaagctcagattgccagctctgggttgaccatggagacagatgcccccaaa tctcaggaccatgccaagatcatggcagacatcaggaacaatacgacgagctggctggga agaacagagagaagctggacaagtactggtctcagcagattgaggagagcaccacagtgg tcacatgcagtccgccaagatggagcagctcaatgggatcctgctgcacctggagtcgga gctggcagagacctaagcagacaggcagcaccaggcccacgagtaggaagccctgctgaa catcaagggattccaggtttagaggaaagccagcactggattgcactgcccctgggcatc ctttacctccttgctttagtgggcaatgttaccattctcttcatcatctggatggaccca tccttgcaccaatctatgtacctcttcctgtccatgctagctgccatcgacctggttctg gcctcctccactgcacccaaagcccttgcagtgctcctggttcatgcccacgagattggg tacatcgtctgcctgatccagatgttcttcatccatgcattctcctccatggagtcaggg gtacttgtggccatggctctggatcgctatgtagccatttgtcaccccttgcaccattcc acaatcctgcatccaggggtcatagggcgcatcggaatggtggtgctggtgaggggatta ctactccttatccccttccccattttgttgggaacacttatcttctgccaagccaccatc ataggccatgcctattgtgaacatatggctgttgtgaaacttgcctgctcagaaaccaca gtcaatcgagcttatgggctgactatggccttgcttgtgattgggctggatgttctggcc attggtgtttcctatgcccacatcctccaggcagtgctgaaggtaccagggagtgaggcc cgacttaaggcgtttagcacatgtggctctcatatttgtgtcatcctggtcttctatgtc cctggaattttctccttcctcactcaccgctttggtcatcatgtaccccatcatgtccat gttcttctggccacacggtatctcctcatgccacctgcgctcaatcctcttgtctatgga gtgaagactcagcagatccgccagcgagtgctcagagtgtttacacaaaaggattga >gi568815587r:5902054_6103148|GENSCAN_predicted_peptide_6|358_aa MEESRGIQGKALNPKLFQGGHLDLTQAMPSSHTSQVLDSLLDLLGSHTINNGVQSWGNEV VQDVEQDGDIWRNSLPGQVSDDQNQQDCAEEEDEDEVGTTRAQGLGHSTLGLDPKHNFQY KRIGDNNKDKIRAQQSPACHKLVELIESDVITREFGHRQVTADAVLDYVSCTVSEPGRKH GNRKRNKGIPGYNKDHGPSHKLVSDDRRVSQWMADGHITVIGHDHERAGLHGQKTVHDEH LEEAGWEADRPEVKPEDGQDLGDDGEAEHDVQQGEEAEQVVQGLVQRGLQLDGDQEGGVS SHGQEEEKAEGQRQPVLPALEVGEADEEEFRDWGSGVIAGRCHVKFPDQSGDPVYLKT >gi568815587r:5902054_6103148|GENSCAN_predicted_CDS_6|1077_bp atggaagagagtcggggtattcagggcaaggcactcaaccccaagttgttccagggtgga catctggacttaacccaagccatgccttcatcgcacacaagccaggttttggattccctg cttgatctccttggttctcacaccataaacaatggggttcagagctgggggaatgaggtg gtgcaggatgttgagcaggatggggacatctggaggaattctcttcctggccaggttagt gatgaccagaaccagcaggactgtgctgaagaagaggatgaggatgaagtgggaaccaca cgtgctcaaggccttggccacagcaccctcggccttgatcctaagcacaactttcaatat aaaagaataggagataacaataaggataagatcagagcccaacagagtccagcctgccac aaactggtagagctgattgaaagtgatgtcatcacaagagagtttggacacagacaggtt actgcagatgcagttcttgattatgtttcctgcacagtatctgagcctggcagaaagcat gggaacaggaagagaaacaaaggcattccgggctataacaaagaccacggccctagccac aaactggtcagtgatgatagacgggtatctcaatggatggcagatggccacataacggtc ataggccatgaccatgaacgtgcaggactccatggtcaaaaaactgttcatgatgaacat ctggaggaagcaggctgggaagctgatcgacctgaggtcaaaccagaagatggccaggac cttggggatgacggtgaggcagagcacgatgtccagcagggagaggaggctgagcaggta gtacaggggctggtgcagagaggcctccagctggatggtgatcaggagggtggtgttagc tcccatggccaggaggaagagaaggctgaggggcagagacaaccagtgctgccagctctg gaagttggggaagcagatgaggaggaattcagagactggggcagtggagtcattgctggg agatgccatgtaaagtttcctgatcaatctggagacccagtgtaccttaaaacgtag >gi568815587r:5902054_6103148|GENSCAN_predicted_peptide_7|309_aa MPAAATGRQIQVPGWAPALYVAAAGLGTLQAASPAGTRECGGAQKLGDTRNHRAPKRESQ PRLRELPDLGSRKGCSSSFHLQCGEQGACFSPISVTATLLALPFSRFWECGEAMQQEQAP LSLQGQRVAFLSPQERRKARVYSPNLGGCICAQGCSSSIPGSKWLRYTSGHCFRECNMGL EPRTVPTGTLPSGAVRTGPPSSRFWYSRAASSLHPVPGKAIGTQCQPVRTVGAEPSKATG AELSKALGAHLLHQCALDVEHGVKGDYSGALRLNDFLAGFQTSMGPVALFLWPISYCLYP RCMMEVTCF >gi568815587r:5902054_6103148|GENSCAN_predicted_CDS_7|930_bp atgccagctgctgccacgggcagacagattcaggtgccaggatgggcaccagctctctat gtggctgcagctggactaggcacactgcaagcagcttccccggctggcaccagggaatgt ggtggtgcccagaagcttggagacaccaggaaccatagggccccaaagagggagtcacag ccccggctcagggagctcccagatctgggctcccgaaagggctgtagctcctcttttcac ctgcaatgtggtgagcaaggagcatgtttcagccctatttctgttacagcaactctttta gccttgccattcagcaggttctgggagtgtggagaggccatgcagcaggagcaggcacct ctgagcctgcaagggcagagggtggccttcctgagcccccaagagcgcagaaaggcccgg gtctacagccccaacttgggtggctgcatctgtgcccagggctgctctagctccatccct ggttcaaagtggctcaggtatacctcaggccactgcttcagagagtgcaatatggggttg gagcctcgcacagttcccactgggacactgcctagtggagctgtaagaacagggccacca tcctccagattctggtattctagggctgccagcagcttgcatcctgtgcctggaaaagcc ataggaactcaatgccagcctgtgagaacagtgggggctgagccctccaaagccacaggg gcagagctgtccaaggccttgggagcccacctcttgcatcagtgtgccctggatgtggaa catggagtcaaaggagattattctggagctttaaggcttaatgacttccttgctgggttt cagacttccatggggcctgtagcacttttcctttggccaatttcctactgcctgtacccc cgttgtatgatggaagtaacttgtttttga >gi568815587r:5902054_6103148|GENSCAN_predicted_peptide_8|273_aa MGANTTLLITIQLEASLHQPLYYLLSLLSLLDIVLCLTVIPKVLAIFWYDLRSISFPACF LQMFIMNSFLPMESCTFMVMAYDRYVAICHPLRYPSIITNQFVAKASVFIVVRNALLTAP IPILTSLLHYCGENVIENCICANLSVSRLSCDNFTLNRIYQFVAGWTLLGSDLFLIFLSY TFILRAVLRFKAEGAAVKALSTCGSHFILILFFSTILLVVVLTNVARKKVPMDILILLNV LHHLIPPALNPIVYGVRTKEIKQGIQKLLQRGR >gi568815587r:5902054_6103148|GENSCAN_predicted_CDS_8|822_bp atgggagctaacaccaccctcctgatcaccatccagctggaggcctctctgcaccagccc ctgtactacctgctcagcctcctctccctgctggacatcgtgctctgcctcaccgtcatc cccaaggtcctggccatcttctggtatgatcttaggtcgatcagcttccctgcctgcttc ctccagatgttcatcatgaacagtttcctccccatggagtcctgcacgtttatggtcatg gcctatgaccgttatgtggccatctgccacccactgcggtacccatccatcatcactaat caatttgtggccaaagctagtgtcttcattgtggtgcggaatgcgcttcttactgcaccc attcctatcctcacttccctgctccattactgtggggaaaatgtcattgagaactgcatc tgtgccaacttgtctgtgtccaggctctcctgtgataatttcacccttaacagaatctac caatttgtggctggttggaccttgctgggctcagatttattcctcatcttcctctcttac accttcattctaagagctgtgcttagattcaaagcagagggggcggcagtgaaggccctg agcacatgtggctcccacttcatcctcattcttttcttcagcaccatactgctggttgtg gtgttgacaaacgtggccagaaagaaggtccccatggacatcctgatcctgctgaacgtc cttcatcaccttattcctcctgcattgaaccctattgtgtatggggttcggaccaaagag ataaaacagggaattcagaagttactgcagagagggaggtga >gi568815587r:5902054_6103148|GENSCAN_predicted_peptide_9|83_aa MAAAIAKTQLGKHGLGSRDLEEEELCPWGNPSYCAPTVSPLATDLEASPSRGQPKQPPSS PGLEADSGTISASRALLTQCVWI >gi568815587r:5902054_6103148|GENSCAN_predicted_CDS_9|252_bp atggcggccgcaatagccaagacacagttgggaaaacatggcttgggaagcagagacttg gaggaggaggaactgtgtccctggggaaacccgtcttattgtgctccaacagtctcccct ctggcaacagatctagaggccagcccaagcagaggccagcccaagcagccgcctagttct cctggactagaggctgactctggtaccatctctgccagcagggcactgctgacccaatgt gtatggatttaa >gi568815587r:5902054_6103148|GENSCAN_predicted_peptide_10|483_aa MMTGIPGLEESQHWIALPLGILYLLALVGNVTILFIIWMDPSLHQPMYLFLSMLAAIDLV VASSTAPKALAVLLVRAQEIGYTVCLIQMFFTHAFSSMESGVLVAMALDRYVAICHPLHH STILHPGVIGHIGMVVLVRGLLLLIPFLILLRKLIFCQATIIGHAYCEHMAVVKLACSET TVNRAYGLTVALLVVGLDVLAIGVSYAHILQAVLKVPGNEARLKAFSTCGSHVCVILVFY IPGMFSFLTHRFGHHVPHHVHVLLAILYRLVPPALNPLVYGVKTQKIHHLLSLLDIVLCL TVIPKVLAIFWFDLRSIGFPACFLQMFIMNSFLPMESCTFMVKDYDHYVAICHPLQYLSI ITHQFVAKASVFIVVQNALLLSPVPILSAQLHYCRKNVIENCICANLSVSRLSCDNFTLN RLYQFVAGWTFLGSDFILIFLSYTFILRAVLRFKVEGVAVKALSTCGSHFILILFFSTCW LWC >gi568815587r:5902054_6103148|GENSCAN_predicted_CDS_10|1452_bp atgatgacaggaattccgggtttagaggaaagccagcactggatcgcactgcccctgggc atcctttacctccttgctctagtgggcaatgttaccattctcttcatcatctggatggac ccatccttgcaccaacctatgtacctcttcctgtccatgctagctgccatcgacctggtt gtggcctcctccactgcacccaaagcccttgcagtgctcctggttcgtgcccaagagatt ggttacactgtctgcctgatccagatgttcttcacccatgcattctcctccatggagtca ggggtacttgtggccatggctctggatcgctatgtagccatttgtcaccccttgcaccat tccacgatcctgcatccaggggtcatagggcacatcggaatggtggtgctggtgcgggga ttactactcctcatccccttcctcattctgttgcgaaaacttatcttctgccaagccacc atcataggccatgcctattgtgaacatatggctgttgtgaaacttgcctgctcagaaacc acagtcaatcgagcttatgggctgactgtggccttgcttgtggttgggctggatgtcctg gccattggtgtttcctatgcccacattctccaggcagtgctgaaggtaccaggaaatgag gcccgacttaaggcgtttagcacatgtggctctcatgtttgtgtcatcctggtcttctat atcccgggaatgttctccttcctcactcaccgctttggtcatcatgtaccccatcacgtc catgttcttctggccatactgtatcgccttgtgccacctgcactcaatcctcttgtctat ggggtgaagacccagaagatccaccacctcctctccctgctggacatcgtgctctgcctc accgtcatccccaaggtcctggccatcttctggtttgatcttaggtcgatcggcttccct gcctgcttccttcagatgttcatcatgaacagtttcctccccatggagtcctgcacattc atggtcaaggactatgatcattatgtggccatctgccacccactgcagtacctgtccatc atcactcatcaatttgtggccaaagctagtgtcttcattgtggtgcagaatgctttgctg ctttcacctgttcctattctctctgcccagctccattactgtaggaaaaatgtgattgag aactgcatctgtgccaacctgtctgtgtccaggctctcctgtgataatttcacccttaac agactctaccaatttgtggctggttggaccttcctgggctcggatttcatcctcatcttc ctctcctacaccttcattctaagagctgtgcttagattcaaggtggagggggtggcagtg aaggccctgagcacatgtggctcccactttatcctcatcctcttcttcagcacctgctgg ttgtggtgttga >gi568815587r:5902054_6103148|GENSCAN_predicted_peptide_11|268_aa MATERSRAMATSTPDSMAENAWMENIWERQECELMAVAPNQKMASILGSVERERTKSTKE SMARKRNMGWRRLHSVRMRKSRTPFPVNDRKNIKQKGSEIQMWVAVNPGMPARKKGASEA IGQCQSSAAKPRRSGKESVREPWARVPGALGVAARAVGKYCPWFPEKGTLDVDLLKQGRV RVTMENMVTVFRAVEKYCPWFPEKGTVYVKVWDRVGSTFWELVSTGNYVPITVWGDWALD PENVLHYILVLQEKFDPVSSAPLPPTEK >gi568815587r:5902054_6103148|GENSCAN_predicted_CDS_11|807_bp atggccacagagcggtccagggccatggccactagcacccctgactccatggcagagaat gcatggatggagaacatctgggaaagacaggaatgcgagctgatggctgtagcaccaaac cagaaaatggccagcatcttaggcagtgtggagagagagaggaccaagtcgacaaaggag agcatggcaagaaaaagaaacatgggctggcgaaggctgcattctgtccggatgagaaaa agcaggacaccattcccagtcaatgacaggaaaaacataaagcaaaagggaagtgagatc caaatgtgggtggcagtcaatcctgggatgccagccaggaagaaaggggcttccgaggca atcgggcagtgtcagtcttcagccgctaagccaagaagatctgggaaggagtcagtcaga gagccttgggcgagagttccaggggctctgggagtggctgccagggcagtgggaaaatac tgtccttggtttcctgaaaaaggaaccttagatgtagacctgttaaaacagggaagggtt cgagtaaccatggaaaatatggtcactgtattcagggccgtggaaaaatactgtccttgg tttcctgaaaaaggaaccgtatatgtaaaagtatgggatcgtgttggttcaacattctgg gaactggtctcaacagggaattatgttcccatcactgtttggggtgattgggccttggac ccagaaaatgtccttcattatatccttgtgcttcaagaaaagtttgacccagtaagctca gctcctctccctccaacagagaagtag >gi568815587r:5902054_6103148|GENSCAN_predicted_peptide_12|121_aa MRLGTVMEKNWAFLLTNTGCIPWLVVTAQECRAEICGRHSSMGGSATLRALRQTEITEIT DTKFRIWMARKLIEIKEKVETQSKESSKMIQELKDEIAVLRKNQTELLELKNSLKEFYDT V >gi568815587r:5902054_6103148|GENSCAN_predicted_CDS_12|366_bp atgcggttgggcaccgtcatggagaagaattgggccttcctgctgaccaacacgggctgt atcccatggttggtggttacagctcaagaatgcagagctgagatctgtggccggcactca agcatgggagggagtgccactctcagggcactgagacagactgaaataactgaaataaca gacacaaaattcagaatctggatggcaaggaagcttattgagatcaaagagaaagttgaa acacaatccaaggaatccagtaaaatgatccaagagctgaaagatgaaatagctgtttta agaaagaaccaaactgaacttctagagctgaaaaattcactaaaagaattttatgatacg gtttga >gi568815587r:5902054_6103148|GENSCAN_predicted_peptide_13|41_aa XLEVSGGWTTVNAAGPLGAASQWGCHIPGQRWEMSARDLVM >gi568815587r:5902054_6103148|GENSCAN_predicted_CDS_13|126_bp naattagaagtttctgggggctggactactgtgaatgctgctggtcctcttggtgcagcc agccagtggggctgccacattccaggacagcgctgggaaatgtctgcaagagatctagtt atgtga