GENSCAN 1.0 Date run: 4-Nov-116 Time: 12:32:56 Sequence gi568815579f:57961579_58168458 : 206880 bp : 48.71% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.09 PlyA - 1316 1311 6 1.05 1.08 Term - 18701 16723 1979 0 2 115 38 797 0.023 62.77 1.07 Intr - 26724 26629 96 1 0 90 94 72 0.932 7.98 1.06 Intr - 27136 27017 120 2 0 -38 82 232 0.699 10.47 1.05 Intr - 27453 27351 103 0 1 63 70 158 0.785 11.25 1.04 Intr - 38318 38132 187 1 1 108 45 40 0.387 1.39 1.03 Intr - 39765 39711 55 1 1 68 84 53 0.419 0.94 1.02 Intr - 40679 40546 134 1 2 76 40 86 0.573 2.89 1.01 Init - 41503 41202 302 1 2 29 78 256 0.681 13.43 1.00 Prom - 49043 49004 40 -8.26 2.06 PlyA - 49464 49459 6 1.05 2.05 Term - 51226 51025 202 0 1 45 42 270 0.995 14.96 2.04 Intr - 51685 51477 209 1 2 51 61 117 0.465 3.18 2.03 Intr - 51848 51694 155 0 2 74 -12 153 0.222 3.69 2.02 Intr - 52026 51993 34 0 1 77 64 30 0.262 -2.70 2.01 Init - 54587 54510 78 2 0 75 68 55 0.349 3.34 2.00 Prom - 54980 54941 40 -5.46 3.07 PlyA - 55012 55007 6 1.05 3.06 Term - 55774 55649 126 1 0 74 37 176 0.967 9.38 3.05 Intr - 58034 57918 117 2 0 22 85 142 0.765 7.96 3.04 Intr - 58703 58395 309 2 0 109 -35 311 0.695 17.41 3.03 Intr - 62408 62378 31 1 1 71 81 9 0.236 -3.37 3.02 Intr - 64388 64272 117 1 0 97 72 111 0.982 9.98 3.01 Init - 67384 67176 209 1 2 68 121 173 0.955 14.92 3.00 Prom - 69568 69529 40 -7.46 4.00 Prom + 72120 72159 40 -7.66 4.01 Init + 72463 72583 121 0 1 71 101 192 0.988 17.15 4.02 Intr + 73391 73475 85 0 1 34 66 122 0.984 3.48 4.03 Intr + 73862 73924 63 2 0 113 84 9 0.616 0.83 4.04 Intr + 76150 76628 479 1 2 137 100 492 0.964 47.99 4.05 Intr + 78872 78966 95 0 2 103 99 36 0.680 5.88 4.06 Intr + 83434 83982 549 0 0 -43 -3 689 0.187 39.77 4.07 Intr + 84029 84151 123 1 0 62 -14 216 0.676 9.58 4.08 Intr + 84254 84414 161 1 2 23 5 243 0.550 8.29 4.09 Intr + 84502 85143 642 1 0 2 11 1163 0.370 91.14 4.10 Intr + 85524 85728 205 0 1 77 55 104 0.786 5.20 4.11 Intr + 88269 88305 37 2 1 85 105 -10 0.439 -1.76 4.12 Intr + 89587 89622 36 2 0 90 98 11 0.388 0.53 4.13 Intr + 89920 89996 77 2 2 127 91 -13 0.696 2.13 4.14 Intr + 90912 91050 139 2 1 74 92 43 0.795 3.34 4.15 Intr + 91851 92469 619 1 1 118 -51 596 0.067 40.08 4.16 Intr + 92597 92704 108 2 0 33 65 128 0.070 4.40 4.17 Intr + 93863 93906 44 2 2 82 56 16 0.035 -4.32 4.18 Intr + 94792 94957 166 2 1 106 64 61 0.297 4.52 4.19 Intr + 98391 98457 67 0 1 129 97 34 0.844 7.31 4.20 Intr + 100002 100128 127 2 1 59 82 157 0.787 12.45 4.21 Intr + 101868 101963 96 1 0 65 94 47 0.886 2.98 4.22 Term + 105163 106883 1721 2 2 82 40 1755 0.965 158.33 4.23 PlyA + 108979 108984 6 1.05 5.08 PlyA - 114445 114440 6 1.05 5.07 Term - 122956 122818 139 0 1 -12 39 178 0.344 0.34 5.06 Intr - 123801 123111 691 1 1 53 83 717 0.651 58.28 5.05 Intr - 125462 125328 135 0 0 36 64 141 0.362 7.04 5.04 Intr - 125826 125716 111 1 0 127 68 97 0.326 11.95 5.03 Intr - 127259 127110 150 0 0 75 94 138 0.999 13.23 5.02 Intr - 128867 128287 581 1 2 86 80 540 0.238 45.34 5.01 Init - 131157 131138 20 0 2 80 107 25 0.222 1.50 5.00 Prom - 132507 132468 40 -5.86 6.00 Prom + 132888 132927 40 -3.86 6.01 Init + 136649 136757 109 1 1 95 61 123 0.159 10.73 6.02 Term + 142074 142090 17 1 2 104 53 -3 0.201 -3.80 6.03 PlyA + 143125 143130 6 1.05 7.07 PlyA - 145655 145650 6 1.05 7.06 Term - 146116 146033 84 1 0 97 48 1 0.418 -5.35 7.05 Intr - 146400 146286 115 2 1 43 83 208 0.721 16.25 7.04 Intr - 146787 146695 93 2 0 105 56 73 0.945 4.88 7.03 Intr - 147192 146918 275 0 2 7 -6 215 0.894 0.34 7.02 Intr - 157195 156770 426 1 0 -28 81 372 0.889 18.89 7.01 Init - 163050 163045 6 0 0 72 93 0 0.244 -0.25 7.00 Prom - 163143 163104 40 -4.66 8.05 PlyA - 164694 164689 6 1.05 8.04 Term - 167933 166300 1634 0 2 116 39 1292 0.048 117.06 8.03 Intr - 189277 189174 104 0 2 62 103 48 0.011 3.52 8.02 Intr - 193013 192850 164 2 2 3 72 103 0.018 -1.03 8.01 Init - 195993 195862 132 0 0 88 91 129 0.737 11.49 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 18342 16723 1620 0 0 64 38 847 0.887 72.64 S.002 Term + 91851 92473 623 1 2 118 42 605 0.859 53.48 S.003 Sngl - 167913 166300 1614 0 0 65 39 1290 0.810 117.15 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579f:57961579_58168458|GENSCAN_predicted_peptide_1|991_aa MGKPRPAGPRLTQAFLQWEATVEEGGLGPGQQSPSCRRLRRRCQKGPNPAQAGRGGAAQE PVMAAPLPDAGSARLELQLPADPAARSGRRRGSSRHCPRLSRRFSPCQFTQSTVKDVRPG LETAPGSLKHAVTAAVTVKLQIMVQSAWSISPRDGSHQPVGLLGSVSSVSCLARGGKPGG GEEGHWAPSSPGSGEMGLSCSQRMMSRLSLVIPVDAALQFTTVGVWKRGVAKVGDIASVL EFGKSLRRDLLSFAVVIVDPVTFKDVAVDFTQEEWGQLDLVQRTLYRDVMLETYGHLLSV GNQIAKPEVISLLEQGEEPWSVEQACPQRTCPEWVRNLESKALIPAQSIFEEEQSHGMKL ERYIWDDPWFSRLEVLGCKDQLEMYHMNQSTAMRQMVFMQKQVLSQRSSEFCGLGAEFSQ NLNFVPSQRVSQIEHFYKPDTHAQSWRCDSAIMYADKVTCENNDYDKTVYQSIQPIYPAR IQTGDNLFKCTDAVKSFNHIIHFGDHKGIHTGEKLYEYKECHQIFNQSPSFNEHPRLHVG ENQYNYKEYENIFYFSSFMEHQKIGTVEKAYKYNEWEKVFGYDSFLTQHTSTYTAEKPYD YNECGTSFIWSSYLIQHKKTHTGEKPYECDKCGKVFRNRSALTKHERTHTGIKPYECNKC GKAFSWNSHLIVHKRIHTGEKPYVCNECGKSFNWNSHLIGHQRTHTGEKPFECTECGKSF SWSSHLIAHMRMHTGEKPFKCDECEKAFRDYSALSKHERTHSGAKPYKCTECGKSFSWSS HLIAHQRTHTGEKPYNCQECGKAFRERSALTKHEIIHSGIKPYECNKCGKSCSQMAHLVR HQRTHTGEKPYECNKCGKSFSQSCHLVAHRRIHTGEKPYKCNQCERSFNCSSHLIAHRRT HTGEKPYRCNECGKAFNESSSLIVHLRNHTGEKPYKCNHCEKAFCKNSSLIIHQRMHSGE KRFICSECGKAFSGHSALLQHQRNHSEEKLN >gi568815579f:57961579_58168458|GENSCAN_predicted_CDS_1|2976_bp atgggcaaaccgaggcccgcggggccgcgacttacccaggcctttttgcagtgggaggcg acggtggaggagggtggtctgggaccgggacaacaaagtccatcctgccgccggcttcga cgcaggtgccagaaaggtcccaaccccgcccaggcagggcgaggcggcgcggcccaggaa cctgtcatggcggcgccgctgcccgacgccggaagtgcccgcctggaactacagctccca gcagaccccgcggcgcgctccggtcgacgccggggaagcagccgccattgtccgcggctg agccgtcgtttctccccctgccagttcacacaaagcactgtgaaggacgtcagacctggg ttggaaacagctccgggttcccttaaacatgcagtgacggcggcggtcactgtcaagttg caaataatggtgcaaagtgcctggtcaatcagtcctcgggatggcagccatcaacccgtg ggcctcctgggctctgtgtcctcagtatcctgcctggcacgtggagggaagcctggagga ggggaggagggccactgggctcccagcagcccaggttcaggtgagatggggctgtcctgt tcccagaggatgatgtccaggttatccctggtcatcccagttgatgcggctctccagttt acaacagttggagtttggaagaggggtgttgccaaggtgggggacatagccagtgtgctg gaatttggcaagagtcttcgaagagacctgctcagctttgctgtggttattgtggaccca gtgaccttcaaggacgtggccgtggacttcacccaagaagagtgggggcagctggacctt gttcagaggaccctgtaccgtgatgtgatgctggagacctatggtcacctgctctctgtg ggaaatcagattgccaagcctgaggtcatctccctgttggagcaaggagaagagccgtgg tcagtggagcaggcatgtcctcaacgcacttgtccagaatgggtgagaaatcttgaaagc aaagcattgatcccagcacagagcatttttgaggaagaacaatcccatggcatgaagttg gaaagatatatatgggatgatccttggttctccaggttagaagttttgggatgtaaagac caattagaaatgtaccacatgaaccagagtacagctatgaggcagatggtcttcatgcaa aagcaagtactatcccagagaagctctgaattctgtggacttggggcagagtttagccag aacttaaactttgttccatctcagagagtttctcagatagaacatttctataagcctgat acacatgctcaaagttggagatgtgactcagccataatgtatgcagataaggttacctgt gaaaataatgattatgacaaaactgtttatcagtccattcaacctatttaccctgcaaga atacaaactggagataatcttttcaaatgtactgatgctgttaaatctttcaatcatata atacattttggtgatcataaaggaattcacacaggagaaaaactctatgaatataaggaa tgccatcaaatctttaaccagagcccatcatttaatgaacacccaaggcttcatgttgga gaaaaccagtataattacaaagaatatgagaatatcttttatttctcatcctttatggaa catcaaaaaattggtactgtagagaaagcgtataaatacaatgaatgggagaaagtcttt gggtatgactctttccttactcaacatacaagcacttacactgcagagaaaccctatgac tacaatgaatgtgggacgtctttcatctggagctcttaccttattcaacataagaaaact catactggagaaaaaccctatgaatgtgataaatgtggaaaagtttttaggaatcgctca gcccttacgaaacatgaacggactcacactggaataaaaccctatgaatgtaataaatgt ggaaaagccttcagctggaattctcatcttattgtacataagagaattcatacaggagaa aaaccttatgtttgtaatgagtgtgggaaatctttcaactggaactctcatcttattgga catcagaggactcatacaggagagaaaccttttgaatgtactgaatgtgggaaatcattc agctggagctcccatcttattgcccatatgagaatgcatactggagagaaaccctttaaa tgtgatgaatgtgaaaaagcttttagggactactcagcccttagtaaacatgaaagaact cattctggagcaaaaccatataaatgtactgaatgtggaaaatccttcagctggagctcc catcttattgcccatcagagaactcacacgggagagaaaccatataactgtcaggaatgt ggcaaagcattcagagaacgctcagccctcactaaacatgagataattcattctggaatt aagccctatgaatgtaataaatgtggaaaatcctgtagccagatggctcaccttgttaga catcaaaggactcatactggagaaaaaccctatgaatgcaataaatgtggaaaatccttc agtcagagctgtcaccttgttgctcatcggagaattcacactggtgagaaaccctataaa tgtaatcagtgtgaaagatcctttaactgtagttctcacctcattgcacaccggagaact catactggagagaaaccatacaggtgtaatgaatgtgggaaagcatttaatgagagttca tcccttattgtacacctaagaaaccatactggagaaaagccctacaaatgtaatcattgt gaaaaagcattttgtaagaattcttcccttattattcatcagagaatgcatagtggagag aaacgctttatatgcagtgaatgtggaaaagcctttagtggtcactcagccctacttcaa caccagagaaatcacagtgaagagaaactgaattga >gi568815579f:57961579_58168458|GENSCAN_predicted_peptide_2|225_aa MGDLRSLLISGCNEPCLIVVSFGDIQSLVSTLREIPTGTKRCLLCPEEQYSSRTRDGCLP RTETFLAFDDPLGLMLALVALMLASLAVLTVLKHRDTPVVGANNRALSCTPLTSLTLCAL CPLPCLGCPTAATCRLYQTTVAVVFTMAVSSVLAFRVTRGLPDAFNKSKFLTFSVLLFCS FWTTFLPLYHSAQGKSTVAVEIFSILASMAGLLGGIFAPNVTSSC >gi568815579f:57961579_58168458|GENSCAN_predicted_CDS_2|678_bp atgggggatttgagaagcctgctgatcagcggctgcaatgagccgtgtctcattgttgtg tcttttggggacatacagtctcttgtctccaccttgcgagaaatacccacaggtacgaag agatgtcttctgtgcccagaggagcagtactcaagccgcaccagagatggctgcctgccc aggacagagaccttcctggcctttgacgaccccctgggactcatgctcgccttggtggcg ctcatgctggccagcctggcagtgctgactgttctgaagcaccgagacacacctgtggtc ggggccaacaacagagctctcagctgcacgccgctcacctccctgaccctctgtgccctc tgtcccttgccttgccttggttgtcccacagctgccacctgccgcctctaccagaccaca gttgctgttgtgttcaccatggctgtctcttccgtcctggccttcagggtcaccaggggt ctgccagacgccttcaacaagtccaagttcctcaccttcagcgtgctgctgttctgcagc ttctggacaaccttcctgcccctgtaccacagcgcacagggcaagtccactgtggccgtg gagatcttttccatcctggcttccatggcagggctgctgggtggcatctttgccccaaat gttacatcatcctgctaa >gi568815579f:57961579_58168458|GENSCAN_predicted_peptide_3|302_aa MLLCSLLPALVGSLSGAAVSGPIGWRLPGKSPRFDCPGDVVVRASFSIFHLYNITLFDFT APPAGLESSSVSTWGYWEAQGFTFAMEEINRDAHLLPSLRLGFSIRNSGLGIVALWEAKV SPSSTLASLSDRTQFPSFFQTLLSHLTTTHAVVQLMLHFRWSWVSVLAQGDDFELQGRSL VVQELGQAGVCIEFQLCIPTRESLKMKNIIWLMENCTATIILKESKVHIAYTVVYAIAQA LAGCKHGDQGCADAWDFQPWLLLRPLKNVHFKTPDGTEIMFDANGDLITEFDVVYGQKTT EG >gi568815579f:57961579_58168458|GENSCAN_predicted_CDS_3|909_bp atgttgctgtgctctctgcttcccgcccttgtgggatctctctctggggctgctgtttct ggcccaataggctggcggttgccagggaagagcccccgctttgactgtccaggggatgtg gtggtcagggccagcttctccatcttccacctgtacaacatcaccctgtttgatttcact gctccaccagctggcttggagtcttcaagcgtttccacctggggctactgggaagcccaa ggattcacatttgccatggaggagatcaacagggacgcccacctgctccccagcctcagg ctgggcttctccatccggaactctgggctgggtatagtggccctgtgggaggccaaggtc agcccctcctccacactggccagcctcagcgacaggacccagttcccatccttctttcag accctgctcagtcacctcacgaccacccatgcagtggtgcagctgatgcttcacttccga tggtcttgggtgagcgtcctggcgcagggggacgactttgagctgcagggcaggtctctg gtcgtccaggagctgggccaggctggggtctgcattgaattccaactctgcatccccacc cgggagtccctgaagatgaaaaacatcatctggctgatggagaactgtacggccaccatc atcctgaaggaaagcaaagtacacatcgcctacacagtggtctatgccatcgcccaggcc ctggcaggctgcaagcatggggaccaggggtgtgccgatgcctgggacttccagccctgg ctgctgcttcgtcctctcaagaacgtgcatttcaagacccctgatgggacagagatcatg tttgatgccaacggagatttaattacagaatttgatgttgtctatggacagaagaccact gagggctga >gi568815579f:57961579_58168458|GENSCAN_predicted_peptide_4|1919_aa MAAARGAQGAGMGSARGVGRARQPRGRHDRVAQARAKPARASGISGKALLPIPAIEFGAR AADLSAHARPQPRTLEDAEGLQPPSLESPGPLIADSVRLPHRLCPEPLGLGIQSTHTPQR GTVAREMLPRPKAPASPRRPQTPTPSEQDADPGPASPRDTEAQRLRFRQFQYHVASGPHL ALGQLWTLCRQWLRPEARSKEQMLELLVLEQFLGALPSKMRTWVQSQGPRSCREAASLVE DLTQMCQQEVLVSLDSVEPQDWSFGEEEDGKSPRSQKEPSQRPRRDGGEVPGAERVLGEL RHYYHGFRLLRIHTKIAARMLWRILNCHTLTRQARRQLLRFCADLFRLVPFLLFVVVPFV EFLLPVAVKLFPNMLPSTSETQSIKEKRLKELQVKLELAEFLQDTIQEMALKNEAAKGSA TKDFSVFFQKIRETGERPSNEEIMGFSKLFEDELTLDNLTRPQLLTMRLRSIKAEDKLFA EEGVDSLNVKELQAACRARGMRALGLPDTLLPADQLKSTLQTLPEIVAKEAQVKVAEVEG KQVDKAKLEATLQEKATMQPGAQPQPEVPDTVLQSETMKDTAPVLEGLKEEEMTKEEISI LSNACSKLQEQKKSLTRKKEELELLKEDMQDYSEDLQELKKELSKTGEEKYMEGSKASKR LTKRVQQIIGQINGLISQLEMDQWASKLAPAKGVPMGENVISVAELINAMKQVKHIPKSK LTSLAAALNENKDGKVNVDHLIKVIELVDKEDVHVSTSQVAEIVATLEKEEKGEENEDHM DSAHTELAATPTLRGVRRHSLAPSGLKMPPFTCGPCFLDFSINRPVFLHAFSWIWVTVET ERSPGISGMHYLTAPLLPLISHRRQAQQSLYSSFAQLLGILDSFLRPLCIQASELILDAV AAAPALPEESEWLETTQLQQSLHTRAEAEAPRAPGLLGSRARLPLKPSIWDEPEDLLAGP SSDLRAEGTVISSPKGPSAQRISPRRRNRNTDQSGRHQPSLKHTKGGTQEAVAGISVVPR GPRGGRPFQCADCGMVFTWVTHFIEHQKTHREEGPFPCPECGKVFLHNSVLTEHGKIHLL EPPRKKAPRSKGPRESVPPRDGAQGPVAPRSPKRPFQCSVCGKAFPWMVHLIDHQKLHTA HGHMTQAVSRKTLDTCSEAKHGMGLPRVSAEKQQFHSTACPHWGLLALSYALIAEQSPLP YLTTPSIQQVLQSEGMSDSMSEGKSLWRHRDTIVGCSFQPGKPPSTSEGRAGPAVPARSQ GMTPGVRVSTDPEQVTFEDVVVGFSQEEWGQLKPAQRTLYRDVMLDTFRLLVSVGHWLPK PNVISLLEQEAELWAVESRLPQGVYPDLETRPKVKLSVLKQGISEEISNSVILVERFLWD GLWYCRGEDTEGHWEWSCESLESLAVPVAFTPVKTPVLEQWQRNGFGENISLNPDLPHQP MTPERQSPHTWGTRGKREKPDLNVLQKTCVKEKPYKCQECGKAFSHSSALIEHHRTHTGE RPYECHECLKGFRNSSALTKHQRIHTGEKPYKCTQCGRTFNQIAPLIQHQRTHTGEKPYE CSECGKSFSFRSSFSQHERTHTGEKPYECSECGKAFRQSIHLTQHLRIHTGEKPYQCGEC GKAFSHSSSLTKHQRIHTGEKPYECHECGKAFTQITPLIQHQRTHTGEKPYECGECGKAF SQSTLLTEHRRIHTGEKPYGCNECGKTFSHSSSLSQHERTHTGEKPYECSQCGKAFRQST HLTQHQRIHTGEKPYECNDCGKAFSHSSSLTKHQRIHTGEKPYECNQCGRAFSQLAPLIQ HQRIHTGEKPYECNQCGRAFSQSSLLIEHQRIHTKEKPYGCNECGKSFSHSSSLSQHERT HTGEKPYECHDCGKSFRQSTHLTQHRRIHTGEKPYACRDCGKAFTHSSSLTKHQRTHTG >gi568815579f:57961579_58168458|GENSCAN_predicted_CDS_4|5760_bp atggcggccgcccgcggcgcgcagggtgcggggatggggtccgcccgcggcgttgggcgc gctcgccagcccaggggtcgccatgaccgagtggcccaggcccgagcgaagcccgcgcgc gcatccggcatcagcggcaaagcccttctgcccatccccgcaatagaatttggggctcgg gctgcagatctcagtgcccatgccagacctcagcccaggaccctggaggatgctgagggg ctacagcccccctccctagagtctccagggcccctgattgctgattctgtccgcctgcca caccggctctgtccggagccactgggactcgggatccagtccacacacacccctcagagg ggcactgtggccagagaaatgcttccacggcccaaagcccctgcctcccccagacgcccc cagaccccaaccccgagtgagcaggacgcagaccctgggccagcaagccccagggacacc gaagcccagcgtctgcgcttccggcagttccagtaccacgtggcgagcgggccgcacctc gcgctgggccagctctggacgctgtgccgccagtggctgaggcccgaggcgcgctccaag gagcagatgctggagctgctggtgctggagcagttcctgggcgcgctgcccagcaagatg cggacctgggtgcagtcacagggcccccgaagctgcagggaggccgccagcctggtggag gacctcacacagatgtgccagcaggaagttctggtatctctggactcggtcgaaccccag gactggagtttcggtgaggaggaagatgggaagagtccaaggtcccagaaagaaccatcg cagcgcccccgcagagatggtggtgaagtcccgggggcagagagggtgctgggcgagctg aggcactactaccatggcttccgcctgctacggatccacaccaagatcgcagcacgcatg ctctggcgcatcctcaactgccacaccctgacccgccaggcgcgcaggcagttgctccgg ttctgtgccgacctcttccgcctggtgccgttcctcctgttcgtggtggtgccgttcgtg gagtttctgctgcctgttgctgtgaaactcttccccaacatgttgccatccacatccgag actcagtccatcaaggagaagaggctgaaggagcttcaggtcaagctggagctagccgag ttcctccaggacaccatccaggagatggccttgaagaacgaggcagccaagggcagtgcc accaaagacttctctgtgtttttccagaagatccgggagacgggggagaggcccagcaat gaggaaatcatgggtttttccaaattatttgaggatgagctgacccttgacaacctgaca cggccgcagctgctcacgatgcggctgcgctccataaaggcagaggacaagctgtttgct gaggaaggggtggacagcctgaacgtcaaggagctgcaggcggcatgtcgggcacgaggc atgcgggccctgggcctcccggacaccctcttgccagccgaccagctcaagtccacactg cagactctcccagagattgtggcaaaggaagcacaggtgaaagtggccgaggtggagggc aagcaggtggacaaggccaagctagaggccacactgcaggagaaggcgaccatgcagcca ggggcccagccccagccagaagtgcctgacactgtcctgcagtcagagaccatgaaggac actgcccccgtgctagagggcttgaaggaggaagagatgactaaggaggagatcagcatc ctcagcaatgcctgctctaagctgcaggaacagaagaagtcactcaccaggaagaaggag gagctggagctgctgaaggaggacatgcaggactacagcgaggacttgcaggagctcaag aaggaactttcaaagactggtgaagaaaaatacatggaaggatctaaagccagcaagaga ttgacaaaaagggtgcagcagataatcgggcagatcaacggcttgatctcgcagctggag atggaccagtgggccagcaagctggccccggccaagggcgtgcccatgggggagaacgtc atcagtgttgccgagctcatcaacgccatgaagcaagtcaagcacattcccaaaagcaag ctcaccagcctggccgcagcactgaatgaaaataaggatggcaaggtcaacgtcgaccac ctcatcaaggtgattgagctggtggacaaagaagatgttcacgtctccaccagtcaggtg gccgagattgtagcaacactggaaaaagaggagaaaggggaggagaatgaggaccacatg gactctgcccacacggagctggctgctacgcccaccctcagaggcgtcaggagacacagc ctggccccctcagggctgaagatgcctccattcacctgtggaccttgctttctagatttc agcatcaatagacctgtcttcctgcatgcttttagttggatctgggtcactgtggagaca gagcgatctcctggcatctcaggtatgcattatctcaccgcccctctgctgcccttgatt agtcacagacgccaagcccagcaatccctctactccagttttgcccagctattgggaatc ttggacagcttccttaggcccctgtgcatccaggcgtctgagctgattctggatgcagtg gcagcagccccagcactccccgaggaaagtgagtggctggagactacccagctccagcag agtctgcacaccagggcggaggccgaagcgccccgcgcccctggcttgctggggtcccgg gcccgcttgcctctgaagccgagtatctgggacgagcctgaggaccttctcgcagggccc tcctcagacctgcgggcagaagggactgtgatctcgagccccaagggtccaagtgctcag agaatcagtccccgaaggagaaacaggaacactgaccagagcggccgccaccagccatcc ctcaagcacaccaaaggtggtacccaagaggctgttgcaggcatctcggtagtgccgcgt gggccccgaggtgggcggcccttccagtgtgccgactgtgggatggtcttcacctgggtc acccacttcatcgagcaccagaagacccatcgcgaggaagggccctttccgtgccccgag tgtggcaaggtcttcctgcacaactccgtcctcactgagcatggcaagatccacctgctg gagccaccgaggaagaaagccccccggagcaagggcccccgggagtccgtcccacccagg gatggagcccagggcccagtggcccctcgcagccccaaaagacccttccagtgtagcgtc tgcgggaaggccttcccctggatggtccacctcattgaccaccagaagctccacacggcc cacggccacatgacacaagctgtttctcggaagaccctggacacctgctccgaagccaag cacgggatggggcttcccagggtctcagctgagaagcagcagttccacagcacagcctgt ccccactgggggctcctggctctatcctatgccttaattgctgaacagtctccactgcca tacttgaccacacccagcatacaacaggtcctacagagtgaagggatgagtgactcaatg agtgaggggaagtccctgtggagacaccgggacaccattgtgggctgcagtttccagccg gggaagcctccctcgacctctgaggggagagcagggccagccgttcctgccagaagccag ggcatgacccctggggtgcgcgtctccacagacccggagcaagtgacgtttgaggacgtg gtagtgggcttcagccaggaggagtgggggcagctgaagcctgcccagaggaccctgtac cgtgatgtaatgctggacaccttcaggcttctggtctctgtgggacattggttaccgaag ccgaatgtcatctccctgctggagcaagaggcagagctgtgggcggtggagtctagactt ccccaaggcgtgtacccagacttggaaactagacccaaagtcaaactgtcagttctaaag caaggcatctctgaagaaatatccaacagtgtcatcttggtagaaagattcctgtgggat ggtctgtggtactgcaggggtgaggacactgagggccactgggaatggagttgtgagagt ctagagagcctggcagtgccggtggccttcacgcctgtgaagacgcctgttctggagcag tggcagaggaatgggtttggggaaaacataagtctgaaccctgatctcccacatcaacca atgactcctgaaagacaaagcccccacacatggggaacacgtggaaaaagggagaagcca gacctaaatgttttacagaaaacctgtgtaaaagagaaaccctacaaatgtcaggaatgc ggaaaggcctttagtcacagctcagcacttatcgaacaccaccggacgcacacaggagag agaccttacgaatgtcacgaatgcttaaaaggcttccggaacagctcggcacttaccaaa caccagagaatccatactggggagaaaccctataaatgcactcagtgtgggaggaccttc aaccaaattgccccactgatccagcaccagagaactcacacaggtgagaagccctatgaa tgcagcgaatgtgggaaatccttcagttttaggtcctccttcagccagcacgagcgaact cacacaggcgagaagccctacgagtgcagtgagtgtgggaaagccttccggcaaagcatc cacctcacccagcatctgcgaatccacactggggagaaaccctatcagtgtggtgagtgt ggcaaggccttcagccacagctcatccttgaccaaacaccagcgaatccacacaggggag aagccctacgagtgccatgagtgtggaaaagccttcacccagatcacaccactgattcag caccagaggacccacacaggagaaaagccctatgagtgtggtgagtgtgggaaagccttc agtcagagcacactcctgaccgagcatcggaggattcacacaggagagaagccctatgga tgcaacgagtgtgggaaaaccttcagccacagctcctcactcagccagcatgagcggaca cacacaggagagaagccctatgagtgcagtcagtgtgggaaggccttccggcagagcaca cacctcacccaacaccagcgaatccacacaggggagaagccctatgaatgcaatgactgc ggcaaggcattcagtcacagctcgtccctcaccaaacatcagcgaatccacactggggag aagccctacgaatgcaaccagtgtggcagagccttcagccagcttgctcccctcattcag catcagaggatccacacaggagagaaaccctatgaatgtaaccagtgtggcagagccttc agccagagctcccttctcatcgaacaccagaggattcacaccaaggaaaagccgtatggg tgcaatgagtgtgggaaatccttcagccacagctcctcgctcagccagcacgaaaggacg cacactggggaaaagccctatgagtgtcacgattgcggaaagtcctttaggcagagcacc cacctcactcagcaccggaggatccacacaggagagaagccatatgcatgcagggactgt ggaaaggcctttacccacagctcctcccttaccaagcaccagagaactcacactggataa >gi568815579f:57961579_58168458|GENSCAN_predicted_peptide_5|608_aa MALFCLSWGGCALALCKAIALSVSSAGLWVLAPPEPTGHLVIPGTSPLEPPCPWLDSHIF QCRFGKMLPLEKAFASPRSSPAPPDLPTPGSAAGVQQEEPETIPERTPADLEFSRLRFRE FVYQEAAGPHQTLARLHELCRQWLMPEARSKEQMLELLVLEQFLGILPDKVRPWVVAQYP ESCKKAASLVEGLADVLEEPGMLLGSPAGSSSILSDGVYERHMDPLLLPGELASPSQALG AGEIPAPSETPWLSPDPLFLEQRRVREAKTEEDGPANTEQVGGHEQVGGRSQPRLMFQKL KSFPEDPQHLGEWGHLDPAEENLKSYRKLLLWEGGRRQESAGCACEEAAPAGVLPELPTE APPGDALADPPSGTTEEEEEQPGKAPDPQDPQDAESDSATGSQRQSVIQQPAPDRGTAKL GTKRPHPEDGDGQSLEGVSSSGDSAGLEAGQGPGADEPGLSRGKPYACGECGEAFAWLSH LMEHHSSHGGRKRYACQGCWKTFHFSLALAEHQKTHEKEKSYALGGARGPQPSTREAQAG ARAGGPPESVEGEAPPAPPEAQSAPNEKGPREALTSVNTPWASARIRGEPPDVEEDSAFL QPFSAAPC >gi568815579f:57961579_58168458|GENSCAN_predicted_CDS_5|1827_bp atggctctgttctgcttgagctggggtggctgcgccctggccttatgcaaggccattgcc ttgtccgtgtcctctgcaggtctgtgggttctggcaccacctgagcccactgggcatctg gtcatccctggcacctctcctttggagccaccttgtccctggctagacagtcacattttc cagtgccgttttggaaagatgttgcctttggagaaggcgtttgcctcccccaggagctcc ccagccccgccggatctgcccacgccggggtcagcagccggagtccagcaggaagaaccc gagaccatccctgagaggacccctgctgacctggagttctcccgcctgcgtttccgggaa tttgtctaccaggaggctgccgggccccaccagaccctggcccggctgcatgagctgtgc cgccagtggctgatgcctgaggcgcgctccaaggagcagatgctggagctgctggtgctg gagcagttcctgggcatcctgcctgataaggtccggccctgggtggtggcacagtaccct gagagctgcaagaaggcagcctccctggtggagggcctcgctgatgtcctggaagagcca gggatgctgctgggctcccctgcgggctcatcctcaattcttagcgatggagtgtacgag aggcacatggaccctctgctgctaccaggcgagctcgcgagccccagccaggcccttgga gctggggagatcccggcaccttctgagacaccctggctttctccggaccccctgtttctg gaacagaggagggtcagagaagcaaagaccgaagaggacggccctgccaacaccgagcag gtgggtgggcacgagcaggtgggtgggaggtctcagccacgactaatgtttcagaagctg aagtcctttccagaggaccctcagcacctgggggagtggggccacctggaccctgccgag gagaacctgaagagctaccggaagctgctcctgtgggaaggcgggaggcggcaggagagc gctgggtgcgcctgcgaggaggccgcccccgcgggggtgctgcctgagctgcctacggag gcgccccctggggacgcccttgccgatcccccgtcgggcaccactgaggaggaggaagag cagcctgggaaggccccggacccgcaggacccccaggacgcggagtccgactctgccacc ggatcgcagaggcagtccgtcatccagcagcctgccccggacaggggcacggcgaaactg ggaaccaagaggccgcaccccgaggatggggacgggcagagcctcgagggcgtctctagc tccggcgacagcgcagggctggaggccgggcagggccctggggctgacgagccgggcttg tcccgcgggaagccctatgcctgcggcgagtgcggggaggccttcgcgtggctctcgcac ctgatggagcaccacagcagccatggcggccggaagcgctacgcctgtcagggctgctgg aagaccttccacttcagcctggccctagccgagcaccagaagacccacgagaaggagaaa agctacgcgctggggggcgcccggggcccccaaccgtccacccgcgaagcccaggcgggg gctagggcgggcggtcccccagagagcgtggagggcgaggctccccccgcacccccagag gcgcagagcgctcccaacgagaagggtccacgggaagccctcacctctgtaaacacaccc tgggccagcgctcgcatccgaggggagccgccggatgtggaagaagactcggctttcctg cagccatttagtgccgccccatgctag >gi568815579f:57961579_58168458|GENSCAN_predicted_peptide_6|41_aa MAAGKLRLRTGPHRGLRLPQCREAVPRTGASKGAVTGLPIP >gi568815579f:57961579_58168458|GENSCAN_predicted_CDS_6|126_bp atggcggccggcaaactgcgcctgcgcactgggcctcaccgcggactacgactcccacaa tgccgcgaggctgtgccgcgcaccggggcgagcaagggagccgtgacaggcctccctatt ccctga >gi568815579f:57961579_58168458|GENSCAN_predicted_peptide_7|332_aa MTEGDTAAYEWDPCERDSAETTRAQGGAGSQADIRLVNLRDKVLVRSGSRQPTCAPGVFP SVKPHLPAEAKDKEPRKAALRALPGSGCQAAAATAPPRAGPGPALVALGPAPSVRCHRNI ASGNSARPCGESEGELTRTLRRRERKEQHKGEGHEEVFDRDTCPYEHNGSISGNEYNDSE CGLSVRPKSHFISHQRTHTGESPYTCNDCGKSLSKNYNLIVHQKIHTGEKSYKCKKSHTV EKLFECNECEKSFSDVSNLIVHLRIHMNIAGLIWHQRTHTGEKPYECNRCGKAFRDSSCL TKHKKRLHVEEFMLFESNKYVFPSINGRKNPG >gi568815579f:57961579_58168458|GENSCAN_predicted_CDS_7|999_bp atgactgaaggagacacagcagcctacgagtgggacccgtgtgaacgagattctgccgag accacgcgcgcccagggtggtgcaggctcgcaagcggatatacggctggtcaacctgaga gacaaggtgctcgtccgcagcggctcaaggcagcccacgtgcgctcccggcgtctttccg tctgttaaacctcacctaccagcagaagcgaaggacaaagaaccccgaaaggcagcactg cgagcgctcccaggttctggctgccaggcagctgcggccacggccccgccccgggccggc cccggccccgccctcgtcgcactaggccccgcccccagcgtccggtgtcaccggaatatc gcatccgggaactctgcgcgcccatgcggagagtcggagggcgagctgacgaggacgcta cggcggcgggaaagaaaggaacaacacaaaggagaaggacacgaggaggtgtttgatagg gacacatgtccttatgaacacaatggaagtatctctggaaatgaatacaatgacagtgaa tgtggactgtccgttagaccgaaatctcatttcatcagtcaccagaggactcatactgga gagagcccttatacctgtaatgactgtggaaaatccctctctaagaattacaacctgatt gtacatcaaaaaatccacacaggagagaaatcctataaatgcaagaaaagtcatactgtg gaaaaactattcgaatgtaatgaatgtgaaaaatccttttctgatgtctccaaccttatt gtccatctgagaattcacatgaacatcgcgggcctcatctggcaccagaggactcatact ggtgagaagccctacgagtgtaatcggtgtggcaaagccttcagggacagctcctgtctg accaagcacaagaagagacttcatgttgaagagtttatgttgtttgaaagtaacaaatat gtttttccatctataaatggaagaaaaaatcctggatga >gi568815579f:57961579_58168458|GENSCAN_predicted_peptide_8|677_aa MAACRSRALPRGEAAKSRREIERSAGGPALLGDPAHPPQLLARVLWNVSPGEISGRGMDR VAPLGRSRMPPGGGDQPERRGSVQQLDKGQRAFLARGLGVGRGKLGDSAHAHILVPAAAQ AQSPGTWLPPAANLWDMRLKMTTRNFPEREVPCDVEVERFTREVPCLSSLGDGWDCENQE GHLRQSALTLEKPGTQEAICEYPGFGEHLIASSDLPPSQRVLATNGFHAPDSNVSGLDCD PALPSYPKSYADKRTGDSDACGKGFNHSMEVIHGRNPVREKPYKYPESVKSFNHFTSLGH QKIMKRGKKSYEGKNFENIFTLSSSLNENQRNLPGEKQYRCTECGKCFKRNSSLVLHHRT HTGEKPYTCNECGKSFSKNYNLIVHQRIHTGEKPYECSKCGKAFSDGSALTQHQRIHTGE KPYECLECGKTFNRNSSLILHQRTHTGEKPYRCNECGKPFTDISHLTVHLRIHTGEKPYE CSKCGKAFRDGSYLTQHERTHTGEKPFECAECGKSFNRNSHLIVHQKIHSGEKPYECKEC GKTFIESAYLIRHQRIHTGEKPYGCNQCQKLFRNIAGLIRHQRTHTGEKPYECNQCGKAF RDSSCLTKHQRIHTKETPYQCPECGKSFKQNSHLAVHQRLHSREGPSRCPQCGKMFQKSS SLVRHQRAHLGEQPMET >gi568815579f:57961579_58168458|GENSCAN_predicted_CDS_8|2034_bp atggcggcctgcaggtcccgagccctgccacgcggggaggcagctaagtcccggcgagaa atcgagcgcagcgccggtgggccagcactgctgggggacccagcgcaccctccgcagctg ctggcccgggtgttgtggaatgtgagccctggggaaataagtggaagaggcatggaccgt gtggctccactggggcgttcgcgaatgcctcctggaggaggagaccaaccagaaagaaga ggcagcgtccagcaactggacaaaggccagagggcctttctggcaagaggactgggcgtt gggcgggggaagctcggggactccgcgcacgcgcacatccttgtcccggcagccgcgcaa gcgcaatcgcccggaacatggctgccgccggccgccaacctttgggatatgagattgaaa atgacgactcggaattttcctgagagagaagtaccctgtgatgtagaagtggaaagattc acaagggaagttccctgcttgtccagtttaggtgatggttgggactgtgagaaccaggag ggacacttgaggcaatcagctttaactctggagaaaccagggactcaggaagcaatttgt gaatatcctggttttggggagcatttgattgcaagctcagaccttccaccgtctcagaga gttctggcaacaaatggtttccatgcacctgactcaaatgttagtggtctggattgtgac cccgccttacccagctatcctaaaagttatgcagataagagaactggtgacagtgatgcc tgtggaaaaggcttcaaccattccatggaagttattcatggaagaaatccagtgagagag aagccctacaaataccctgaaagtgttaagtcttttaatcattttacctctcttggtcat cagaaaataatgaaaagaggcaagaaatcgtatgaaggtaagaattttgagaacatcttt actctgagctcatcgcttaatgaaaaccagagaaatctccctggagagaaacaatataga tgtactgaatgtggcaaatgcttcaaacggaactcttctcttgttttgcatcaccgaact cacaccggagagaagccttatacttgtaatgagtgtggaaagtccttctccaagaactac aacctgattgtgcatcaaagaatccacacaggagagaagccctatgaatgcagtaaatgt gggaaagctttcagtgatggctcagctctgacacagcaccagagaattcacacaggcgag aaaccttatgaatgcctagagtgtggaaaaaccttcaaccgaaattcatccttaattttg caccaaagaactcatacaggggaaaaaccatatagatgtaacgaatgtgggaaacccttc actgacatctcccaccttacagtgcatctcagaatccacaccggtgagaagccctatgag tgtagcaaatgtggaaaggctttccgggacggctcgtacctcacccagcatgagaggact cacactggagaaaagccctttgagtgtgcagagtgcgggaaatccttcaacagaaactct cacctcattgtgcatcaaaagatccattctggggagaaaccctatgaatgtaaagaatgt ggcaagactttcatcgagagtgcgtacctcatcaggcatcagaggattcatactggcgag aagccctatggctgcaaccagtgtcagaaacttttcaggaatatcgctggcctcattagg caccagaggactcatactggtgagaagccctatgagtgtaatcagtgtggcaaagccttc agggacagctcctgtctgaccaagcaccagagaattcacactaaggagaccccatatcag tgtccagaatgtgggaagtccttcaagcagaactctcacctggcagtacatcagagactc catagcagggagggtcccagccggtgtcctcagtgtggaaaaatgttccaaaagagctca tcccttgttcgacatcaaagagcacacctgggagagcaacccatggaaacataa