GENSCAN 1.0 Date run: 3-Nov-116 Time: 03:49:25 Sequence gi568815576f:37586393_37791333 : 204941 bp : 54.29% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2633 2871 239 0 2 82 92 75 0.125 5.16 1.02 Intr + 20782 20902 121 0 1 52 100 5 0.000 -1.33 1.03 Intr + 27798 27882 85 1 1 83 61 276 0.957 23.78 1.04 Intr + 30530 30605 76 2 1 117 93 92 0.999 12.61 1.05 Intr + 32056 32154 99 0 0 115 81 218 0.996 24.61 1.06 Intr + 33846 33969 124 2 1 17 72 254 0.999 17.36 1.07 Intr + 34421 34521 101 0 2 69 100 79 0.996 7.53 1.08 Intr + 35224 35304 81 0 0 120 59 101 0.862 10.73 1.09 Intr + 36935 37075 141 1 0 70 50 248 0.963 20.26 1.10 Intr + 37121 37241 121 1 1 99 71 167 0.438 16.67 1.11 Intr + 38577 38684 108 1 0 110 81 209 0.962 23.16 1.12 Intr + 39405 39557 153 1 0 107 68 127 0.989 13.16 1.13 Intr + 43070 43134 65 0 2 141 91 38 0.961 8.33 1.14 Intr + 43606 43778 173 0 2 111 65 183 0.979 17.56 1.15 Intr + 44511 44707 197 0 2 39 81 175 0.997 11.48 1.16 Intr + 45604 45773 170 2 2 113 56 325 0.999 31.98 1.17 Intr + 46013 46123 111 1 0 90 116 154 0.999 19.48 1.18 Intr + 46209 46308 100 2 1 133 20 195 0.235 17.38 1.19 Intr + 47327 47544 218 0 2 21 -39 160 0.004 -4.65 1.20 Intr + 52658 52878 221 1 2 42 71 77 0.066 -0.97 1.21 Intr + 53377 53454 78 1 0 76 113 113 0.134 11.76 1.22 Intr + 54734 54776 43 2 1 99 98 88 0.895 9.63 1.23 Intr + 54982 55086 105 0 0 120 89 203 0.995 24.51 1.24 Intr + 56147 56223 77 1 2 81 72 66 0.983 3.11 1.25 Intr + 56503 56614 112 1 1 79 83 143 0.999 13.78 1.26 Intr + 56706 56782 77 2 2 103 75 110 0.968 10.01 1.27 Intr + 57252 57396 145 0 1 55 72 194 0.930 15.29 1.28 Intr + 58245 58313 69 2 0 98 67 107 0.969 9.47 1.29 Intr + 58478 58568 91 1 1 93 75 106 0.926 9.87 1.30 Intr + 58973 59118 146 0 2 104 59 147 0.963 13.91 1.31 Intr + 60426 60537 112 2 1 90 51 139 0.999 10.86 1.32 Intr + 60875 60956 82 0 1 138 94 157 0.999 20.49 1.33 Intr + 61049 61129 81 2 0 91 99 116 0.999 12.25 1.34 Intr + 61927 62043 117 1 0 97 72 190 0.999 18.38 1.35 Intr + 63760 63857 98 1 2 72 57 131 0.841 8.55 1.36 Intr + 64150 64333 184 2 1 108 100 42 0.973 6.76 1.37 Intr + 67387 67481 95 1 2 131 64 27 0.869 4.71 1.38 Intr + 69534 69613 80 1 2 108 50 46 0.169 2.57 1.39 Intr + 69850 69910 61 0 1 106 37 25 0.056 -2.00 1.40 Intr + 72263 72964 702 0 0 74 93 1075 0.044 98.67 1.41 Term + 79163 79479 317 0 2 134 52 454 0.984 41.85 1.42 PlyA + 80518 80523 6 1.05 2.00 Prom + 82416 82455 40 -0.61 2.01 Init + 85107 85154 48 2 0 51 77 64 0.598 1.35 2.02 Intr + 87108 87227 120 2 0 41 105 58 0.489 3.99 2.03 Intr + 90594 90673 80 2 2 51 78 86 0.865 2.74 2.04 Intr + 92091 92262 172 0 1 75 64 287 0.952 25.46 2.05 Term + 93211 93357 147 0 0 113 44 362 0.959 32.51 2.06 PlyA + 93380 93385 6 1.05 3.00 Prom + 96973 97012 40 -3.31 3.01 Init + 100001 100083 83 1 2 55 101 149 0.879 13.30 3.02 Intr + 101518 101623 106 1 1 106 85 223 0.589 24.52 3.03 Intr + 101920 101968 49 0 1 96 106 46 0.999 5.94 3.04 Intr + 102458 102600 143 0 2 86 90 264 0.999 26.98 3.05 Intr + 104305 104402 98 0 2 93 101 53 0.969 6.41 3.06 Term + 104782 104944 163 1 1 101 53 127 0.555 8.22 3.07 PlyA + 105350 105355 6 1.05 4.00 Prom + 109244 109283 40 -4.51 4.01 Init + 110645 110697 53 1 2 90 77 63 0.820 4.21 4.02 Term + 111196 111286 91 1 1 110 42 60 0.846 1.09 4.03 PlyA + 112279 112284 6 1.05 5.00 Prom + 113094 113133 40 -5.81 5.01 Init + 114974 115087 114 1 0 82 80 100 0.801 8.77 5.02 Intr + 122308 122482 175 0 1 38 39 92 0.145 -0.67 5.03 Intr + 123049 123194 146 2 2 32 89 107 0.838 5.71 5.04 Intr + 124035 124174 140 2 2 50 90 110 0.993 7.17 5.05 Intr + 126818 127019 202 2 1 108 77 96 0.762 10.31 5.06 Intr + 129371 129542 172 1 1 87 96 -3 0.808 0.43 5.07 Intr + 130849 130984 136 2 1 33 -26 156 0.545 -1.17 5.08 Intr + 131459 131637 179 2 2 53 56 146 0.828 7.98 5.09 Intr + 131691 132118 428 1 2 43 -8 221 0.016 2.08 5.10 Intr + 137287 140111 2825 0 2 81 99 839 0.237 72.78 5.11 Intr + 144128 144229 102 2 0 74 61 60 0.456 1.69 5.12 Intr + 146906 147020 115 2 1 32 99 60 0.521 2.45 5.13 Intr + 148007 149050 1044 1 0 109 110 197 0.325 15.37 5.14 Intr + 154503 154630 128 2 2 38 43 128 0.003 3.28 5.15 Term + 161488 161701 214 1 1 53 43 146 0.148 3.63 5.16 PlyA + 162804 162809 6 1.05 6.00 Prom + 163250 163289 40 -7.79 6.01 Init + 164749 164808 60 0 0 69 97 75 0.981 6.16 6.02 Intr + 165380 165436 57 1 0 129 94 69 0.977 11.17 6.03 Intr + 168709 168798 90 0 0 50 66 216 0.927 16.29 6.04 Intr + 169158 169267 110 2 2 66 70 154 0.990 10.98 6.05 Intr + 171221 171746 526 2 1 87 80 796 0.999 72.24 6.06 Intr + 172762 172872 111 0 0 113 110 69 0.971 12.68 6.07 Intr + 179278 179425 148 0 1 88 80 228 0.346 22.32 6.08 Intr + 181682 181784 103 0 1 94 63 141 0.999 11.93 6.09 Intr + 182636 182795 160 2 1 92 80 314 0.999 31.50 6.10 Intr + 182870 182983 114 1 0 93 105 152 0.997 18.55 6.11 Intr + 185258 185476 219 1 0 63 53 187 0.764 11.73 6.12 Intr + 185951 186151 201 1 0 64 16 96 0.354 0.00 6.13 Term + 186209 186370 162 1 0 138 44 337 0.928 32.55 6.14 PlyA + 188099 188104 6 1.05 7.04 PlyA - 190280 190275 6 1.05 7.03 Term - 197679 197565 115 2 1 73 47 74 0.297 0.15 7.02 Intr - 198608 198447 162 1 0 127 94 -31 0.306 1.01 7.01 Init - 204371 204220 152 2 2 114 66 95 0.769 9.38 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 22495 22324 172 0 1 74 43 191 0.984 10.81 S.002 Init - 22745 22726 20 2 2 96 80 11 0.970 0.69 S.003 Intr - 154367 154223 145 2 1 96 91 114 0.964 12.87 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576f:37586393_37791333|GENSCAN_predicted_peptide_1|1858_aa XGPSSALGHTDSAAITAFLMIAVSGPSGLPPRSTPYIYKPPSSRGNTYEPPGPHLPESES KGFQKQLGLNGHFPEGQNQQAQDRWRGRHGPVSQPAARGRQSAMDPGESGTRRRQVVTPQ DRATNPLNKELDWASINGFCEQLNEDFEGPPLATRLLAHKIQSPQEWEAIQALTVLETCM KSCGKRFHDEVGKFRFLNELIKVVSPKYLGSRTSEKVKNKILELLYSWTVGLPEEVKIAE AYQMLKKQGIVKSDPKLPDDTTFPLPPPRPKNVIFEDEEKSKMLARLLKSSHPEDLRAAN KLIKEMVQEDQKRMEKISKRVNAIEEVNNNVKLLTEMVMSHSQGGAAAGSSEDLMKPTRT LTRPSCCPQELYQRCERMRPTLFRLASDTEDNDEALAEILQANDNLTQVINLYKQLVRGE EVNGDATAGSIPGSTSALLDLSGLDLPPAGTTYPAMPTRPGEQASPEQPSASVSLLDDEL MSLGLSDPTPPSGPSLDGTGWNSFQSSDATEPPAPALAQAPSMESRPPAQTSLPASSGLD DLDLLGKTLLQQSLPPESQQVRWEKQQPTPRLTLRDLQNKSSSCSSPSSSATSLLHTVSP EPPRPPQQPVPTELSLASITVPLESIKPSNILPVTVYDQHGFRILFHFARDPLPGRSDVL VVVVSMLSTAPQPIRNIVFQSAVPKVMKVKLQPPSGTELPAFNPIVHPSAITQVLLLANP QKEKVRLRYKLTFTMGDQTYNEMGDVDQFPPPETWGPAKGPVASAESPERALGCGGPGPS QLEVGDRQGLAGAPAATDTEGEAPGCSRLPPAVLSPPHNLDAAAPQRENGQAQDSWPIRP LETASPEFQGLVHGHTAARGPRKPKLPASHPVVGGLVPGGQRSSAEESARQYPRQAQRGQ DSPQLAPKMMKRQLHRMRQLAQTGSLGRTPETAEFLGEDLLQVEQRLEPAKRAAHNIHKR LQACLQGQSGADMDKRVKKLPLMALSTTMAESFKELDPDSSMGKALEMSCAIQNQLARIL AEFEMTLERDVLQPLSRLSEEELPAILKHKKSLQKLVSDWNTLKSRLSQATKNSGSSQGL GGSPGSHSHTTMANKVETLKEEEEELKRKVEQCRDEYLADLYHFVTKEDSYANYFIRLLE IQADYHRRSLSSLDTALAELRENHGQADHSPSMTATHFPRVYGVSLATHLQELGREIALP IEACVMMLLSEGMKEEGLFRLAAGASVLKRLKQTMASDPHSLEEFCSDPHAVAGALKSYL RELPEPLMTFDLYDDWMRAASLKEPGARLQALQEVCSRLPPENLSNLRYLMKFLARLAEE QEVNKMTPSNIAIVLGPNLLWPPEKEGDQAQLDAASVSSIQVVGVVEALIQSADTLFPGD INFNVSGLFSAVTLQDTVSDRLASEELPSTAVPTPATTPAPAPAPAPAPAPALASAATKE RTESEVPPRPASPKVTRSPPETAAPVEDMARRSPRGATGRKERFACSYGTDSSLADMFFE IPLPLSSEETEAQTEVTCQGHAVAPPPGPSSAPIGGCVERARESAPCPRRERGAGGRRPA GCMARCERLRGAALRDVLGRAQGVLFDCDGVLWNGERAVPGAPELLERLARAGKAALFVS NNSRRARPELALRFARLGFGGLRAEQLFSSALCAARLLRQRLPGPPDAPGAVFVLGGEGL RAELRAAGLRLAGDPSAGDGAAPRVRAVLVGYDEHFSFAKLREACAHLRDPECLLVATDR DPWHPLSDGSRTPGTGSLAAAVETASGRQALVVGKPSPYMFECITENFSIDPARTLMVGD RLETDILFGHRCGMTTVLTLTGVSRLEEAQAYLAAGQHDLVPHYYVESIADLTEGLED >gi568815576f:37586393_37791333|GENSCAN_predicted_CDS_1|5577_bp nttggccccagctctgccctgggccacacagattctgctgccatcactgcttttctgatg attgcagtctcaggaccatctgggctgcctccccgcagcaccccgtacatatacaagcct ccttcctcaaggggaaacacctatgagcccccagggcctcatctcccagaatcagaatcc aaaggattccagaagcagcttggtttaaatggacattttcctgaaggccaaaatcaacag gcacaggaccgctggagaggtcgccacgggccggtttcccagccggccgcgagggggcga cagagtgcaatggacccaggtgagtcggggaccaggcgccgccaggtcgtgacgccacaa gatagagccacgaaccccctgaacaaggagctcgactgggccagcatcaacggcttctgc gagcagctcaacgaggactttgaggggcctccactcgccacccggctgctggcccacaag atccagtccccacaggagtgggaggcgatccaggccttgacggtgctggaaacatgcatg aagagctgcggcaagcggttccacgacgaagtgggcaagttccgctttctcaacgagctc atcaaggtcgtgtctcccaagtatctgggctctcggacatcggagaaggtgaagaacaag atcttggagctcctctacagctggacagtgggcctgcccgaggaggtgaaaatcgcagag gcctaccagatgctaaagaagcaggggattgtaaagtccgaccccaagcttccagatgac actacctttccccttcctcctccacggccgaagaatgtgatctttgaagatgaggagaaa tccaagatgctggcccgcctgctgaagagctcccatcccgaagacctccgcgcagccaat aagctcatcaaagagatggtgcaggaggaccagaagcggatggagaagatctcgaagagg gtgaatgccatcgaggaggtgaacaacaatgtgaaactgctcacggagatggtgatgagc cacagccagggcggcgcagcagctggcagcagcgaggacctcatgaagcccacgcggacc ctgacccgcccatcctgctgccctcaggaactgtaccagcgctgtgagcggatgcggccc acgctcttccgactggcgagtgacacagaggacaatgatgaggccttagcggagatcctg caggccaatgacaacctcacccaggtgatcaacctgtataagcagctggtgcggggtgag gaggtcaacggtgatgccacagccggctccatccctgggagcacctcggccctgctggat ctctcaggcctggatctcccgcctgcgggcaccacctacccagctatgcccacccgccct ggcgagcaggccagccctgagcagcccagtgcctcagtttccctgcttgacgacgagctc atgtctctgggcctcagtgaccccacacccccttcaggcccaagcctggatggtaccgga tggaacagcttccagtcgtcggatgccactgagcccccagcccctgctctggcccaggcc cccagtatggaaagccgacccccagcgcagacatccctgccagcaagcagcggtctggac gacctagacctcctggggaagaccctcctgcagcagtcgctgcccccggaatcccagcaa gtgcggtgggagaagcagcagccaaccccccggctcacactccgggacctgcagaataag agcagcagctgcagctcccccagctccagcgccaccagccttctccacaccgtgtcccca gagccccccaggcctccgcagcagcccgtaccaaccgagctctcactggccagcatcact gtgcccctggagtccatcaaacccagcaacatcctgcccgtgactgtgtatgaccagcac ggcttccgcatcctcttccattttgcccgggacccactgccagggcgctccgacgtgctg gtggtggtggtttccatgctgagcaccgccccccagcccatccgcaacatcgtgttccag tcagctgtccccaaggttatgaaggtgaagctgcagccaccctcgggcacggagctgcca gcttttaaccccatcgtccacccctcagcaatcacccaggtcctgctgcttgccaacccc cagaaggagaaggttcgcctccgctacaagctcaccttcaccatgggtgaccagacctac aacgagatgggggatgtggaccagttccccccacctgaaacctggggccctgcaaagggg cctgtggccagtgctgagtcaccagagagggcgctgggctgtggcggaccaggaccgtcc cagctggaagtgggcgaccgccagggcctggcaggagccccagctgctacagacaccgag ggggaggcccctggctgctcacgacttcctcctgctgtgctcagccctccacacaacctc gatgctgcagcaccccagagggaaaatgggcaggcccaggacagctggcccatcagacca ttagaaacagcgagtccggagttccaggggcttgtccacggccacacagcagcccgtggc cccaggaagccaaagctcccagccagtcatccagtggtggggggtttagttccagggggc cagaggtcctctgcggaagagagtgcaaggcagtatccgcggcaggcccagagaggccag gacagcccccagctcgcccccaagatgatgaagaggcagctgcaccgcatgcggcagctg gcccagacgggcagcttgggacgcaccccggagaccgctgagttcctgggtgaggacctg ctgcaggtagaacagcggctggagccggccaagcgggcagcccacaacatccacaagcgg ctgcaggcctgtctgcagggccagagcggggcagacatggacaagcgggtgaagaagctt cccctcatggctctgtccaccacgatggctgagagcttcaaggagctggaccctgattcc agcatggggaaggccttggagatgagctgtgccatccagaatcagctggcccgcatcctg gccgagtttgagatgaccctggagagggacgtcctgcagccactcagcaggctgagtgag gaggagctgccagccatcctcaaacacaagaaaagcctccagaagctcgtgtccgactgg aacacactcaagagcaggctcagtcaggcaaccaagaattcaggcagcagtcaaggccta ggaggcagcccgggtagtcacagccatacgaccatggccaacaaggtggagacgctgaag gaggaggaggaggagctgaagaggaaagtggagcaatgcagggacgagtacttggctgac ctgtaccactttgttaccaaggaggactcctatgccaactacttcattcgtctcctggag attcaggccgattaccatcgcaggtcactgagctcgctggacacagccctggctgagctg agggagaaccacggccaagcagaccactccccttcgatgacagccacccacttccccagg gtgtatggggtgtcgctggcaacccacctgcaagagctgggccgggagattgccctgccc atcgaggcctgcgtcatgatgctgctttctgagggcatgaaggaagagggtctcttccgt ctggctgctggggcctcggtgctgaagcgtctcaagcagacaatggcctcggacccccac agcctggaggagttctgctccgacccgcacgctgtggcaggtgccctcaagtcctatctg cgggagctgccagagcctctgatgaccttcgacctctatgatgactggatgagggcagcc agcctgaaggagccaggggcccggctgcaggccctccaagaggtgtgcagccgcctaccc cccgagaacctcagcaacctcaggtacctgatgaagttcctggcacggctggccgaggag caggaggtgaacaagatgacacccagcaacatcgccatagtcctgggacccaacttgctg tggccacctgagaaagaaggggaccaggcccagctggatgcagcctccgtgtcttccatc caggtggtgggcgtcgtcgaggcgctgatccagagcgcagacaccctcttccctggagac atcaacttcaacgtgtcaggcctcttctcagctgttaccctccaggacacagtcagtgac aggctggcctctgaggaacttccgtccactgccgtgcccaccccagccaccaccccggct ccggctccggctccagctccagctccggccccagccttggcttcagcagctaccaaggaa aggacagagtctgaggtgcctcccagaccagcctcccccaaggtcaccaggagtcccccg gagacagctgccccagtggaggacatggctcggaggagtcctaggggagccaccggaagg aaggagaggtttgcctgctcctacgggactgattcttctcttgccgacatgttttttgaa ataccattacctcttagcagtgaggagactgaggcccagacagaagtgacctgccaaggc cacgccgtcgccccgccccccggtccttccagcgcgccaattggcggctgcgtggaacgt gccagggagagcgcgccgtgcccgcggagagagcgcggcgcgggaggccggcggccggcc ggctgcatggcgcgctgcgagaggctgcgcggagcggccctgcgcgacgtgctgggccgg gcgcagggggtcctgttcgactgtgacggggtgctgtggaacggcgagcgcgccgtgccg ggcgccccggagctgctggagcggctggcgcgggccggcaaggcggctctgtttgtgagc aacaacagccggcgcgcgcggcccgagctggccctgcgcttcgcgcgcctcggcttcggg gggctgcgcgccgagcagctcttcagctccgcgctgtgcgccgcgcgcctgctgcgccag cgcctgcccgggcctccggacgcgccgggcgccgtgttcgtgctgggcggcgaggggctg cgcgccgagctgcgcgccgcggggctgcgcctggccggggacccgagcgcgggggacggc gcggccccgcgcgtgcgcgccgtgcttgtgggctacgacgagcacttctccttcgccaag ctgagggaggcgtgcgcgcacctgcgcgaccccgagtgcctactcgtggccaccgaccgt gacccatggcacccgctgagcgacggcagccggacccctggcaccgggagcctggccgct gcagtggagacagcctcgggacgccaggccctggtggtgggcaagcccagcccctacatg ttcgagtgcatcacggagaacttcagcatcgaccccgcacgcacgcttatggtgggtgac cgcctggagaccgacatcctctttggccaccgctgcggcatgaccactgtgctcacgctc acaggagtctcccgcctagaagaggcccaggcctacctagcggccggccagcacgacctc gtgccccattactatgtggagagcatcgcagacttgacagaggggttggaggactga >gi568815576f:37586393_37791333|GENSCAN_predicted_peptide_2|188_aa MAPLWAALLQHRQLKEQFQKGDGQVEKEVPVTTPQHAPHFAQCSPRPFLHQVPECKGLVA SNLNLKPGECLRVRGEVAPDAKSFVLNLGKDSNNLCLHFNPRFNAHGDANTIVCNSKDGG AWGTEQREAVFPFQPGSVAEVCITFDQANLTVKLPDGYEFKFPNRLNLEAINYMAADGDF KIKCVAFD >gi568815576f:37586393_37791333|GENSCAN_predicted_CDS_2|567_bp atggcgcctctgtgggccgccctccttcagcaccgccagctgaaggagcagtttcagaag ggggacggccaggtggagaaggaggtccccgtgacaaccccccagcatgcccctcatttt gcccagtgctccccacgccccttcctccaccaggttcctgagtgtaagggtctggtcgcc agcaacctgaatctcaaacctggagagtgccttcgagtgcgaggcgaggtggctcctgac gctaagagcttcgtgctgaacctgggcaaagacagcaacaacctgtgcctgcacttcaac cctcgcttcaacgcccacggcgacgccaacaccatcgtgtgcaacagcaaggacggcggg gcctgggggaccgagcagcgggaggctgtctttcccttccagcctggaagtgttgcagag gtgtgcatcaccttcgaccaggccaacctgaccgtcaagctgccagatggatacgaattc aagttccccaaccgcctcaacctggaggccatcaactacatggcagctgacggtgacttc aagatcaaatgtgtggcctttgactga >gi568815576f:37586393_37791333|GENSCAN_predicted_peptide_3|213_aa MGRNKKKKRDGDDRRPRLVLSFDEEKRREYLTGFHKRKVERKKAAIEEIKQRLKEEQRKL REERHQEYLKMLAEREEALEEADELDRLVTAKTESVQYDHPNHTVTVTTISDLDLSGARL LGLTPPEGGAGDRSEEEASSTEKPTKALPRKSRDPLLSQRISSLTASLHAHSRKKVKRKH PRRAQDSKKPPRAPRTSKAQRRRLTGKARHSGE >gi568815576f:37586393_37791333|GENSCAN_predicted_CDS_3|642_bp atgggccgcaacaagaagaagaagcgagatggtgacgaccggcggccgaggctcgttctt agcttcgacgaggagaagaggcgggagtacctgacaggcttccacaagcggaaggtcgag cgaaagaaggcagccattgaggagattaagcagcggctgaaagaggagcagaggaagctt cgggaggagcgccaccaggaatacttgaagatgctggcagagagagaagaggctctggag gaggcagatgagctggaccggttggtgacagcaaagacggagtcggtgcagtatgaccac cccaaccacacagtcaccgtgaccaccatcagtgacctggacctctcgggggcccggctg ctcgggctgaccccacctgagggaggggctggagacaggtctgaggaggaggcgtcatcc acggagaaaccaaccaaagccttgcccaggaagtccagagaccccctgctctctcagcgg atctcctccctcacagcatcactacatgcacacagccgcaaaaaggtcaagaggaaacat ccccgacgggcccaggactccaaaaagcccccaagggcccctcgtaccagcaaggcccag cgccgccgtctcacaggcaaagcacggcacagcggggagtga >gi568815576f:37586393_37791333|GENSCAN_predicted_peptide_4|47_aa MTLALPPRQPLCVETQPRSDDGGALALAVPGEEVKFLSSPPRLATKA >gi568815576f:37586393_37791333|GENSCAN_predicted_CDS_4|144_bp atgaccctggcactcccaccccggcagcccctgtgcgtggaaacccagccaaggtctgat gatggaggagccttggccctggctgtcccaggggaggaggtgaaattcctcagctctcca ccaagattggccacaaaagcctga >gi568815576f:37586393_37791333|GENSCAN_predicted_peptide_5|2039_aa MEEVPGDALCEHFEANILTQNRCQNCFHPEEAHGARYQPPFPRGENNAPKGEVIGLRSPA GNRRSQDGARGPSEARASADPPDRKSELGTDRGAHSSFHCLGGCCHHSPESGPGRAAVWG PEPEPPGDEGADSRQPPPPPEPAAQELRSPSGAEVPYCDLPRCPPAPEDPLSASTSGCQS VVDPGLRPGPKRGPSPSAGLPEEGPTAAPRSRSRELEAVPYLEGLTTSLCGSCNEDPGSD PTSSPDSATPDDTSNSSSVDWDTVERQEEEAPSWDELAVMIPRRPREGPRADSSQRAPSL LTRSPVGGDAAGQKKEGVKLQTFVVSVTAHKGSVDPKSEQQQDLLQRAKEQSFHSMEEDP SGPWVVDGTGRCGAGAALIGEARAAQEPTEAGGSSGMAGCRSRALPGGKAAKARREIQRS AGAKPLIARGRQGQPAAPSAGPTKPTPTRNSSWPTSAARSLGSRSRLSLHTSVQAEGASS GLGQPRKGLPQCSGGLKGSSSAAKVGAQAEEAPRVSEGCEGRQHAVTSQYHPSDSASITE RKAPLGWGEREDTTFTKRLLSIPRRENPRTPCVQQDDPRASSPNRTTQRENSRTSCAQRD NPKASRTSSPNRATRDNPRTSCAQRDNPRASSPSRATRDNPTTSCAQRDNPRASRTSSPN RATRDNPRTSCAQRDNPRASSPSRATRDNPTTSCAQRDNPRASRTSSPNRATRDNPRTSC AQRDNPRASSPNRAARDNPTTSCAQRDNPRASRTSSPNRATRDNPRTSCAQRDNPRASSP NRATRDNPTTSCAQRDNPRASRTSSPNRATRDNPRTSCAQRDNPRASSPNRTTQQDSPRT SCARRDDPRASSPNRTIQQENPRTSCALRDNPRASSPSRTIQQENPRTSCAQRDDPRASS PNRTTQQENPRTSCARRDNPRASSRNRTIQRDNPRTSCAQRDNPRASSPNRTIQQENLRT SCTRQDNPRTSSPNRATRDNPRTSCAQRDNLRASSPIRATQQDNPRTCIQQNIPRSSSTQ QDNPKTSCTKRDNLRPTCTQRDRTQSFSFQRDNPGTSSSQCCTQKENLRPSSPHRSTQWN NPRNSSPHRTNKDIPWASFPLRPTQSDGPRTSSPSRSKQSEVPWASIALRPTQGDRPQTS SPSRPAQHDPPQSSFGPTQYNLPSRATSSSHNPGHQSTSRTSSPVYPAAYGAPLTSPEPS QPPCAVCIGHRDAPRASSPPRYLQHDPFPFFPEPRAPESEPPHHEPPYIPPAVCIGHRDA PRASSPPRHTQFDPFPFLPDTSDAEHQCQSPQHEPLQLPAPVCIGYRDAPRASSPPRQAP EPSLLFQDLPRASTESLVPSMDSLHECPHIPTPVCIGHRDAPSFSSPPRQAPEPSLFFQD PPGTSMESLAPSTDSLHGSPVLIPQVCIGHRDAPRASSPPRHPPSDLAFLAPSPSPGSSG GSRGSAPPGETRHNLEREEYTVLADLPPPRRLAQRQPGPQAQCSSGGRTHSPGRAEVERL FGQERREEQPTGSRLGSCFIEAPIPQFGGKFTLRPSALTEKSEAAGAFQAQDEGRSQQPS QGQSQLLRRQSSPAPSRQVTMLPAKQAELTRRSQAEPPHPWSPEKRPEGDRQLQGSPLPP RTSARTPERELRTQRPLESGQAGPRQPLGVWQSQEEPPGSQGPHRHLERSWSSQEGGLGP GGWWGCGEPSLGAAKAPEGAWGGTSREYKESWGQPEAWEEKPTHELPRELGKRSPLTSPP ENWGGPAESSQSWHSGTPTAVGWGAEGACPYPRGSERRPELDWRDLLGLLRAPGEGVWAR VPSLDWEGLLELLQARLPRKDPAGHRDDLARALGPELGPPGTNDVPEQESHSQPEGWAEA TPVNGHSPALQSQSPVQLPSPACTSTQWPKIKVTRGPATATLAGLEQTGPLGSRSTAKGP SLPELQADKRPAEGKAGSPLKGRLVTSWRMPGDRPTLFNPFLLSLGVLSCLSWGQHVLSK GKAAGSSVWGAWKMDSTSLLVAAAFLREVSSCDYRALSGGDGLARRQQCGQKRVVMFEK >gi568815576f:37586393_37791333|GENSCAN_predicted_CDS_5|6120_bp atggaggaggtgcctggggatgccctgtgtgaacactttgaggccaacatacttacccag aaccgctgtcaaaactgcttccaccctgaggaggcccatggagcaagataccagcctcca tttccgagaggagaaaacaatgctccgaaaggggaagtcatcggcctgaggtcaccagcc gggaaccggaggagccaggatggggcccgaggtccatctgaggccagagccagtgctgac ccgccagacaggaaatcagagctggggactgaccgtggggcccactcaagtttccactgc ctcggcggctgctgccaccacagcccggagtcggggcctgggagggcagcagtgtggggg cctgagccggagccccccggggacgagggtgctgacagtcgacagccaccaccaccacca gagcccgcagcccaggagctcaggagcccttcaggtgctgaggtgccctactgcgacctg cctcgatgtccacctgcccctgaggacccactcagcgcctcaacctccggctgccagtct gtggtggacccaggcctcaggccagggcccaagaggggcccatccccctcagcagggctc ccagaagagggtcccacagctgcccccaggagcaggagccgggagcttgaggcagtaccc tatctggagggcctgaccacttccttgtgtggcagctgcaacgaggaccccggctctgac cccacctccagccctgactccgccacccctgatgataccagcaactcgtcctctgtggac tgggacactgttgagaggcaggaggaggaggcccccagctgggacgagctcgcagtgatg atcccgaggaggcctcgggaggggccgagagctgacagctcccaaagggctccgtctctc ctcaccaggtcccctgtgggaggagatgctgcaggccagaaaaaggagggagtgaagctg cagaccttcgtggtgagcgttacagctcataaaggcagtgtggacccaaagagtgagcag cagcaagatttattgcaaagagcgaaagaacaaagcttccacagcatggaagaggacccg agcggcccttgggtggttgatgggactgggcgctgtggagcaggggcggcgctcatcggg gaggctcgggctgcacaggagcccacggaggcggggggaagctcaggcatggcgggctgc aggtcccgagccctgcccggtgggaaggcagctaaggcccggcgagaaatccagcgcagc gctggtgctaagcccctcattgcccggggccggcagggccagccggccgctccgagtgcg gggcccaccaagcccacgcccacccggaactccagctggcccacaagcgccgcgcgcagc ctcggttcccgctcgcgcctctccctccacacctccgtgcaagctgagggagccagctcc ggcctcggccagcccaggaaggggctcccacagtgcagcggtggcctgaagggctcctca agtgccgccaaagtgggagcccaggcagaggaggcgccgagagtgagcgagggctgtgag ggccgccagcatgctgtcacctctcagtatcacccctcagattctgcatcgattactgag agaaaggcacccttaggctggggagagcgggaagataccactttcaccaagaggctgctc agcatccccagacgggaaaaccccaggacaccctgtgtccagcaggacgatcccagagcc tcctctcccaacagaaccactcaacgagagaattccagaacatcctgtgcccagcgggac aatcccaaagcctccagaacctcctctcccaatagagccacacgagacaaccccagaaca tcctgcgcccagcgggacaatcccagagcctcctctcccagtagagctacacgagacaac cccacaacatcctgtgcccagcgggacaatcccagagcctccagaacctcctctcccaat agagccacacgagacaaccccagaacatcctgtgcccagcgggacaatcccagagcctcc tctcccagtagagctacacgagacaaccccacaacatcctgtgcccagcgggacaatccc agagcctccagaacctcctctcccaatagagccacacgagacaaccccagaacatcctgc gcccagcgggacaatcccagagcctcctctcccaatagagctgcacgagacaaccccaca acatcctgtgcccagcgggacaatcccagagcctccagaacctcctctcccaatagagcc acacgagacaaccccagaacatcctgtgcccagcgggacaatcccagagcctcctctccc aatagagctacacgagacaaccccacaacatcctgtgcccagcgggacaatcccagagcc tccagaacctcctctcccaatagagccacacgagataaccccagaacatcctgtgcccag cgggacaatcccagagcctcctctcccaacagaaccacccaacaagacagccccagaaca tcctgtgcccgacgggacgatcccagagcctcctctcctaacagaaccatccaacaagag aaccccagaacatcctgtgccctacgggacaatcccagagcctcctctcccagcagaacc atccaacaagagaaccccagaacatcctgtgcccaacgggacgatcccagagcctcctct cctaacagaaccacccaacaagagaaccccagaacatcctgtgcccgacgggacaatccc agagcctcctctcgcaacagaaccatccagcgagacaaccccagaacatcctgtgcccag cgggacaatcccagagcctcctctcctaacagaaccatccaacaagagaacctcagaaca tcctgtacccgacaggacaatcccaggacctcctctcccaatagagccacacgagacaac cccagaacatcctgtgcccagcgggacaatctcagagcctcctctcccatcagagccacc caacaggacaaccccagaacttgtattcaacagaacatccccagatcatcttctacccaa caagacaaccctaaaacctcttgtaccaaacgagataacctcagacccacttgtacacag cgggaccgcacacagtccttttcctttcaacgagacaaccctggaacctcctcatctcaa tgctgcacccaaaaggagaatctgagaccatcatctccccaccgctccactcaatggaac aatcccaggaattcatctccccatcgtactaacaaagacatcccctgggcctcgtttccc ctccggccaactcagagtgatggtccccgaacctcttccccatctcgctccaagcaaagc gaggttccctgggcatccatcgccctccggccaacccaaggtgacaggcctcagacatcc tctcccagcaggccagcccagcatgacccaccccagtcctcctttggccccacccagtac aacttgccatcccgggccacctcttcctcccataacccaggccaccagagcacctcccga acttcctcacctgtgtaccccgctgcctatggggctcccctgacctctcctgagccctcc cagcctccatgtgctgtgtgcattgggcaccgggatgcccctcgagcctcttcgccccct cgctatttgcagcacgaccccttccccttcttcccagagccccgcgcccctgagagtgaa ccgccccaccacgagcctccctatataccacctgctgtgtgcattggacaccgagatgcc ccccgggcgtcctcgcccccccgccacacccaatttgaccccttccccttcctcccagac acatcagatgccgagcatcagtgtcagtccccccaacacgagccccttcagctccctgca cctgtgtgtattgggtaccgagatgcaccccgggcctcctccccaccacgccaggcccca gagccttccctcttattccaggacctccccagggccagcacagagagccttgtcccttcc atggactctctgcacgagtgcccccacatccccacccctgtgtgcattgggcaccgggat gcaccctccttctcatccccaccacgccaggctcctgagccatccctcttcttccaggat ccccctggaactagtatggagagcctggccccctccactgactctctgcatggctcccca gtgctgatcccccaagtgtgcatcgggcaccgggatgcaccccgagcctcctccccaccc cgccacccacccagtgacctagcgttcctggcaccctcaccttcaccgggcagctctggg ggctcccggggctcagcgcctcccggggagaccaggcacaacttggagcgggaggagtac actgtgctggccgacctgcccccacccaggaggctggcccagagacagccagggccccag gcgcagtgcagcagcgggggccgcacccacagccctggccgtgcagaggtggagcgcctc ttcgggcaagagcgcagggaggaacagcccactgggtcacgtctgggctcttgcttcatc gaagctcctataccccagttcgggggaaaattcactctaaggccttcagctctcacagag aagtccgaggcagcgggggccttccaggcccaggacgagggacggtcacagcagcccagc caaggccagagccaacttctccgaagacagtccagccctgcccccagcaggcaggtgacc atgctccctgccaaacaggcagaactgacccggcggagccaagcagagccccctcatcct tggagtcctgagaagagacctgagggagatcggcagctccaggggtccccgctgcccccc aggacatcagccaggacccctgagagggagctgcggacacagagacctctggagagtggc caagcaggcccaagacagcctctgggggtgtggcagagtcaggaggaaccgccagggtcc cagggccctcatagacacctagaaaggagctggagcagccaggagggaggcctgggccct gggggctggtggggatgtggagagcccagcctgggggcagccaaagccccggagggagca tgggggggcacttccagggagtacaaggagagctgggggcagccagaggcctgggaggag aagcccactcatgagctccccagagaactaggaaagagaagcccactcacgagcccccct gagaactggggaggccccgcagagtcctcacaatcctggcactctgggacacccactgct gtgggctggggggcagagggagcgtgtccatacccgcgtggctctgagaggcgacccgag cttgactggagggatctgcttggccttctccgggcaccaggagagggggtctgggcccgt gtccccagcctggactgggagggcctcttggagctcctgcaggccaggctgccccgcaag gacccagctggacacagggatgacctggccagggctttagggccagagctgggtccccca ggcacaaacgatgtccctgagcaggagtcacacagccagccagaaggctgggccgaggcc accccagtcaatggacacagccccgcactgcagtcccagagcccggtccagctgcccagc cctgcctgcacctccacccagtggccaaagatcaaagtgacaagaggaccagcgaccgca actctggcaggcctggagcagacgggccccctggggagcaggagcactgcgaagggcccc agcttgccagagctgcaggcagacaagaggccagcagagggcaaggctgggagcccgctc aagggccgactggtgacctcatggcggatgcccggggaccggcccacgctgttcaatccg ttcctgctgtctctgggggtcctcagttgcctgtcctgggggcagcacgtgctgagcaag ggtaaggctgccggaagcagcgtgtggggtgcttggaagatggacagcacatccctgctg gtggcagcagccttcctgagggaggtgtcctcctgtgattatagggccttgtcaggtgga gatggactagcgaggagacagcagtgtggacagaaacgggtggtcatgtttgagaagtag >gi568815576f:37586393_37791333|GENSCAN_predicted_peptide_6|686_aa MLQLVAPRPRGCAPLGGTQKPDLLNFKKGWMSILDEPGEADELDGEIDLRSCTDVTEYAV QRNYGFQIHTKDAVYTLSAMTSGIRRNWIEALRKTVRPTSAPDVTKLSDSNKENALHSYS TQKGPLKAGEQRAGSEVISRGGPRKADGQRQALDYVELSPLTQASPQRARTPARTPDRLA KQEELERDLAQRSEERRKWFEATDSRTPEVPAGEGPRRGLGAPLTEDQQNRLSEEIEKKW QELEKLPLRENKRVPLTALLNQSRGERRGPPSDGHEALEKEVQALRAQLEAWRLQGEAPQ SALRSQEDGHIPPGYISQEACERSLAEMESSHQQVMEELQRHHERELQRLQQEKEWLLAE ETAATASAIEAMKKAYQEELSRELSKTRSLQQGPDGLRKQHQSDVEALKRELQVLSEQYS QKCLEIGALMRQAEEREHTLRRCQQEGQELLRHNQELHGRLSEEIDQLRGFIASQGMGNG CGRSNERSSCELEVLLRVKENELQYLKKEVQCLRDELQMMQKVGPSAGLGAVGDSGAIWM PSCEHLLCAKPSTRFILPQALLCFSHPLTALRSWSKSSPPKQDEDDNDACVPGWYQGGGR GKPRTHGHLGEPPGGMKRGICGEVWLLVLPMCPDKRFTSGKYQDVYVELSHIKTRSEREI EQLKEHLRLAMAALQEKESMRNSLAE >gi568815576f:37586393_37791333|GENSCAN_predicted_CDS_6|2061_bp atgctgcagctggtagcccccagaccccggggctgtgcccccctgggcggcacccagaag cccgatctgctcaacttcaagaagggatggatgtcgatcttggacgagcctggagaggca gatgagctggatggtgagatcgacctgcgttcctgcacggatgtcactgagtacgcggtg cagcgcaactatggcttccagatccacaccaaggatgctgtctataccttgtcggccatg acctcaggcatccggcggaactggatcgaggctctgagaaagaccgtacgtccaacttca gccccagatgtcaccaagctctcggactctaacaaggagaacgcgctgcacagctacagc acccagaagggccccctgaaggcaggggagcagcgggcgggctctgaggtcatcagccgg ggtggccctcggaaggcggacgggcagcgtcaggccttggactacgtggagctctcgccg ctgacccaggcttccccgcagcgggcccgcaccccagcccgcactcctgaccgcctggcc aagcaggaggagctggagcgggacctggcccagcgctccgaggagcggcgcaagtggttt gaggccacagacagcaggaccccagaggtgcctgctggtgaggggccgcgccggggcctg ggtgcccccctgactgaggaccagcaaaaccggcttagtgaggagatcgagaagaagtgg caggagctggagaagctgcccctgcgggagaataagcgggtgcccctcactgccctgctc aaccaaagccgcggagagcgccgagggcccccaagtgacggccacgaggcactggagaag gaggttcaggctcttcgggcccagctggaggcgtggcgtctccaaggggaggctcctcag agtgcactgagatcccaggaggatggccacatccccccgggctacatctcacaggaggca tgtgagcgcagcctggcagagatggagtcctcgcaccagcaggtgatggaggagctgcag cggcaccacgagcgggagctgcagcgcctgcagcaggagaaggagtggctcctggctgag gagacggcagccacggcctcagccattgaagccatgaagaaggcctaccaggaagagctg agccgagagctgagcaaaacacggagtctccagcagggcccggatggcctccggaagcag caccagtcagatgtggaggcactgaagcgagagctgcaggtgctatcggagcagtactcg cagaagtgcctggagattggggcactcatgcggcaggctgaggagcgcgagcacacgctg cgccgctgccagcaggagggccaggagctgctgcgccacaaccaggagctgcatggccgc ctgtcagaggagatagaccagctgcgcggcttcattgcctcgcagggcatgggcaatggc tgcgggcgcagcaacgagcggagttcctgcgagctagaggtgctgcttcgcgtaaaagaa aacgaactccagtacctaaagaaggaggtgcagtgcctccgggacgagctccagatgatg cagaaggtaggtccttccgctgggctgggggccgtcggggactctggagccatctggatg ccatcctgtgagcacctgctctgtgccaagccctccactcgcttcatccttcctcaggct ctcctgtgcttctcccatccactcactgccctgcggtcttggtcaaaatcttctcccccg aaacaggatgaggatgacaatgacgcctgtgtccctgggtggtaccagggaggtgggagg ggtaagcccagaacccacggccatcttggggagccacctggagggatgaagcgaggtatc tgcggggaggtctggctgctggtgctgcccatgtgcccggacaagcgcttcacctcggga aagtaccaggacgtctatgtggagctgagccacatcaagacacggtctgagcgggagatc gagcagctgaaggagcacctgcgtcttgccatggccgccctccaggagaaggagtcgatg cgcaacagcctggctgagtag >gi568815576f:37586393_37791333|GENSCAN_predicted_peptide_7|142_aa MPGWENIFSSMNHTLNALTGAVTSTRDNRDCSLVRAMPNWGPNAAGISPCRVSAGTGRVP SCLCHSGAWNRLDQDINRNNSCYSDSVGLATLRASLNPISSSHLRHTWPVQPAVLLLSED TATWEQLECGSGDATVEVTASR >gi568815576f:37586393_37791333|GENSCAN_predicted_CDS_7|429_bp atgcccggctgggaaaatattttctcctccatgaaccatactctcaatgcactaactggt gctgtgacaagcaccagggacaacagggactgctctctggtgagggccatgcccaactgg ggcccaaatgcagctggcatcagtccctgcagggtgagcgcagggacagggcgtgtccct tcatgtctgtgtcactcaggtgcctggaacaggcttgaccaggatattaacaggaataac agctgctactcagactcagtgggcctggccacgctcagggcctcacttaatcccatttct tcatctcatctcagacacacttggccagtgcagccagctgtcctgctcttgagtgaggac acagccacgtgggagcagctggaatgtggctctggagatgccacagttgaggtcacagct agtagatga