GENSCAN 1.0	Date run: 6-Nov-116	Time: 14:42:25

Sequence gi568815586f:119568245_119780322 : 212078 bp : 43.38% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  25521  25790  270  2  0   62  101   351 0.966  31.54
 1.02 Intr +  26866  26958   93  0  0   55   60    73 0.457   1.36
 1.03 Intr +  54518  54793  276  1  0   71   41   107 0.150   2.01
 1.04 Intr +  61492  61628  137  0  2  101  100   214 0.933  23.27
 1.05 Intr +  77516  77606   91  2  1   87   47    28 0.021  -1.40
 1.06 Intr +  78491  78550   60  1  0  124   82     9 0.043   2.83
 1.07 Intr +  99992 100159  168  1  0  113   86   310 0.471  33.44
 1.08 Intr + 104057 104220  164  1  2   79   86   105 0.999   8.17
 1.09 Intr + 105720 105813   94  0  1   98   63   122 0.984  10.67
 1.10 Intr + 106096 106246  151  0  1   96   75    90 0.740   8.24
 1.11 Intr + 108293 108426  134  0  2   61   94   148 0.924  13.06
 1.12 Intr + 111689 111757   69  1  0  111   90    46 0.987   6.48
 1.13 Term + 112004 112081   78  1  0   86   49   114 0.960   5.06
 1.14 PlyA + 112551 112556    6                              -3.74

 2.38 PlyA - 112748 112743    6                              -0.45
 2.37 Term - 112956 112783  174  0  0   65   53   164 0.571   8.36
 2.36 Intr - 122210 121907  304  1  1   62  100   384 0.656  33.59
 2.35 Intr - 124392 124358   35  0  2   65   50    82 0.548  -0.88
 2.34 Intr - 127232 127106  127  1  1   19   71    93 0.598   1.38
 2.33 Intr - 129594 129415  180  2  0   39   91   141 0.962   8.48
 2.32 Intr - 129810 129732   79  1  1   32   98    66 0.830   0.61
 2.31 Intr - 132581 132501   81  0  0   83   84    66 0.825   5.31
 2.30 Intr - 133508 133380  129  0  0   90  113   101 0.990  13.47
 2.29 Intr - 133714 133606  109  1  1   86   53   179 0.999  14.06
 2.28 Intr - 136211 136119   93  2  0   93  106   182 0.991  20.66
 2.27 Intr - 140074 139935  140  2  2   70  121   117 0.999  13.38
 2.26 Intr - 142142 142007  136  2  1  129   64   103 0.964  12.14
 2.25 Intr - 142425 142296  130  2  1   51   95    92 0.512   6.90
 2.24 Intr - 143161 143122   40  2  1   85   61    12 0.299  -4.52
 2.23 Intr - 144451 144347  105  2  0   55   94    82 0.421   5.69
 2.22 Intr - 145050 144959   92  2  2  114   94    75 0.985  10.34
 2.21 Intr - 145404 145224  181  1  1   95   81   179 0.986  16.83
 2.20 Intr - 146090 145953  138  0  0   75  103   109 0.999  11.54
 2.19 Intr - 150165 150001  165  1  0   78   83   160 0.997  14.43
 2.18 Intr - 150617 150455  163  2  1   31   97   209 0.541  15.65
 2.17 Intr - 152341 152234  108  1  0   56  113    91 0.973   8.88
 2.16 Intr - 153205 153065  141  1  0   57   94   114 0.991   9.45
 2.15 Intr - 160362 160258  105  0  0   26  105   153 0.998  11.21
 2.14 Intr - 162386 162251  136  1  1   89   82   212 0.989  21.27
 2.13 Intr - 166113 165920  194  0  2   23   75   253 0.907  15.79
 2.12 Intr - 167113 166916  198  1  0  117   80   413 0.909  42.95
 2.11 Intr - 174220 174167   54  1  0   88   74    72 0.771   4.98
 2.10 Intr - 184003 183806  198  1  0  124   90   300 0.998  33.35
 2.09 Intr - 189301 189127  175  0  1   47   78   204 0.998  15.24
 2.08 Intr - 190456 190347  110  1  2   55  103    94 0.855   6.68
 2.07 Intr - 192811 192695  117  1  0   99   78   196 0.847  20.36
 2.06 Intr - 198938 198843   96  2  0  129   37    73 0.988   6.51
 2.05 Intr - 202666 202541  126  1  0   74   93   104 0.996  10.38
 2.04 Intr - 204666 204526  141  0  0  123   83   269 0.999  30.45
 2.03 Intr - 207595 207542   54  1  0   54  121    95 0.997   8.58
 2.02 Intr - 208164 208114   51  0  0   82   69    62 0.874   2.80
 2.01 Intr - 208598 208428  171  2  0   70   61    86 0.716   4.24

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586f:119568245_119780322|GENSCAN_predicted_peptide_1|594_aa
SLATSRRPEPQTTQTVRSSALPAPPASPMSQYAPSPDFKRALDSSPEANTEDDKTEEDVP
MPKNYLWLTIVSCFCPAYPINIVALVFSIMEHPSEPHQEGDERLLGPRLIDCSSWPGGGG
GQIYPWAALFPSTRGENGFRKGFVGRGASTLNTERGFSSVFQANSSPIPRFTQGLPYMSL
ERRRVTYKMPEWQQVMNVGQDSPVACTQIPFSQSLNSYNDGDYEGARRLGRNAKWVAIAS
IIIGLLIIGISCAVHFTRNVEFPSLHPHWFTDAPKLSAPGTVTKCLLPQGPLSLLWVHLD
DPEKSPYFKTPIMGNTSSERAALERHGGHKTPRRDSSGGTKDGDRPKILMDSPEDADLFH
SEEIKAPEKEEFLAWQHDLEVNDKAPAQARPTVFRWTGGGKEVYLSGSFNNWSKLPLTRS
HNNFVAILDLPEGEHQYKFFVDGQWTHDPSEPIVTSQLGTVNNIIQVKKTDFEVFDALMV
DSQKCSDVSGMNTVILYHMRAELSSSPPGPYHQEPYVCKPEERFRAPPILPPHLLQVILN
KDTGISCDPALLPEPNHVMLNHLYALSIKDGVMVLSATHRYKKKYVTTLLYKPI

>gi568815586f:119568245_119780322|GENSCAN_predicted_CDS_1|1785_bp
agcctcgccacaagccggcggccagagccccagaccacacagaccgtgcgctcctccgcc
ctcccggcgccgccggcctcgcccatgtctcagtacgcccctagcccggacttcaagagg
gctttggacagcagtcccgaggccaacactgaagatgacaagaccgaggaggacgtgccc
atgcccaagaactacctgtggctcaccatcgtctcgtgtttttgccctgcgtaccccatc
aacatcgtggctttggtcttttccatcatggaacacccgagcgaaccccaccaggagggc
gacgagcgcctgctaggccctcgccttattgactgcagcagctggcccgggggtggcggc
gggcagatctacccgtgggcagcccttttcccaagtaccaggggagaaaatggattcaga
aaaggatttgtggggagaggagcttccacacttaatactgaaagaggcttttcatctgtt
tttcaagccaacagcagccccattcccagattcactcaaggtctcccttacatgtccctg
gaaagaaggagggtcacatacaagatgccagagtggcagcaggtgatgaacgtggggcag
gattcacctgtggcttgcacccaaatccccttcagccagtctctgaacagctacaacgat
ggagactacgaaggagccaggcggcttgggcggaatgctaagtgggtagccatcgcctcc
atcatcattggccttctcatcatcggcatttcttgtgcagttcacttcacaaggaatgta
gaatttcccagtttgcacccacactggttcactgatgcacccaagctgtctgctcctgga
actgtgaccaagtgtttactcccccagggacccttgtcattactttgggtccacctggat
gatccagaaaaatctccttattttaagacccccatcatgggcaataccagcagtgagcgc
gccgcgctggagcggcatggtggccataagacgccccggagggacagctcggggggcacc
aaggacggggacaggcccaagatcctgatggacagccccgaagacgccgacctcttccac
tccgaggaaatcaaggcaccagagaaggaggaattcctggcctggcagcatgatctggaa
gtgaatgataaagctcccgcccaggctcggccaacggtgtttcgatggacggggggcgga
aaggaagtttacttatctgggtccttcaacaactggagtaaacttcccctcaccagaagc
cacaataactttgtagccatcctggatctgccggaaggagagcatcagtacaagttcttt
gtggatggtcagtggacgcacgacccttccgagcccatagtaaccagccagcttggcaca
gttaacaacatcattcaagtgaagaaaactgactttgaagtatttgatgctttaatggtg
gattcccaaaagtgctccgatgtgtctggtatgaacacagttattttataccacatgcgt
gcagagctgtccagttctcccccaggaccctaccatcaggagccctacgtctgcaaaccc
gaagagcgctttcgggcaccccctattctccccccacatctcctccaggtcatcctgaac
aaggacacggggatttcctgtgatccagctttgcttcctgagcccaatcacgtcatgctg
aaccacctatacgcgctgtctatcaaggatggagtgatggtgctcagcgcaacccaccgg
tacaagaagaagtacgtcaccaccttgttatacaagcccatatga

>gi568815586f:119568245_119780322|GENSCAN_predicted_peptide_2|1591_aa
EYQAQVEEMRLMMNQLEEDLVSARRRSDLYESELRESRLAAEEFKRKATECQHKLLKAKD
QGKPEVGEYAKLEKINAEQQLKIQELQEKLEKAVKASTEATELLQNIRQAKERAERELEK
LQNREDSSEGIRKKLVEAEERRHSLENKVKRLETMERRENRLKDDIQTKSQQIQQMADKI
LELEEKHREAQVSAQHLEVHLKQKEQHYEEKIKVLDNQIKKDLADKETLENMMQRHEEEA
HEKGKILSEQKAMINAMDSKIRSLEQRIVELSEANKLAANSSLFTQRNMKAQEEMISELR
QQKFYLETQAGKLEAQNRKLEEQLEKISHQDHSDKNRLLELETRLREVSLEHEEQKLELK
RQLTELQLSLQERESQLTALQAARAALESQLRQAKTELEETTAEAEEEIQALTAHRDEIQ
RKFDALRNSCTVITDLEEQLNQLTEDNAELNNQNFYLSKQLDEASGANDEIVQLRSEVDH
LRREITEREMQLTSQKQTMEALKTTCTMLEEQVMDLEALNDELLEKERQWEAWRSVLGDE
KSQFECRVRELQRMLDTEKQSRARADQRITESRQVVELAVKEHKAEILALQQALKEQKLK
AESLSDKLNDLEKKHAMLEMNARSLQQKLETERELKQRLLEEQAKLQQQMDLQKNHIFRL
TQGLQEALDRADLLKTERSDLEYQLENIQVLYSHEKVKMEGTISQQTKLIDFLQAKMDQP
AKKKKGLFSRRKEDPALPTQVPLQYNELKLALEKEKARCAELEEALQKTRIELRSAREEA
AHRKATDHPHPSTPATARQQIAMSAIVRSPEHQPSAMSLLAPPSSRRKESSTPEEFSRRL
KERMHHNIPHRFNVGLNMRATKCAVCLDTVHFGRQASKCLECQVMCHPKCSTCLPATCGL
PAEYATHFTEAFCRDKMNSPGLQTKEPSSSLHLEGWMKVPRNNKRGQQGWDRKYIVLEGS
KVLIYDNEAREAGQRPVEEFELCLPDGDVSIHGAVGASELANTAKAGRFGVLLILKLLET
SASRCLLFFFSFTYEKKLLGNSLLKLEGDDRLDMNCTLPFSDQVVLVGTEEGLYALNVLK
NSLTHVPGIGAVFQIYIIKDLEKLLMIAGEERALCLVDVKKVKQSLAQSHLPAQPDISPN
IFEAVKGCHLFGAGKIENGLCICAAMPSKVVILRYNENLSKYCIRKEIETSEPCSCIHFT
NYSILIGTNKFYEIDMKQYTLEEFLDKNDHSLAPAVFAASSNSFPVSIVQVNSAGQREEY
LLCFHEFGVFVDSYGRRSRTDDLKWSRLPLAFAYREPYLFVTHFNSLEVIEIQARSSAGT
PARAYLDIPNPRYLGPAISSGAIYLASSYQDKLRVICCKGNLVKESGTEHHRGPSTSRRF
WYPQEVLEAVPQATKGQLYNQMYSQIHCPENGSFPRSVARQLLNPVVVMRKQLSPNKRGP
PTYNEHITKRVASSPAPPEGPSHPREPSTPHRYREGRTELRRDKSPGRPLEREKSPGRML
STRRERSPGRLFEDSSRGRLPAGAVRTPLSQVNKGRGQSASQVFTVNTVTYYDWNKKLDN
LPANWSVLRIIQLNGEIRQQVEKSVLRTDYC

>gi568815586f:119568245_119780322|GENSCAN_predicted_CDS_2|4776_bp
gagtaccaggctcaagtggaagaaatgaggttgatgatgaatcagttggaagaggatctt
gtctcagcaagaagacggagtgatctctacgaatctgagctgagagagtctcggcttgct
gctgaagaattcaagcggaaagcgacagaatgtcagcataaactgttgaaggctaaggat
caagggaagcctgaagtgggagaatatgcgaaactggagaagatcaatgctgagcagcag
ctcaaaattcaggagctccaagagaaactggagaaggctgtaaaagccagcacggaggcc
accgagctgctgcagaatatccgccaggcaaaggagcgagccgagagggagctggagaag
ctgcagaaccgagaggattcttctgaaggcatcagaaagaagctggtggaagctgaggaa
cgccgccattctctggagaacaaggtaaagagactagagaccatggagcgtagagaaaac
agactgaaggatgacatccagacaaaatcccaacagatccagcagatggctgataaaatt
ctggagctcgaagagaaacatcgggaggcccaagtctcagcccagcacctagaagtgcac
ctgaaacagaaagagcagcactatgaggaaaagattaaagtgttggacaatcagataaag
aaagacctggctgacaaggagacactggagaacatgatgcagagacacgaggaggaggcc
catgagaagggcaaaattctcagcgaacagaaggcgatgatcaatgctatggattccaag
atcagatccctggaacagaggattgtggaactgtctgaagccaataaacttgcagcaaat
agcagtctttttacccaaaggaacatgaaggcccaagaagagatgatttctgaactcagg
caacagaaattttacctggagacacaggctgggaagttggaggcccagaaccgaaaactg
gaggagcagctggagaagatcagccaccaagaccacagtgacaagaatcggctgctggaa
ctggagacaagattgcgggaggtcagtctagagcacgaggagcagaaactggagctcaag
cgccagctcacagagctacagctctccctgcaggagcgcgagtcacagttgacagccctg
caggctgcacgggcggccctggagagccagcttcgccaggcgaagacagagctggaagag
accacagcagaagctgaagaggagatccaggcactcacggcacatagagatgaaatccag
cgcaaatttgatgctcttcgtaacagctgtactgtaatcacagacctggaggagcagcta
aaccagctgaccgaggacaacgctgaactcaacaaccaaaacttctacttgtccaaacaa
ctcgatgaggcttctggcgccaacgacgagattgtacaactgcgaagtgaagtggaccat
ctccgccgggagatcacggaacgagagatgcagcttaccagccagaagcaaacgatggag
gctctgaagaccacgtgcaccatgctggaggaacaggtcatggatttggaggccctaaac
gatgagctgctagaaaaagagcggcagtgggaggcctggaggagcgtcctgggtgatgag
aaatcccagtttgagtgtcgggttcgagagctgcagagaatgctggacaccgagaaacag
agcagggcgagagccgatcagcggatcaccgagtctcgccaggtggtggagctggcagtg
aaggagcacaaggctgagattctcgctctgcagcaggctctcaaagagcagaagctgaag
gccgagagcctctctgacaagctcaatgacctggagaagaagcatgctatgcttgaaatg
aatgcccgaagcttacagcagaagctggagactgaacgagagctcaaacagaggcttctg
gaagagcaagccaaattacagcagcagatggacctgcagaaaaatcacattttccgtctg
actcaaggactgcaagaagctctagatcgggctgatctactgaagacagaaagaagtgac
ttggagtatcagctggaaaacattcaggttctctattctcatgaaaaggtgaaaatggaa
ggcactatttctcaacaaaccaaactcattgattttctgcaagccaaaatggaccaacct
gctaaaaagaaaaagggtttatttagtcgacggaaagaggaccctgctttacccacacag
gttcctctgcagtacaatgagctgaagctggccctggagaaggagaaagctcgctgtgca
gagctagaggaagcccttcagaagacccgcatcgagctccggtccgcccgggaggaagct
gcccaccgcaaagcaacggaccacccacacccatccacgccagccaccgcgaggcagcag
atcgccatgtccgccatcgtgcggtcgccagagcaccagcccagtgccatgagcctgctg
gccccgccatccagccgcagaaaggagtcttcaactccagaggaatttagtcggcgtctt
aaggaacgcatgcaccacaatattcctcaccgattcaacgtaggactgaacatgcgagcc
acaaagtgtgctgtgtgtctggataccgtgcactttggacgccaggcatccaaatgtctc
gaatgtcaggtgatgtgtcaccccaagtgctccacgtgcttgccagccacctgcggcttg
cctgctgaatatgccacacacttcaccgaggccttctgccgtgacaaaatgaactcccca
ggtctccagaccaaggagcccagcagcagcttgcacctggaagggtggatgaaggtgccc
aggaataacaaacgaggacagcaaggctgggacaggaagtacattgtcctggagggatca
aaagtcctcatttatgacaatgaagccagagaagctggacagaggccggtggaagaattt
gagctgtgccttcccgacggggatgtatctattcatggtgccgttggtgcttccgaactc
gcaaatacagccaaagcaggcaggtttggagtcctacttatccttaagctgttagaaaca
tcagcctcacgatgtcttctgtttttcttttctttcacctatgaaaagaaactgcttgga
aactccctgctgaaactggaaggtgatgaccgtctagacatgaactgcacgctgcccttc
agtgaccaggtggtgttggtgggcaccgaggaagggctctacgccctgaatgtcttgaaa
aactccctaacccatgtcccaggaattggagcagtcttccaaatttatattatcaaggac
ctggagaagctactcatgatagcaggagaagagcgggcactgtgtcttgtggacgtgaag
aaagtgaaacagtccctggcccagtcccacctgcctgcccagcccgacatctcacccaac
atttttgaagctgtcaagggctgccacttgtttggggcaggcaagattgagaacgggctc
tgcatctgtgcagccatgcccagcaaagtcgtcattctccgctacaacgaaaacctcagc
aaatactgcatccggaaagagatagagacctcagagccctgcagctgtatccacttcacc
aattacagtatcctcattggaaccaataaattctacgaaatcgacatgaagcagtacacg
ctcgaggaattcctggataagaatgaccattccttggcacctgctgtgtttgccgcctct
tccaacagcttccctgtctcaatcgtgcaggtgaacagcgcagggcagcgagaggagtac
ttgctgtgtttccacgaatttggagtgttcgtggattcttacggaagacgtagccgcaca
gacgatctcaagtggagtcgcttacctttggcctttgcctacagagaaccctatctgttt
gtgacccacttcaactcactcgaagtaattgagatccaggcacgctcctcagcagggacc
cctgcccgagcgtacctggacatcccgaacccgcgctacctgggccctgccatttcctca
ggagcgatttacttggcgtcctcataccaggataaattaagggtcatttgctgcaaggga
aacctcgtgaaggagtccggcactgaacaccaccggggcccgtccacctcccgcagattt
tggtacccacaggaagtcctggaagcagtcccccaggccaccaagggacaactgtacaac
caaatgtacagtcagatccactgccctgagaatgggtcttttccacggagcgtcgctagg
cagctgctcaaccctgttgtggtcatgcggaagcagctcagccccaacaagcgaggccca
cccacgtacaacgagcacatcaccaagcgcgtggcctccagcccagcgccgcccgaaggc
cccagccacccgcgagagccaagcacaccccaccgctaccgcgaggggcggaccgagctg
cgcagggacaagtctcctggccgccccctggagcgagagaagtcccccggccggatgctc
agcacgcggagagagcggtcccccgggaggctgtttgaagacagcagcaggggccggctg
cctgcgggagccgtgaggaccccgctgtcccaggtgaacaagggaagagggcagagtgcc
tctcaagttttcacggttaacactgtcacctattatgactggaataaaaagctggacaac
ctgccagctaactggtcagtcctgaggatcatccagctgaatggagaaatccggcagcag
gttgaaaagtctgttctgagaacagattattgctga