GENSCAN 1.0 Date run: 4-Aug-121 Time: 20:41:23 Sequence gi568815588f:13487087_13730337 : 243251 bp : 43.48% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.12 Intr - 5790 5525 266 1 2 72 95 123 0.573 8.63 1.11 Intr - 9802 9680 123 2 0 79 87 83 0.953 7.86 1.10 Intr - 12994 12692 303 2 0 50 56 121 0.011 1.66 1.09 Intr - 13580 13424 157 2 1 34 53 119 0.010 2.58 1.08 Intr - 24825 24725 101 1 2 61 33 67 0.001 -1.67 1.07 Intr - 39135 39052 84 1 0 68 82 56 0.028 2.79 1.06 Intr - 41472 41397 76 0 1 78 93 38 0.005 2.39 1.05 Intr - 42360 42346 15 0 0 108 106 21 0.057 1.64 1.04 Intr - 49803 49710 94 2 1 63 70 77 0.067 3.37 1.03 Intr - 73837 73648 190 2 1 98 50 47 0.006 0.54 1.02 Intr - 75433 75292 142 1 1 46 75 62 0.201 0.63 1.01 Init - 85707 85627 81 0 0 68 86 95 0.915 8.27 1.00 Prom - 90183 90144 40 -3.66 2.00 Prom + 92599 92638 40 -5.96 2.01 Init + 100001 100066 66 1 0 97 65 132 0.284 12.87 2.02 Intr + 110372 110449 78 1 0 67 86 46 0.094 2.05 2.03 Intr + 113158 113262 105 0 0 42 107 74 0.875 5.11 2.04 Intr + 118545 118658 114 2 0 59 116 50 0.974 5.54 2.05 Intr + 122953 123099 147 0 0 97 74 200 0.999 19.93 2.06 Intr + 124529 124597 69 1 0 83 100 97 0.998 9.78 2.07 Intr + 126655 126795 141 0 0 40 72 129 0.992 7.05 2.08 Intr + 126929 127000 72 1 0 68 83 55 0.854 2.60 2.09 Intr + 129312 129467 156 2 0 103 115 11 0.951 5.51 2.10 Term + 143174 143254 81 1 0 109 50 67 0.064 2.69 2.11 PlyA + 149017 149022 6 1.05 3.16 PlyA - 149991 149986 6 1.05 3.15 Term - 152005 151893 113 2 2 56 38 85 0.238 -0.98 3.14 Intr - 159075 158959 117 1 0 102 46 91 0.026 6.74 3.13 Intr - 170436 169550 887 2 2 86 94 1620 0.844 153.37 3.12 Intr - 172404 172237 168 2 0 78 105 168 0.984 16.56 3.11 Intr - 173369 173230 140 2 2 72 99 182 0.649 16.96 3.10 Intr - 179239 178865 375 1 0 123 63 385 0.726 34.71 3.09 Intr - 183442 183320 123 1 0 56 92 157 0.978 13.68 3.08 Intr - 187958 187825 134 0 2 80 61 180 0.990 14.86 3.07 Intr - 190466 190347 120 0 0 1 100 129 0.005 5.87 3.06 Intr - 206953 206812 142 1 1 120 96 225 0.921 26.43 3.05 Intr - 207129 206995 135 0 0 98 105 13 0.963 4.76 3.04 Intr - 208888 208745 144 1 0 31 102 79 0.291 4.08 3.03 Intr - 214392 214254 139 2 1 111 113 72 0.851 12.47 3.02 Intr - 220027 219951 77 1 2 98 115 70 0.823 9.01 3.01 Intr - 220349 220200 150 2 0 78 13 93 0.341 1.16 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 63787 63812 26 2 2 164 39 37 0.846 4.79 S.002 Init - 190476 190347 130 0 1 25 100 129 0.993 8.11 S.003 Init + 195251 195311 61 1 1 32 86 72 0.933 2.81 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:13487087_13730337|GENSCAN_predicted_peptide_1|544_aa MPADLVSGESLFLIDDSFVLCPHMAKGVAISHPELMETRLGRVTYPRSHSYEQNTQNFKP GLSMILSTKLILPQDQNVKHVGSQKLPIRDQGMNWKQGVWNQPRNNIESQDACHSIHQSK RNLETVQVGSTQEVSLAKIWSSIVPVQCGSSGATLAELQLHAQATVFLQRERKWPPPPPS PHHGVLREEKKQEIPELQTDYHHEVYKIPEFSNDVNGEAKETQPIFLASSLSSLVEECST GTQKPDASNADLSPHSLCISWRQRVILEDIVSAVKPCIPASHPEGHCVSSEALRPCQSRL SSLSLPRLVKLTDRDESMEIKKQITGMRRLLNDSTGRIYQRVGKEGEKLKEEPQDLDLVW PPRLNSSAEAPQSLHPSSRGVWNELPPQSGQFSGQYGTRSRTFQSQPHPTTSSNGELPVV NSSAGSNCCTCNCQSTLQAILQELKTMRKLMQIQAVGTQNRQQPPISLICSQRTAVSRKR NKKKKVPPKTVEPLTVKQKPSGSEMEKKSVVASELSALQAAEHTSPEESRVLGFGIVLES PSSD >gi568815588f:13487087_13730337|GENSCAN_predicted_CDS_1|1632_bp atgccagcagatttggtgtctggtgagagcctgttcctcatagatgacagcttcgtgctg tgtcctcacatggcaaaaggggtagcaatatcacacccggagctgatggagacacggctt ggtcgagtaacttacccacgatcacacagctatgagcagaacacccagaacttcaaacca ggtctgtccatgattttatcaactaagttaatactgcctcaagaccaaaatgtgaagcat gttggatcccagaagctccccatcagggaccagggaatgaactggaaacagggagtctgg aaccaacccagaaacaacattgagtcccaggatgcatgtcacagtatccaccagagcaag agaaacttggaaacagttcaagttggatccacccaggaggtctctcttgctaagatctgg tcctccattgttcctgttcagtgtgggagctcaggggccaccttggcagagctccagctg catgctcaggccacagtgttcttacagagagagcgcaaatggcccccgccgccgccttcc ccgcaccatggagttctccgagaggaaaagaagcaggaaatcccagagcttcaaactgat tatcaccacgaggtgtataagatcccagaattcagcaacgatgttaatggggaggccaaa gagacacagcccatttttttagctagttccttgagcagcttagtagaggagtgtagcaca gggacccagaagcctgatgcatccaatgcagatctctcacctcacagtctctgcatctcc tggcggcagcgtgtcatcctggaggacattgtgtcagcagtgaagccctgcatccctgcc agtcatcctgaaggacattgtgtcagtagtgaagccctgcgtccctgccagtccaggctt tcttccttgtccttgccccgcctggtgaagctgacagacagagatgaaagcatggaaata aaaaagcaaattacagggatgagaagattgctgaacgacagcactgggcggatctatcag cgagttggcaaagaaggagagaaactaaaagaagagccccaggacctggatttagtctgg cctccacgtttgaactcctctgctgaggccccgcaaagcctccacccgtcttcacgtggt gtgtggaatgagctaccgccccagagtggacagttctcagggcagtatggcacccgttct agaaccttccaaagccagccccaccctaccacgagctccaatggagaacttccagtggtg aattcatcagctggatcaaactgctgtacttgtaactgccagtcaacgttgcaggccatt ctacaagaactcaagaccatgaggaaattaatgcaaattcaagcagttggaactcaaaac agacaacaacctccaatttcccttatatgctcccagcgaactgctgtctcacgaaagaga aataaaaagaaaaaagtgcccccaaagactgtggaacctcttactgtgaaacagaagccc agtgggtcagagatggagaaaaagtcggtggtggcctctgagctatctgctctccaggca gccgagcacacctccccggaggagagccgcgttctaggattcggcattgttctggaatca ccttcctcagat >gi568815588f:13487087_13730337|GENSCAN_predicted_peptide_2|342_aa MDILKSEILRKRQLVEDRNLLVENKKYFKRSELAKKEEEAYFERCGYKIQPKEEDQKPLT SSNPVLELELAEEKLPMTLSRQEVIRRLRERGEPIRLFGETDYDAFQRLRKIEILTPEVN KGLRNDLKAALDKIDQQYLNEIVGGQEPGEEDTQNDLKVHEENTTIEELEALGESLGKGD DHKDMDIITKFLKFLLGVWAKELNAREDYVKRSVQGKLNSATQKQTESYLRPLFRKLRKR NLPADIKESITDIIKFMLQREYVKANDAYLQMAIGNAPWPIGVTMVGIHARTGREKIFSK HVAHVLNDETQRKYIQGLKRLMTICQKHFPTDPSKCVEYNAL >gi568815588f:13487087_13730337|GENSCAN_predicted_CDS_2|1029_bp atggacattctgaaatcagagatccttcggaagcggcagctggtggaggacaggaacctg ctggtggaaaataaaaaatatttcaagcgtagtgagctcgccaaaaaagaagaggaagca tattttgaaagatgtggctacaagatacagccaaaagaggaggaccagaaaccattaact tcatcgaatccagtgttagaacttgaactggcagaggaaaaattacctatgacgctttct aggcaagaggtcatcagaagattgagagaaagaggagaaccaatcagactatttggagag actgattatgatgcttttcaacgtttaaggaaaatagagatcctcacaccagaagttaac aagggattgaggaatgatttgaaagcagccttggataagattgatcagcagtacctcaat gaaatcgtcggcggtcaggagcctggagaggaagacacacagaatgatctgaaagttcat gaggaaaacaccacaattgaagagttagaggcgcttggagagtccttagggaaaggcgat gatcataaagacatggacatcatcaccaaattcctgaagtttcttcttggcgtttgggct aaagaattgaatgccagagaagattatgtgaaacgcagtgtgcagggtaaactgaacagt gcgacccagaaacagaccgagtcctacctaagaccactttttagaaagctacggaaaagg aatcttcctgctgatattaaagaatcaataacggatattattaaattcatgttgcagaga gaatacgtgaaggcaaatgatgcttatcttcagatggccattggaaatgcgccttggccc atcggtgtcactatggttggtatccatgccagaactggcagagaaaagattttttccaag catgttgcacatgttttaaatgacgaaactcagcggaaatatattcagggattgaagagg ttaatgaccatttgccagaaacactttcctacagacccatccaaatgtgtggagtacaat gcactgtga >gi568815588f:13487087_13730337|GENSCAN_predicted_peptide_3|987_aa GKVCQAGDCHDQTNKGRPKAENLKRTCFVPVKEPAGKSTPTLPGFELESCIFQWRQLENL YFREKKFSVEVHDPRRASVTRRTFGHSGIAVHTWYACPALIKSIWAMAISQHQFYLDRKQ SKPADFHRVNMRSEVAVLASVDSNVEGRGRAELQAFRQPWLDSKGPQARKRVSVINDEAK GCKQKCCTLVREISPFAECQDGEQARARDDCPFRKSKIHAARSLSEIAIDLTETGTLKTS KLANMGSKGKIISGSSGSLLSSVPENHEVTDEAFLCPVAFGDLQLKYVKIQETATKIIQS SEGSQESDSSQSAKKDMLAALKSRQEALEETLRQRLEELKKLCLREAELTGKLPVEYPLD PGEEPPIVRRRIGTAFKLDEQKILPKGEEAELERLEREFAIQSQITEAARRLASDPNVSK KLKKQRKTSYLNALKKLQEIENAINENRIKSGKKPTQRASLIIDGQCQPGCPHPTPTPGP RQPDARSHEWSLHDLSSCLTVRCLHLSIHKMKRSLEGLRQMHYHRNDYDKSPIKPKMWSE SSLDEPYEKVKKRSSHSHSSSHKRFPSTGSCAEAGGGSNSLQNSPIRGLPHWNSQSSMPS TPDLRVRSPHYVHSTRSVDISPTRLHSLALHFRHRSSSLESQGKLLGSENDTGSPDFYTP RTRSSNGSDPMDDCSSCTSHSSSEHYYPAQMNANYSTLAEDSPSKARQRQRQRQRAAGAL GSASSGSMPNLAARGGAGGAGGAGGGVYLHSQSQPSSQYRIKEYPLYIEGGATPVVVRSL ESDQEGHYSVKAQFKTSNSYTAGGLFKESWRGGGGDEGDTGRLTPSRSQILRTPSLGREG AHDKGAGRAAVSDELRQWYQRSTASHKEHSRLSHTSSTSSDSGSQYSTSSQSTFVAHSRV TRMPQMCKATSESLKNCPNATVKARKSWLALLTHSRHPSGHKERPVGVVEGGCGKVRFYS SRNEQRPQHHGTLATSHLMILKVKKFE >gi568815588f:13487087_13730337|GENSCAN_predicted_CDS_3|2964_bp ggaaaagtctgccaagctggggactgccacgaccaaaccaacaaaggccgaccgaaggcg gagaacttgaaaagaacttgttttgttccagttaaggaacctgcaggaaaatccacaccc acattgcctggttttgaactggagagctgcatattccaatggagacagttggaaaacctg tacttcagagaaaagaagttttccgtggaagttcatgacccacgcagggcttcagtgaca aggaggacgtttgggcacagcggcattgcagtgcacacgtggtatgcatgtccggcattg atcaagtccatctgggctatggccataagccaacaccagttctatctggacagaaagcag agtaagcccgcagacttccatcgtgttaacatgaggtcggaagtcgctgtccttgccagt gtggacagcaacgtcgagggcagaggaagggcagagctgcaggcattcagacagccctgg ctggacagtaagggcccccaggcacggaagagagtttcggtgattaatgatgaggccaag ggatgcaaacaaaaatgctgcacattagtcagggagataagtccctttgcggaatgtcaa gacggtgagcaggcgcgggcgagggatgactgtcctttccgcaagtccaaaatccatgca gcacgcagcctgagtgagatcgccatcgacctgaccgagacggggacgctgaagacctcg aagctggccaacatgggtagcaaggggaagatcatcagcggcagcagcggcagcctgctg tcttcagtacctgagaaccatgaggtcacggatgaagctttcttgtgtccagttgcattt ggagacctgcaactgaagtatgtcaaaatccaggaaacagctacgaaaataattcaaagc tcggaaggttctcaggaatcagatagctcgcagtcggccaagaaggacatgctggctgcc ttgaagtccaggcaggaagctctggaggaaaccctgcgtcagaggctggaggaactgaag aagctgtgtctccgagaagctgagctcacgggcaagctgccagtagaatatcccctggat ccaggggaggaaccacccattgttcggagaagaataggaacagccttcaaactggatgaa cagaaaatcctgcccaaaggagaggaagctgagctggaacgcctggaacgagagtttgcc attcagtcccagattacggaggccgcccgccgcctagccagtgaccccaacgtcagcaaa aaactgaagaaacaaaggaaaacctcgtatctgaatgcactgaagaaactgcaggagatt gaaaatgcaatcaatgagaaccgcatcaagtctgggaagaaacccacccagagggcttcg ctgatcatagacggtcagtgccaaccgggctgcccccaccccactcccacgcccgggccc cggcaaccagacgccaggtcccacgagtggtccctgcacgacctgagcagctgcctaact gtccgctgccttcatttgtccatccataaaatgaagcgatccctggagggactccgacag atgcactatcaccgcaacgactatgacaagtcacccatcaagcccaaaatgtggagtgag tcctctttagatgaaccctatgagaaggtcaagaagcgctcctctcacagccattccagc agccacaagcgcttccccagcacaggaagctgtgcggaagccggcggaggaagcaactcc ttgcagaacagccccatccgcggcctcccgcactggaactcccagtccagcatgccgtcc acgccagacctgcgggtccggagtccccactacgtccattccacgaggtcggtggacatc agccccacccgactgcacagcctcgcactgcactttaggcaccggagctccagcctggag tcccagggcaagctcctgggctcggaaaacgacaccgggagccccgacttctacaccccg cggactcgtagcagcaacggctcagaccccatggacgactgctcgtcgtgcaccagccac tcgagctcggagcactactacccggcgcagatgaacgccaactactccacgctggccgag gactcgccgtccaaggcgcgccagaggcagaggcagcggcagcgggcggcgggcgcactg ggctcagccagctcgggcagcatgcccaacctggcggcgcgcgggggtgcggggggcgcg gggggcgcggggggcggtgtgtacctgcacagccagagccagcccagctcgcagtaccgc atcaaggagtacccgctgtacatcgagggcggcgccacgcccgtggtggtgcgcagcctg gagagcgaccaggagggccactacagcgtcaaggctcagttcaagacgtccaactcctac acggcgggcggcctgttcaaggagagctggcgcggcggcggcggcgacgagggcgacacg ggccgcctgacgccgtcgcgatcgcagatcctgcggactccgtcgctgggccgcgagggc gcccacgacaagggcgcgggccgtgccgccgtctcagacgagctgcgccagtggtaccag cgttccaccgcctcgcacaaggagcacagccgcctgtcgcacaccagctccacctcctcg gacagcggctcgcagtacagcacctcctcccagagcaccttcgtggcgcacagcagggtc accaggatgccccagatgtgcaaggccacgtcagagagcctgaaaaactgccccaatgcc acggtaaaggcgaggaagtcttggctggcgttgctgactcacagtcgccatccatctgga cacaaagagagacctgtgggagtcgtagagggaggatgtggcaaagtccgattttattcc agtcgaaatgaacaaagacctcagcaccatggcacacttgccacaagtcacttgatgatt ttgaaggtcaagaagtttgagtaa