GENSCAN 1.0    Date run: 3-Nov-116    Time: 16:09:25

Sequence gi568815596r:61726458_61954041 : 227584 bp : 42.43% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +    483    522   40                              -3.55
 1.01 Init +   5419   5499   81  0  0   45   94    64 0.111   3.72
 1.02 Term +  13684  13833  150  0  0   47   40   122 0.300   0.23
 1.03 PlyA +  13943  13948    6                               1.05

 2.00 Prom +  15981  16020   40                              -2.45
 2.01 Init +  19578  19640   63  2  0   65   65   100 0.504   6.60
 2.02 Term +  37421  37528  108  1  0   87   48   163 0.791   9.73
 2.03 PlyA +  38917  38922    6                               1.05

 3.00 Prom +  38940  38979   40                              -6.35
 3.01 Init +  41831  41971  141  1  0  110   35   133 0.466  10.39
 3.02 Intr +  47174  47245   72  1  0   95   88    25 0.295   1.88
 3.03 Intr +  64360  64468  109  0  1   75   71    63 0.019   2.24
 3.04 Term +  66028  66212  185  2  2   82   44    81 0.024  -0.18
 3.05 PlyA +  66477  66482    6                               1.05

 4.00 Prom +  71751  71790   40                              -7.55
 4.01 Init +  72062  72268  207  1  0   59   23   188 0.708   8.27
 4.02 Intr +  79159  79329  171  0  0   38   23   129 0.287   0.62
 4.03 Intr +  90771  90918  148  2  1   38   47   135 0.444   3.29
 4.04 Term +  93769  94139  371  2  2   29   36   359 0.963  18.62
 4.05 PlyA +  94322  94327    6                               1.05

 5.25 PlyA -  95093  95088    6                               1.05
 5.24 Term - 100142  99998  145  1  1   57   41   209 0.984   9.50
 5.23 Intr - 100801 100647  155  1  2   68   87   153 0.147  11.15
 5.22 Intr - 106449 106371   79  2  1   79   23    47 0.064  -4.07
 5.21 Intr - 112248 112081  168  2  0   44   69    99 0.636   1.74
 5.20 Intr - 114124 112964 1161  0  0   54  115   905 0.826  76.80
 5.19 Intr - 115903 115665  239  1  2   72  115   176 0.824  14.19
 5.18 Intr - 127752 127402  351  0  0   57   15   317 0.082  15.89
 5.17 Intr - 132988 132709  280  0  1   43   39   198 0.143   6.96
 5.16 Intr - 133660 133524  137  1  2   55   67   111 0.204   4.15
 5.15 Intr - 143096 142983  114  2  0   76  103   122 0.998  12.22
 5.14 Intr - 145859 145625  235  1  1   49   84   353 0.999  27.77
 5.13 Intr - 146131 146001  131  1  2   27   80   130 0.999   4.67
 5.12 Intr - 146655 146545  111  0  0   80   87   100 0.992   8.76
 5.11 Intr - 146836 146740   97  0  1   65   93    56 0.887   2.89
 5.10 Intr - 149777 149638  140  2  2   67   70    58 0.884   0.24
 5.09 Intr - 150595 150463  133  0  1  116  105    90 0.996  13.23
 5.08 Intr - 151057 150936  122  1  2   84   94    86 0.998   7.17
 5.07 Intr - 152554 152412  143  2  2   52  100   127 0.999   9.45
 5.06 Intr - 153937 153829  109  1  1   58   98   106 0.998   7.54
 5.05 Intr - 157091 157002   90  2  0   86  115    97 0.999  11.47
 5.04 Intr - 158615 158563   53  0  2   89   58    63 0.989   1.11
 5.03 Intr - 161692 161567  126  2  0   75   35   106 0.913   3.73
 5.02 Intr - 162313 161924  390  2  0   80   64   356 0.823  26.17
 5.01 Init - 169481 169328  154  2  1   86  107    49 0.588   6.80
 5.00 Prom - 174810 174771   40                              -7.95

 6.00 Prom + 178718 178757   40                              -4.25
 6.01 Init + 179222 179401  180  1  0   82   79   192 0.772  16.03
 6.02 Term + 185443 185466   24  0  0  111   37     1 0.133  -5.45
 6.03 PlyA + 185981 185986    6                               1.05

 7.00 Prom + 192277 192316   40                              -8.05
 7.01 Init + 197986 198091  106  0  1   77  105    96 0.325  10.63
 7.02 Intr + 206243 206366  124  0  1   68   33   123 0.408   3.52
 7.03 Intr + 212198 212309  112  2  1   79   36   135 0.410   6.86
 7.04 Term + 216521 216664  144  1  0   34   44   135 0.459   0.63
 7.05 PlyA + 218900 218905    6                               1.05

 8.02 PlyA - 219136 219131    6                               1.05
 8.01 Term - 224760 224547  214  2  1   62   40   166 0.437   4.92

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 100789 100647  143  1  2   76   87   143 0.853  12.65

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:61726458_61954041|GENSCAN_predicted_peptide_1|76_aa
MVTGVSLESDKLEVRAPGRGVQQWRCQSFARVWKWPDATLRHLGMLAAPDCLLIGPDTSS
TVPKSFLERRVVDVPY

>gi568815596r:61726458_61954041|GENSCAN_predicted_CDS_1|231_bp
atggtaacaggggtctccctggagtctgacaaattggaagtaagggcacctggaagagga
gtccagcagtggagatgccagagttttgctcgtgtttggaagtggccagatgctacactt
aggcatcttggaatgctagcagcccccgattgcctgctcattggtccagatacttcgagc
actgtcccaaaaagttttttagaaaggagagttgtagatgttccttattga

>gi568815596r:61726458_61954041|GENSCAN_predicted_peptide_2|56_aa
MKKKKKRKKKKEKKKGMKGKKLSEEPSYRKHATSGLGILGIGRTARDLRRRKTVTF

>gi568815596r:61726458_61954041|GENSCAN_predicted_CDS_2|171_bp
atgaagaagaagaagaagaggaagaagaagaaggagaagaagaaaggaatgaagggaaaa
aagctctccgaggagcccagctaccggaaacacgccacatcaggcctcggaattctgggc
atcggacggacagctcgtgacctcagaaggaggaaaacggtcaccttctga

>gi568815596r:61726458_61954041|GENSCAN_predicted_peptide_3|168_aa
MPGLQAVYSSAALASASGENLRKLPLLVEGEGKPVCHMARGGTRGKEAPSPTCGLLLNMR
FARGQIPKLYQRAYNGESMEGKRIVIIWKAAQSRCCTMELRQVHSIVDQNLNILIRGVHI
DNVASGCLDVDAKSQELSEKASCLLSGSLGCPLEVAALPAGREKVMGS

>gi568815596r:61726458_61954041|GENSCAN_predicted_CDS_3|507_bp
atgcctggcctgcaggctgtatattcatcagcagcactggcatctgcttctggtgagaac
ctcaggaagcttccactcttggtagaaggtgaagggaagccagtgtgtcacatggcaaga
gggggaacaagaggaaaggaggccccatctccaacatgtgggttacttctcaacatgaga
tttgcacggggacaaatacccaaactatatcagagagcttacaatggtgagagcatggaa
ggcaaaagaatagtcatcatctggaaagcagctcaaagtagatgttgtacaatggagctc
aggcaagttcacagcatcgtagaccaaaacttgaacattcttatccgtggggtccacata
gacaatgtggcatcaggctgtctggatgtagatgccaagagtcaggagttatctgagaag
gccagttgcctcttatctggaagtcttgggtgcccactggaagttgcagcccttccagcg
gggagggagaaagtcatgggaagctga

>gi568815596r:61726458_61954041|GENSCAN_predicted_peptide_4|298_aa
MNGINVLTKGDPRELSGPPSAMQEYSKKMAVSEPGSGPSDTDFARALLLDSPSSRTMRNK
SVVEATQSMINTVLPPIANCRVNVWLYRARCADPSSVLYQAQVPALMELQLPMICCSAQN
RLTQRQIILAAFQNQSSQNRQAALKENPAEKPQTEDNTINTAQGNKKMQECFSVPAKQLW
QLFTGGSSFETTDERLRSHSEQWGMLTGCVVMRDPNTKCSRGFGFVTYATVEEVDAAMNA
RPHKVDGRAVEPKRAVSREGSQRPGAHLTVKKVFVAGIKEDTEEHHLRDSRINVNTAR

>gi568815596r:61726458_61954041|GENSCAN_predicted_CDS_4|897_bp
atgaatgggattaatgtccttacaaaaggggaccccagagagctctctggccctccttct
gctatgcaagaatacagcaagaagatggctgtgtctgaaccaggaagtggaccctcagac
accgactttgctagagccttgctcttagactccccatcctccagaactatgagaaataaa
tctgttgttgaagccactcagtctatgataaataccgttcttcctcctattgcaaactgc
agagtgaatgtttggctttaccgagccaggtgtgcagaccccagttcagtcctgtaccag
gcacaggtccctgccctcatggagctgcaactgccaatgatctgttgcagtgcccaaaac
agacttacccagaggcagatcatattagcagctttccaaaaccagtcctcacaaaacaga
caagcggctttaaaagaaaatcctgcagagaaacctcagactgaagataatactattaac
acagcacaaggaaacaagaaaatgcaagaatgcttctctgtacctgccaagcagctgtgg
cagctcttcactggaggatcgagctttgaaacgactgatgagaggctgaggagccattct
gagcaatggggaatgctcacaggctgtgtggtcatgagagatcccaacaccaagtgctcc
aggggctttgggtttgtcacctatgccactgtggaggaggtggatgcagccatgaatgca
aggccacacaaggtggatgggagagctgtggaaccaaagagggctgtctcaagagaaggt
tctcaaagaccaggtgcccacctaactgtgaaaaaggtatttgttgctggcattaaagaa
gacactgaagaacatcacctaagagacagtaggataaatgtgaacactgctaggtaa

>gi568815596r:61726458_61954041|GENSCAN_predicted_peptide_5|1620_aa
MGSNGCGSCLVHGWTGLPLGPWSVGLVLGWRSADNAPITRCTDWCGFLLFPAPRAYNLAS
PEPTARILLARRAADSLLRHASAARQGRPPSPPPPPPDAGAAFWKVREGSEGLPLLHCGR
PESGSIRPSRANPDTAEFAMPENVAPRSGATAGAAGGRGKGAYQDRDKPAQIRFSNISAA
KVFHPALCIIHIGAESSSKCRGLHAVFTRDLTLEGCRLLLIGTAVADAIRTSLGPKGMDK
MIQDGKGDVTITNDGATILKQMQVLHPAARMLVELSKAQDIEAGDGTTSVVIIAGSLLDS
CTKLLQKGIHPTIISESFQKALEKGIEILTDMSRPVELSDRETLLNSATTSLNSKVVSQY
SSLLSPMSVNAVMKVIDPATATSVDLRDIKIVKKLGGTIDDCELVEGLVLTQKVSNSGIT
RVEKAKIGLIQFCLSAPKTDMDNQIVVSDYAQMDRVLREERAYILNLVKQIKKTGCNVLL
IQKSILRDALSDLALHFLNKMKIMVIKDIEREDIEFICKTIGTKPVAHIDQFTADMLGSA
ELAEEVNLNGSGKLLKITGCASPGKTVTIVVRGSNKLVIEEAERSIHDALCVIRCLVKKR
ALIAGGGAPEIELALRLTEYSRTLSGMESYCVRAFADAMEVIPSTLAENAGLNPISTVTE
LRNRHAQGEKTAGINVRKGGISNILEELVVQPLLVSVSALTLATETVRSILKIDDVVDPF
LCFDDDEREHLLGTWGILCLGSEEWEILCFVGGESDALCFGSWLLGYWGMMHFWQWSCWR
QQPVHIHLQGFAEGKNLLVFQGALDVVLGSFSRGQRATPTHDADECLFHLTGDLLPLWLG
GILQGDGHFIDSAVLPEWILRDFWSLAAPVASPVTAPIATGSLARRPRVARGRTSFPVPG
QRRRSGLGGGAMATSHRVAKLVASSLQTPVNPITGARVAQYEREDPLKALAAAEAILEDE
EEEKVAQPAGASADLNTSFSGVDEHAPISYEDFVNFPDIHHSNEEYFKKVEELKAAHIET
MAKLEKMYQDKLHLKEVQPVVIREDSLSDSSRSVSEKNSYHPVSLMTSFSEPDLGQSSSL
YVSSSEEELPNLEKEYPRKNRMMTYAKELINNMWTDFCVEDYIRCKDTGFHAAEKRRKKR
KEWVPTITVPEPFQMMIREQKKKEESMKSKSDIEMVHKALKKQEEDPEYKKKFRANPVPA
SVFLPLYHDLVKQKEERRRSLKEKSKEALLASQKPFKFIAREEQKRAAREKQLRDFLKYK
KKTNRFKARPIPRSTYGSTTNDKLKEEELYRNLRTQLRAQEHLQNSSPLPCRSACGCRNP
RCPEQAVKLKCKHKVRCPTPDFEDLPERYQKHLSEHKSPKLLTVCKPFDLHASPHASIKR
EKILADIEADEENLKETRWPYLSPRRKSPVRCAGVNPVPCNCNPPVPTVSSRGREQAVRR
SLEEKKMLEEERNRILTKQKQRMKELQKLLTTRAKAYDSHQSLAQISKSRVKCLRPAVLN
LFGARDWFYGRPFFHRPGLRGKNARMAAEKHYSNTLKALGISDEFVSKKGQSGKVLEYFN
NQETKSVTEDKESFNEEEKIEERENGEENYFIDTNSQDSYKEKDEANEESEEEKSVEESH

>gi568815596r:61726458_61954041|GENSCAN_predicted_CDS_5|4863_bp
atgggctctaatgggtgtggatcctgtctggtccatgggtggactggactgcccctagga
ccttggtcagtagggctggtgttgggatggaggtctgcagacaatgcgcctattaccaga
tgcacagactggtgtggcttcctcctgtttcctgcgccccgggcgtacaatctcgccagc
ccggagcccaccgcacgcatcctcctcgcccggcgcgcagcggacagcctcctacgtcac
gccagcgccgcgcggcaaggaaggcccccttctccgcctccgcctcctcccgacgccggc
gccgctttctggaaggttcgtgaaggcagtgagggcttaccgttattacactgcggccgg
ccagaatccgggtccatccgtccttcccgagccaacccagacacagcggagtttgccatg
cccgagaatgtggcaccccggagcggggcgactgccggggctgccggcggccgcgggaaa
ggcgcctatcaggaccgcgacaagccagcccagatccgcttcagcaacatttccgccgcc
aaagtttttcatcctgcactgtgcatcattcatattggagccgaaagcagttcaaaatgc
cgaggattgcatgcagtcttcactagggatctgacgcttgagggctgtcgcttattgctt
attggaactgcggttgctgatgctattagaacaagccttggaccaaaaggaatggataaa
atgattcaagatggaaaaggtgatgtaaccattacaaatgatggtgctaccattctgaaa
caaatgcaagtattacatccagcagccagaatgctggtggagctgtctaaggctcaagat
atagaagcaggagatggcaccacatcagtagtcatcattgctggctccctcttagattct
tgtaccaagcttcttcagaaagggattcatccaaccatcatttctgagtcattccagaag
gccctggaaaagggcattgaaatcttgactgacatgtctcgacctgtggaactgagtgac
agagaaactttgttaaatagtgcaaccacttcactgaactcaaaggtggtttctcagtat
tcaagtctgctttctccaatgagtgtaaatgcagtgatgaaagtgattgacccagccaca
gccaccagtgtagatcttagagatattaaaatagttaagaagcttggtgggacaattgat
gactgtgagttggtggaagggctggttctcacccaaaaagtgtcaaattctggcataacc
agagttgaaaaggccaagattgggcttattcagttttgcttatctgctcccaaaacagac
atggataatcaaatagtggtttctgactatgcccagatggaccgagtgctgcgagaagag
agagcctatattttaaatttagtgaagcaaattaaaaaaacaggatgtaatgtccttctc
atacagaaatctattctaagagatgctcttagtgatcttgcattacactttctgaataaa
atgaagatcatggtgattaaggatattgaaagagaagacattgaattcatttgtaagaca
attggaaccaagccagttgctcatattgaccaatttactgctgacatgctgggttctgct
gagttagctgaggaggtcaatttaaatggttctggcaaactgctcaagattacaggctgt
gccagccctggaaaaacagttacaattgttgttcgtggttctaacaaactggtgattgaa
gaagctgagcgctccattcatgatgccctatgtgttattcgttgtttagtgaagaagagg
gctcttattgcaggaggtggtgctccagaaatagagttggccctacgattaactgaatat
tcacgaacactgagtggtatggaatcctactgcgttcgtgcttttgcagatgctatggag
gtcattccatctacactagctgaaaatgccggcctgaatcccatttctacagtaacagaa
ctaagaaaccggcatgcccagggagaaaaaactgcaggcattaatgtccgaaagggtggt
atttccaacattttggaggaactggttgtccagcctctgttggtatcagtcagtgctctg
actcttgcaactgaaactgttcggagcattctgaaaatagatgatgtggtagaccctttc
ctgtgctttgatgatgatgaaagagaacatctcttggggacttggggaatcctttgtttg
ggaagtgaagaatgggagatcctgtgctttgtgggtggagaaagtgatgccctttgtttt
ggaagttggctgctggggtattggggaatgatgcatttttggcagtggagttgctggagg
caacagccagtgcacattcaccttcaaggatttgctgaaggaaagaacttactagttttt
cagggggcgctggatgtcgtcttgggaagcttctcaagaggacaacgagctactcctaca
cacgatgctgatgagtgcctttttcatctcactggagatctgctcccgctatggcttggt
ggaatccttcaaggagatggccactttattgactctgctgttcttcctgagtggattttg
cgtgacttttggtccctggcggctcctgtagcgtccccagttacggcgcccatagcaacc
ggctccctagctaggcgcccccgggttgccaggggccgcaccagctttcccgtcccgggc
cagcgcaggcgctcaggcctcggaggcggggcgatggccacctcccaccgagtggcgaag
ctggtggcctccagtctccagaccccggtaaatcccatcactggagcgcgggtcgcccag
tacgaacgcgaagaccccttaaaggccctggcggcagcggaggcgatcttggaggacgaa
gaggaggagaaagtggctcagcccgctggggcatcggctgatttgaacaccagcttttct
ggggtggatgaacatgcaccgataagctatgaggactttgtgaactttcctgatattcac
cactctaatgaggagtatttcaagaaagtagaagagttgaaggctgcccacatagaaact
atggcaaaattagagaaaatgtaccaggataaattacatttaaaggaagttcagccagtg
gtcatcagagaagactctcttagtgactcttccagatctgtatcagaaaagaactcctat
caccctgtctcattaatgacatcattttcagagcctgatttaggccagtcttcctccttg
tatgtgtcctcctctgaagaggagttacccaacctagaaaaagagtatcctaggaaaaac
agaatgatgacctatgctaaggagctcatcaacaatatgtggacagacttttgtgttgag
gattatattcgctgtaaagatactggcttccatgcagctgaaaaaagaaggaagaaacga
aaagaatgggtgcccacaattacagtaccggagccttttcaaatgatgataagagaacag
aagaaaaaagaagagtccatgaaatctaaatcagatatcgaaatggtacataaagcgctc
aaaaaacaagaagaggatccagagtataagaagaaattccgagccaatccagttcctgca
tctgtctttctccccctttaccatgatttagtcaagcaaaaagaagaacggagaaggtct
ctgaaggagaaaagcaaagaagctcttttggcctcacaaaagccatttaaatttatagca
agggaggaacagaagcgagcagcccgggaaaagcagctgagagactttcttaagtataaa
aagaaaacaaatcgatttaaagccagacccattcctcgatctacttatggttcaactacc
aatgacaagttaaaagaagaagagctctatcgaaaccttaggacacagctgagagcccag
gagcatttacagaactcatctcctctgccttgtaggtcagcttgtggatgcaggaacccc
aggtgtcctgaacaggctgtaaagttgaagtgtaaacacaaggttaggtgcccaactcct
gattttgaggaccttcctgagagataccagaaacacctctcagaacacaagtctccaaaa
ctcttaacagtgtgtaaaccatttgatcttcatgcatctccacatgcatctattaaaaga
gaaaaaattttggcagacatcgaagcagatgaagaaaatttaaaagaaacacgttggcct
tatttgtctccaaggcgtaagtcaccagtaagatgtgcaggtgtaaaccctgtgccttgt
aactgcaaccctcccgtgcccacggtatcttccagaggacgagaacaagccgtaaggaga
tcacttgaggaaaagaaaatgttggaagaagagagaaatcggatcctaactaaacagaag
caaagaatgaaagaattgcagaaactcctgacaacccgggctaaggcttatgactcacat
caaagtttagctcaaatatctaaatccagagtaaaatgtctcaggccagcagtcctcaac
ctttttggcgctagagactggttttatggaagaccatttttccacagaccagggttgcgt
gggaaaaatgcaagaatggcagcagaaaagcattattctaataccctaaaagcactagga
atatctgatgagtttgtttcaaagaaaggccaaagtggaaaagtacttgagtacttcaac
aatcaagagacgaaaagtgtcactgaagacaaagaaagctttaatgaagaagaaaaaata
gaagaaagagagaatggggaagaaaattattttattgataccaacagccaggattcttac
aaggaaaaagatgaagccaatgaggaaagtgaagaagagaaatctgttgaagaatcacac
tga

>gi568815596r:61726458_61954041|GENSCAN_predicted_peptide_6|67_aa
MAAGELEGGKPLSGLLNALAQDTFHGYPGITEELLRSQLYPEVPPEEFRPFLAKMRGILK
HSNLVLI

>gi568815596r:61726458_61954041|GENSCAN_predicted_CDS_6|204_bp
atggcggcgggcgagcttgagggtggcaaacccctgagcgggctgctgaatgcgctggcc
caggacactttccacgggtaccccggcatcacagaggagctgctacggagccagctatat
ccagaggtgccacccgaggagttccgcccctttctggcaaagatgagggggattcttaag
cactctaatcttgtccttatttaa

>gi568815596r:61726458_61954041|GENSCAN_predicted_peptide_7|161_aa
MLFGGGLGCCRGGEKSARSKEEKSSARAVGVREAGGVWLVYLAAVCPNPLHEGAHERVQE
REQMNAGSSPVDPLWQDQILSCGEARTEVAVDSYGFAASNLDLFHWLTGCQMRENSSHEI
LTETYQFANNSKKKRLDANSGFYLSRKVTADLKQAEKKAER

>gi568815596r:61726458_61954041|GENSCAN_predicted_CDS_7|486_bp
atgctgttcggtggaggattaggatgttgtcgaggaggggagaaatcagccagatcaaag
gaggagaagtcgtcggctagggctgttggggtgagggaagcaggaggggtgtggcttgtt
tacttggctgccgtgtgcccaaaccccttgcatgagggagcacatgagcgagtgcaggaa
cgagagcaaatgaatgctggaagcagcccagttgatcctctctggcaggatcagattctt
tcttgtggagaggcaaggactgaagttgctgtggactcctacggatttgctgccagtaac
ttggatctcttccactggttaacaggatgccaaatgagagagaactctagccatgagata
ttaacagagacttaccagtttgcaaacaatagtaaaaagaaacggttggatgcaaacagt
ggattttatctcagtagaaaagtaacagcagacttaaagcaggctgaaaagaaagcagag
agatag

>gi568815596r:61726458_61954041|GENSCAN_predicted_peptide_8|71_aa
XAGITDISHCAQPQKDILPELPLSNFPLESPAVINPLSVGVSWNSAAIESLPIAIGSKKA
NRTIYTFQLKP

>gi568815596r:61726458_61954041|GENSCAN_predicted_CDS_8|216_bp
nntgctgggattacagatataagccactgtgcccagccgcagaaagacatattacctgaa
cttcccttatccaacttccctctggaaagcccagctgtcattaaccctctctccgtggga
gtttcctggaattcagctgccatagagagtctgcccattgccattggcagcaaaaaggcc
aacagaaccatctacaccttccaattgaagccctaa