GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:09:17 Sequence gi568815591f:116134532_116357479 : 222948 bp : 37.43% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 11640 11667 28 2 1 80 108 29 0.122 3.91 1.02 Term + 26077 26189 113 2 2 88 36 117 0.238 4.24 1.03 PlyA + 26196 26201 6 1.05 2.00 Prom + 28287 28326 40 -4.85 2.01 Init + 29771 29831 61 1 1 75 64 75 0.923 5.26 2.02 Intr + 36052 36145 94 2 1 85 86 60 0.789 3.60 2.03 Intr + 37246 37433 188 1 2 8 100 95 0.621 1.11 2.04 Intr + 39322 39413 92 2 2 119 55 64 0.539 4.99 2.05 Intr + 45401 45425 25 1 1 81 119 -1 0.094 -1.02 2.06 Intr + 46719 46894 176 1 2 52 78 131 0.087 7.24 2.07 Intr + 51082 51322 241 0 1 -5 89 238 0.051 10.80 2.08 Intr + 51643 51836 194 2 2 82 81 80 0.682 4.99 2.09 Term + 51923 52393 471 1 0 47 35 131 0.712 -2.36 2.10 PlyA + 52478 52483 6 1.05 3.00 Prom + 54354 54393 40 -4.45 3.01 Init + 64253 64316 64 1 1 111 62 64 0.919 7.46 3.02 Term + 67225 67262 38 2 2 108 40 49 0.772 -1.38 3.03 PlyA + 67602 67607 6 1.05 4.05 PlyA - 67963 67958 6 1.05 4.04 Term - 73536 73474 63 0 0 96 36 57 0.552 -1.79 4.03 Intr - 76182 75883 300 0 0 115 100 322 0.770 32.11 4.02 Intr - 95830 95709 122 2 2 59 108 37 0.355 2.09 4.01 Init - 104520 104484 37 0 1 59 113 20 0.274 1.84 4.00 Prom - 106045 106006 40 -4.55 5.00 Prom + 109277 109316 40 -6.55 5.01 Init + 109926 109975 50 2 2 66 98 22 0.446 1.48 5.02 Intr + 114489 114741 253 0 1 55 27 292 0.998 16.31 5.03 Intr + 115630 115965 336 0 0 79 92 441 0.999 38.39 5.04 Intr + 117229 117444 216 0 0 67 87 179 0.998 13.48 5.05 Intr + 117787 117945 159 0 0 81 69 86 0.961 5.26 5.06 Term + 122763 122951 189 2 0 111 49 52 0.797 0.07 5.07 PlyA + 123030 123035 6 1.05 6.00 Prom + 125630 125669 40 -1.15 6.01 Init + 131671 131772 102 0 0 81 4 153 0.631 6.49 6.02 Term + 131951 132160 210 1 0 -32 44 241 0.909 4.01 6.03 PlyA + 132366 132371 6 1.05 7.04 PlyA - 132606 132601 6 1.05 7.03 Term - 153111 152992 120 0 0 108 38 63 0.030 0.79 7.02 Intr - 157698 157588 111 0 0 45 89 52 0.025 0.66 7.01 Init - 171226 171128 99 1 0 88 92 57 0.241 6.41 7.00 Prom - 178938 178899 40 -3.95 8.00 Prom + 180340 180379 40 -8.25 8.01 Init + 180908 180974 67 1 1 70 105 68 0.815 8.06 8.02 Intr + 185110 185228 119 2 2 54 56 111 0.239 3.76 8.03 Intr + 200065 200195 131 0 2 7 4 130 0.667 -4.93 8.04 Term + 201030 201240 211 0 1 106 54 174 0.992 11.58 8.05 PlyA + 201604 201609 6 1.05 9.00 Prom + 202102 202141 40 -8.05 9.01 Init + 204083 204252 170 1 2 85 94 105 0.769 9.85 9.02 Intr + 215566 215654 89 1 2 49 53 104 0.017 1.70 9.03 Term + 220283 220335 53 0 2 107 43 23 0.028 -3.69 9.04 PlyA + 220355 220360 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:116134532_116357479|GENSCAN_predicted_peptide_1|46_aa MSNCRSTKTTDSVITKAGVQMERPSAQASSGSDEQSFLISYVGLVA >gi568815591f:116134532_116357479|GENSCAN_predicted_CDS_1|141_bp atgagcaactgcagatctacaaagacaactgactcagtgatcaccaaagcaggtgtccag atggagcgtccatcagctcaggcctcaagtggttctgatgaacaaagcttccttataagc tatgttggactcgtagcataa >gi568815591f:116134532_116357479|GENSCAN_predicted_peptide_2|513_aa MNPDRRLGNSAMGNLTNHHKRLGCTIGKKGVGGEEGECAGTGERSSDKTSPGGLKKSINV HQFESQNLRLRWIQGVMILPVKSSRGFAREDSRVLNGECSGMECLELEGTQGFKEQREKR AIQVKGRELTVECQEMLTEMNRTGQHLPLCLTPDVFGVSSFWFVRGVAGSEAKLHTFAVS VTALKAVRLELFVSSSGFLVSLASGAKLQTFALQVPAQVLAPRKAMAGPDALHAASAVGT HNWMRETQWHLKIWRCQEPQSPKEGVTALAQGAPKSGLPEGLQLFCPHHPQHAGNPDVCV SLAGFFVVSEGRKCMLIGPWAAMDGPAKSTISSHSRLWTPPETDSPAPGFQAIHGLKAVP TEEHLKACAELPSAPIGLPPMLVGTQSPEEAQAAGGWPVSATPSACTPGWVATTPRLSHN CAAPWSGCWELGENRQWEQALASLWEQGAFWAPKSTGMPSWVAAAVGAWGSCPADSVEAR AATCSGPHWLHAACSPTVPPPLQPVSSQWSLHG >gi568815591f:116134532_116357479|GENSCAN_predicted_CDS_2|1542_bp atgaatccggatagacgtctaggaaactctgccatgggtaacttgacaaatcatcacaaa cgcttgggatgcacaattgggaagaagggtgttgggggtgaggagggggaatgtgcagga actggggaaagaagttcagataagacaagtccaggagggctgaagaagagcatcaacgtc caccagttcgaatcccagaatctcagactcaggtggatccaaggagtgatgatccttcct gtgaaaagcagtagaggttttgcaagggaagatagtagagtattgaatggagaatgttca ggaatggaatgtttggagctggagggcacacaggggtttaaagaacaaagagagaaaaga gcaatacaagtgaaaggaagagaattaacagttgagtgtcaagagatgctgactgaaatg aacagaacagggcaacacctacccctgtgcctcacacctgatgtgttcggagtttcttcc ttctggtttgttcgtggtgtcgctggctcagaagcgaagctgcacaccttcgcggtcagc gttacagctctcaaggcggtgcgtctggagttgttcgtttcttccagtgggttcttggtc tcactggcttcaggagcgaagctgcagacctttgccctccaggtgccggcacaggtgcta gctccacgcaaagctatggctggaccagatgcactgcacgcagcttctgctgtgggcacc cacaactggatgagggaaacgcagtggcacctgaaaatttggagatgccaggaaccacag agcccaaaagagggtgtcacagccctggctcagggagcccctaagtctgggctccccgaa ggtctgcagctcttctgtcctcatcacccacaacatgctggtaatcctgatgtctgtgtg agtctggctgggttttttgtggtctcagaagggaggaaatgtatgctgattggtccatgg gcagccatggacgggcctgcaaaaagcaccataagttctcactccaggctgtggactcca cctgaaactgacagtccagcccccggttttcaggccatccatggcttgaaggcagttcct actgaagagcacctgaaggcctgtgctgagctgccctcagcccctattggcctccctccc atgctcgttggcacccaaagtccagaggaggcccaggcagcagggggctggcctgtcagt gccaccccaagtgcatgcacacctggctgggttgcaacaacacccaggctcagccacaac tgtgctgcaccctggagtgggtgctgggagttgggagagaataggcaatgggagcaggca cttgccagcctgtgggagcaaggagctttctgggcccccaagagcacagggatgcccagc tgggtggctgcagctgtaggagcatggggctcctgcccagctgactcagtagaggccagg gctgccacctgttctggcccccactggctccatgcagcatgcagccccactgtgcctccg ccactgcagccagtgtcttcccagtggtcactccatggttag >gi568815591f:116134532_116357479|GENSCAN_predicted_peptide_3|33_aa MEYFWWKKKMIDLETGMNSPEEFYIEEPRSFDL >gi568815591f:116134532_116357479|GENSCAN_predicted_CDS_3|102_bp atggagtatttttggtggaaaaagaagatgattgacttggaaaccggaatgaatagccca gaagaattttatattgaagaaccaagatcatttgatctgtag >gi568815591f:116134532_116357479|GENSCAN_predicted_peptide_4|173_aa MSLKADSSLALKVRFVTAWKKCPSTFPNTNTFIFPTCSDLSVRELLNITDREQVHVNASY GSPPPPIRDPTAGFSWPPAPLRTRETCGTPAAPLNSHSPGAVELPPPSPPQRACGPDPGT KGAGQPMAAGQLRAAANHRAPHAPSRPERGAGKSALVNVGLDFSTKLMDTSAK >gi568815591f:116134532_116357479|GENSCAN_predicted_CDS_4|522_bp atgagcttgaaagcagattcttccctcgcacttaaagttagatttgtgactgcctggaaa aagtgcccctctacctttccaaacacaaacactttcattttccccacatgctcagaccta agtgtgagggaacttttaaacatcacagacagagagcaagtccatgttaacgcgtcctat gggtcccctcctcctcctatccgggatcccacggccgggttcagctggcctccggctccg ctgcgaacacgggaaacctgcggaactcccgcggcgccgctcaacagccactcgcccggc gccgtcgaacttccgccgcccagtccgccgcagcgggcctgcggcccggacccgggaaca aagggggccgggcagccaatggcagccgggcagctccgggccgccgccaatcaccgagcg ccgcacgcaccttcgcggcccgagcgcggcgctggcaagagtgctttagtcaatgttgga cttgacttttccacaaaacttatggacacatcagccaaataa >gi568815591f:116134532_116357479|GENSCAN_predicted_peptide_5|400_aa MSFDFMSHIQGVLMQKRKICRNCKCGQEEHDVLLSNEEDRKVGKLFEDTKYTTLIAKLKS DGIPMYKRNVMILTNPVAAKKNVSINTVTYEWAPPVQNQALARQYMQMLPKEKQPVAGSE GAQYRKKQLAKQLPAHDQDPSKCHELSPREVKEMEQFVKKYKSEALGVGDVKLPCEMDAQ GPKQMNIPGGDRSTPAAVGAMEDKSAEHKRTQYSCYCCKLSMKEGDPAIYAERAGYDKLW HPACFVCSTCHELLVDMIYFWKNEKLYCGRHYCDSEKPRCAGCDELIFSNEYTQAENQNW HLKHFCCFDCDSILAGEIYVMVNDKPVCKPCYVKNHAVVCQGCHNAIDPEVQRVTYNNFS WHASTECFLCSCCSKCLIGQKFMPVEGMVFCSVECKKRMS >gi568815591f:116134532_116357479|GENSCAN_predicted_CDS_5|1203_bp atgtcctttgacttcatgtctcacatccagggtgtgctaatgcaaaaaagaaaaatatgt cgtaactgcaagtgtggccaagaagagcatgatgtcctcttgagcaatgaagaggatcga aaagtgggaaaactttttgaagacaccaagtataccactctgattgcaaaactaaagtca gatggaattcccatgtataaacgcaatgttatgatattgacgaatccagttgctgccaag aagaatgtctccatcaatacagttacctatgagtgggctcctcctgtccagaatcaagca ttggccaggcagtacatgcagatgctacccaaggaaaagcagccagtagcaggctcagag ggggcacagtaccggaagaagcagctggcaaagcagctccctgcacatgaccaggaccct tcaaagtgccatgagttgtctcccagagaggtgaaggagatggagcagtttgtgaagaaa tataagagcgaagctctgggagtaggagatgtcaaacttccctgtgagatggatgcccaa ggccccaaacaaatgaacattcctggaggggatagaagcaccccagcagcagtgggggcc atggaggacaaatctgctgagcacaaaagaactcaatattcctgctattgctgcaaactg agtatgaaagaaggtgacccagccatctatgccgaaagggctggctatgataaactgtgg cacccagcttgttttgtctgcagcacctgccatgaactcctggttgacatgatttatttt tggaagaatgagaagctatactgtggcagacattactgtgacagcgagaaaccccgatgt gctggctgtgacgagctgatattcagcaatgagtatacccaggcagaaaaccagaattgg cacctgaaacacttctgctgctttgactgtgatagcattctagctggggagatatacgtg atggtcaatgacaagcccgtgtgcaagccctgctatgtgaagaatcacgctgtggtgtgt caaggatgccacaatgccatcgacccagaagtgcagcgggtgacctataacaatttcagc tggcatgcatccacagagtgctttctgtgctcttgctgcagcaaatgcctcattgggcag aagttcatgccagtagaagggatggttttctgttcagtggaatgtaagaagaggatgtct tag >gi568815591f:116134532_116357479|GENSCAN_predicted_peptide_6|103_aa MQVTENENNEDLSLTGDNGSDAKAAVCDVKISSLGRAEKKNDQFTWEYVELEASGGVQQA VWDLKVDAQEKTLARDIDEEVNRQKLPFTSDNSLLGLQSPYQM >gi568815591f:116134532_116357479|GENSCAN_predicted_CDS_6|312_bp atgcaggtaacagaaaatgaaaataatgaagacctgtcactgacaggagacaacggaagt gatgccaaagcagctgtttgtgatgtcaaaatatcttctctaggaagagctgagaagaaa aatgatcagttcacttgggagtatgtggagcttgaggcaagtggaggagtacagcaggca gtttgggacctgaaagtggatgctcaagaaaaaactctggctagagatatagatgaagaa gtcaacagacaaaagttaccattcacatctgataatagcttactggggctccagtctcct tatcaaatgtga >gi568815591f:116134532_116357479|GENSCAN_predicted_peptide_7|109_aa MTTTNCTRSGKVMQFRFIGMQDLGNLTLTHLCQDSASDLKPAQHWVSLRVSCETAWLPPM FAQGSRALKSVYAQKSPYQRSFQNTLSQTADPIVSDMLTVMVNIECQLN >gi568815591f:116134532_116357479|GENSCAN_predicted_CDS_7|330_bp atgacaacaaccaattgcaccagaagtggaaaagtgatgcaattcaggttcatcggaatg caggatcttgggaatttgacattgacacacctatgtcaggacagtgcttcagacctgaag ccggcacagcattgggtttcactcagagtcagctgtgaaactgcctggctaccacctatg tttgctcaaggctctagggctctaaaatcagtctatgctcaaaagtcaccttaccagagg tcctttcagaacaccctgtcccaaacagcagaccctatagtctctgatatgcttactgtg atggttaatattgagtgtcagcttaattag >gi568815591f:116134532_116357479|GENSCAN_predicted_peptide_8|175_aa MPTQTRGMTGLPQTVPTGQSLPGHSVLPLLSKQQQRTILESGDGPHQTTKSADTLDFPAS RTVNQPPEPTTNFCVGDGGIQMPPVIMEAEADRGHDSSSLLKAEDRPKSCTVGITKEAFD DVPKRVPQSSHRRCKAYQRPRNVCVAQWGNVERPQKPRGDSDWEKEATQPRDFTP >gi568815591f:116134532_116357479|GENSCAN_predicted_CDS_8|528_bp atgccaacgcagacccggggaatgacaggacttccccagacagtccccaccgggcagagt ctgcccggacacagtgttctgcctctcctgagtaagcagcaacaacgcaccatcttggaa tcaggggacggccctcaccagacaactaaatctgcagacaccttggacttcccagcctcc agaactgttaatcagcctccagaacccaccaccaatttttgtgttggtgatgggggaatc cagatgcctccagtcattatggaagctgaagctgacagaggccatgactcctcatccctg ctgaaggcagaggataggccaaagagctgtacagtaggaattacaaaggaagcatttgat gatgtacccaagagggtgccccagtcctcccatagacgatgcaaggcataccagagaccc agaaatgtctgtgtggctcagtggggaaatgtggagaggccacaaaaacctagaggggac tctgattgggaaaaggaagctacacagcctagggactttactccttga >gi568815591f:116134532_116357479|GENSCAN_predicted_peptide_9|103_aa MALEVAMEINIQLMEEARERMQKPHQLFNYLGPEVTGATTSHILLVEISHMASPNCYFLT KMVTKVCCTGLHTPEHPPVSADTPTAGWFLCRNVREQDLSGSD >gi568815591f:116134532_116357479|GENSCAN_predicted_CDS_9|312_bp atggctttagaggttgccatggaaatcaacatccaactgatggaagaggcaagagaaagg atgcaaaaaccacaccagctgtttaattacctgggtccagaagtaacaggtgccactacc tctcacattctattagtggaaattagtcacatggcctctcctaactgctactttcttacc aagatggtcaccaaggtctgctgcacaggattgcatactccagagcaccctccagtcagt gcagacacaccaaccgctggctggttcctctgcaggaatgtgagggagcaagacttgtca ggttctgactaa