GENSCAN 1.0	Date run: 3-Nov-116	Time: 15:15:41

Sequence gi568815578f:56529545_56738010 : 208466 bp : 48.16% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   3798   4046  249  2  0   78  116   118 0.569   9.22
 1.02 Intr +   6628   6824  197  0  2   94   77    97 0.984   7.41
 1.03 Intr +   7020   7169  150  0  0   69   10   115 0.002   1.08
 1.04 Intr +  29663  29844  182  2  2   32   71    93 0.008   1.51
 1.05 Intr +  44962  45076  115  2  1  100   64    30 0.126   1.31
 1.06 Intr +  45606  45757  152  0  2   93    9    87 0.199   1.11
 1.07 Intr +  50305  50429  125  2  2   70   37    62 0.268  -0.40
 1.08 Intr +  50657  50791  135  1  0  111   98    61 0.644  10.16
 1.09 Intr +  65519  65586   68  1  2   81   77     5 0.004  -3.60
 1.10 Intr +  72303  72515  213  0  0   42   26   240 0.002  11.03
 1.11 Intr +  77112  77186   75  0  0   69  109    69 0.604   5.73
 1.12 Intr +  79314  79431  118  0  1   74   84    12 0.152  -0.13
 1.13 Intr +  97725  97856  132  2  0   71   82    22 0.007   0.74
 1.14 Intr +  99737 100155  419  1  2   48   52   279 0.151  13.12
 1.15 Intr + 101237 101444  208  2  1   22   18   189 0.126   4.38
 1.16 Intr + 101661 102312  652  2  1  129   94   746 0.846  70.78
 1.17 Intr + 103809 104025  217  1  1   89   75    69 0.610   3.26
 1.18 Intr + 104606 104724  119  2  2   39   86    45 0.344  -0.49
 1.19 Intr + 107066 107210  145  0  1   59  100    53 0.792   2.94
 1.20 Term + 108184 108469  286  1  1   63   38   229 0.721  10.28
 1.21 PlyA + 109711 109716    6                               1.05

 2.00 Prom + 122338 122377   40                              -4.96
 2.01 Init + 135012 135095   84  2  0  101   72    64 0.518   6.92
 2.02 Intr + 138046 138163  118  0  1   41   76    54 0.218  -0.46
 2.03 Intr + 143467 143559   93  2  0  103   83    16 0.208   2.54
 2.04 Intr + 154061 154190  130  0  1  129   52    71 0.933   7.35
 2.05 Term + 156342 156606  265  0  1   53   45   189 0.698   6.08
 2.06 PlyA + 157740 157745    6                               1.05

 3.09 PlyA - 157753 157748    6                               1.05
 3.08 Term - 159686 159561  126  2  0  112   48    90 0.852   5.68
 3.07 Intr - 168715 168603  113  2  2   78   72    81 0.009   5.60
 3.06 Intr - 171460 171247  214  1  1  -21  -85   348 0.009   4.89
 3.05 Intr - 171989 171801  189  2  0   81   44   177 0.644  12.38
 3.04 Intr - 173117 173013  105  2  0   52  105    31 0.445   1.61
 3.03 Intr - 174110 174008  103  1  1   61   32   119 0.704   3.78
 3.02 Intr - 177178 176861  318  0  0   59  113    89 0.686   3.57
 3.01 Init - 179783 179587  197  2  2   72   70    58 0.669   1.00
 3.00 Prom - 192569 192530   40                              -4.76

 4.00 Prom + 197012 197051   40                              -4.26
 4.01 Init + 205411 205578  168  0  0   57   76   153 0.804  10.53
 4.02 Intr + 207073 207125   53  0  2   59  110    80 0.683   5.01

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 171460 171224  237  1  0  -21   36   331 0.986  13.17

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578f:56529545_56738010|GENSCAN_predicted_peptide_1|1318_aa
MWTLKSSLVLLLCLTCSYAFMFSSLRQKTSEPQGKVPCGEHFRIRQNLPEHTQGWLGSKW
LWLLFAVVPFVILQCQRDSEKNKEQSPPGLRGFPFRTPLKKNQNASLYKDCVFNTLNELE
VELLKFVSEVQNLKGAMATGSGSNLKLRRRKSEWETSSHSSDGFVFASLMSSRAVDGEES
TESYIIVRDELVLPCNGRGEVLEHKVEMGVLAVAEEVGAGGATVSGNEFLKLAGPTKMQI
LRQFLQLSCGRSEDPTPSPETRPEGLLKNALGNNCFSRTLFGFPLAGWNPLCAAFAGSPL
PAGFGTHCPSSQIHSCQGHQEYPVHSTTGTSMSSRDGSSQLHLTQLTQERTNDPHALLFG
KGNEGFRTVGTGHPQGLFKQPKLLLQNLGTYTRGKRSRGIPCWKYPSPEGCWQNHHLIQN
KKSAPGAPESVKKNKVSPTGLYLQMPRTWDIMQRPSTWNRLKYVLTGNEVKKICMQRFIK
INGKVRTDITYSAGFMDVNSIDKSGENFRLIYDTKGRFAVHRITPEEAKAGSLVLPHPCP
EQKNPEEAFGQSRRGLIQLMGAFENLNVALAKNSWTLHPPFLHLPYSMYLYDSTLPGGEA
YRFLRAALASVAVGKESQGLGWGMSTFLPIPGPSVQEALGSPGYRQDTLFGRGFPRPRSG
LDTRGGSIYARRAADACPVTRTARPARGGGGGRRLVTVTPILDLPLGGWGDPGFNWRLFW
GTPDAMLWKITDNVKYEEDCEVSWGSGVQPRPAEDSPGGRGHWTEVGDEGIGALASRPCV
MGGLHEIALHRASGSFAPGSGSFAPGSGHGLFWPKTQGSDLAPPSASGLGAAPRPSPPGF
ENSFPQDRHDGSSNGNPRVPHLSSAGQHLYSPAPPLSHTGVAEYQPPPYFPPPYQQLAYS
QSADPYSHLGEAYAAAINPLHQPAPTGSQQQAWPGRQSQEGAGLPSHHGRPAGLLPHLSG
LEAGAVSARRDAYRRSDLLLPHAHALDAAGLAENLGLHDMPHQMDEVQVSGAAAPDRTCS
PYGLLPPPRTPLGSPELKGILSSLPQNVDDQHLLLHDQTVIRKGPISMTKNPLNLPCQKE
LVGAVMNPTEVFCSVPGRLSLLSSTSKYKVTVAEVQRRLSPPECLNASLLGGVLRRAKSK
NGGRSLREKLDKIGLNLPAGRRKAAHVTLLTSLVEGEAVHLARDFAYVCEAEFPSKPVAE YLTRPHLGGRNEMAARKNMLLAAQQLCKEFTELLSQDRTPHGTSRLAPVLETNIQNCLSH FSLITHGFGSQAICAAVSALQNYIKEALIVIDKSYMNPGDQSPADSNKTLEKMEKHRK >gi568815578f:56529545_56738010|GENSCAN_predicted_CDS_1|3957_bp atgtggacgctgaaatcgtccctggtcctgcttctgtgcctcacctgcagctatgccttt atgttctcttctctgagacagaaaactagcgaaccccaggggaaggtgccgtgtggagag cactttcggattcggcagaacctaccagagcacacccaaggctggcttgggagcaaatgg ctctggcttttgtttgctgttgtgccgtttgtgatactgcagtgtcaaagagacagtgag aagaataaggagcagagtcctcctggccttcgaggcttcccatttcgcactccactaaag aaaaatcaaaatgcttctctttacaaagactgtgtattcaataccttaaacgaacttgaa gtggagcttttgaaatttgtgtccgaagtgcagaatcttaaaggtgccatggcaacaggc agtggcagtaacctcaagcttcgaagaagaaaaagtgaatgggagaccagtagccatagc agtgatggttttgtttttgcctctctcatgtcatctagagctgtggatggtgaggagagc accgaaagctacataatagtacgtgatgagctcgtgcttccgtgcaatggccgcggggag gtgttggaacacaaggtggaaatgggtgtgttggcggtggctgaggaagtcggggcagga ggagccactgtttctgggaatgaattcctgaaacttgctggtcccaccaagatgcaaatc ttgcggcagtttttacagttatcctgtggaagaagtgaggatcccactccaagcccagag acaagacctgaaggattgctaaagaatgcccttggtaacaactgtttctccaggaccttg tttggcttccccctggctgggtggaacccgctgtgtgcagccttcgcagggagcccactt ccagctggctttggcacccactgtccctcctctcagattcactcttgtcaaggtcaccag gaatacccggtgcacagtacaacaggcacttccatgtcctcgcgggatgggtcttcccag ctgcacttgacgcagctgacccaggagagaaccaacgaccctcatgcattgctttttggc aagggaaatgaaggattcaggacagtgggcactgggcacccacagggtttattcaagcag ccgaagctcctgcttcagaaccttggcacgtacacccgtgggaagaggagtcggggcatc ccttgctggaaataccccagccctgaaggttgctggcaaaaccaccatttaattcagaac aaaaagtctgctccaggagcccctgagagcgtcaaaaagaacaaggtttcccccactgga ctttacctccagatgcccaggacatgggacatcatgcaaagacctagcacgtggaacaga cttaagtatgtcctcactggaaatgaagtaaagaagatttgcatgcagcggttcattaag atcaatggcaaggtccgtactgatataacctactctgctggattcatggatgtcaacagc attgacaagtcgggagagaatttccgtctgatctatgacaccaagggtcgctttgctgta catcgtattacacctgaggaggccaaggctgggagcctggtgctgccacacccttgccct gagcagaagaatccggaagaagcctttggccagagccgcaggggtctaattcaattgatg 
ggagcatttgagaacctaaatgtagcattagctaaaaactcttggacactacaccctccc ttcttgcatttaccctatagcatgtacctgtatgattctacgctcccaggcggggaggca tatcggttcctccgggcagctttggctagtgttgctgtgggaaaggagagccagggcctg ggatgggggatgagcaccttcttgcccattccgggccccagcgtgcaggaggcgctcggg tcccccggctatcgccaggacacactgttcgggcgcggctttccccgtccgcggagcggt cttgacactcgcggcggcagcatctacgctcgcagagccgccgatgcgtgtccagtgacc cggacagcaaggcccgcgcgcggcgggggcggcggcagacgcctggtcaccgtgaccccg attttggatttaccgcttgggggctggggggatcctggatttaactggcgactgttttgg gggacgccggacgccatgttgtggaaaataaccgataatgtcaagtacgaagaggactgc gaggtgagctggggctccggggtgcagccccgccccgccgaggacagtccgggaggcagg ggccactggaccgaggtcggggacgagggcataggagccctggcctctcgtccctgcgtc atgggcgggctccacgagatagctctgcaccgggcgtccggctccttcgccccgggctct ggctccttcgccccgggctccggccacggacttttctggcccaagacccagggttcggac ttggcgcctccaagcgcctcgggcttgggagcagcgcctagaccttcgccgccgggcttt gagaactcgttcccccaggatcgccacgacgggagcagcaatgggaatccgcgggtcccc cacctctcctccgccgggcagcacctctacagccccgcgccacccctctcccacactgga gtcgccgaatatcagccgccaccctactttccccctccctaccagcagctggcctactcc cagtcggccgacccctactcgcatctgggggaagcgtacgccgccgccatcaaccccctg caccagccggcgcccacaggcagccagcagcaggcctggcccggccgccagagccaggag ggagcggggctgccctcgcaccacgggcgcccggccggcctactgccccacctctccggg ctggaggcgggcgcggtgagcgcccgcagggatgcctaccgccgctccgacctgctgctg ccccacgcacacgccctggatgccgcgggcctggccgagaacctggggctccacgacatg cctcaccagatggacgaggtgcaggtgagcggcgctgcggctcctgaccggacctgttca ccctacggccttctgcccccaccccgcactcctctaggctcccccgaacttaagggaatt ttgtcctctctcccccagaatgtcgacgaccagcacctgttgctgcacgatcagacagtc attcgcaaaggtcccatttccatgaccaagaaccctctgaacctcccctgtcagaaggag ctggtgggggccgtaatgaaccccactgaggtcttctgctcagtccctggaagattgtcg ctcctcagctctacgtctaaatacaaagtgacagtggctgaagtacagaggcgactgtcc ccacctgaatgcttaaatgcctcgttactgggaggtgttctcagaagagccaaatcgaaa aatggaggccggtccttgcgggagaagttggacaagattgggttgaatcttccggccggg aggcggaaagccgctcatgtgactctcctgacatccttagtagaaggtgaagctgttcat ttggctagggactttgcctatgtctgtgaagccgaatttcctagtaaaccagtggcagaa 
tatttaaccagacctcatcttggaggacgaaatgagatggcagctaggaagaacatgcta ttggcggcccagcaactgtgtaaagaattcacagaacttctcagccaagaccggacaccc catgggaccagcaggctcgccccagtcttggagacgaacatacagaactgcttgtctcat ttcagcctgattacccacgggtttggcagccaggccatctgtgccgcggtgtctgccctg cagaactacatcaaagaagccctgattgtcatagacaaatcctacatgaaccctggagac cagagtccagctgattctaacaaaaccctggagaaaatggagaaacacaggaaataa >gi568815578f:56529545_56738010|GENSCAN_predicted_peptide_2|229_aa MEGQKDLGSILKAELTELADELDGLRSRTLFWVLRTQQKEIGPQAVHPGEDPDSSHNGRV KCTECSQVCPVTRLHPCSVHSKARTGILTDPGGLKSPFSVGSYCQVMLPLCGSLAQVQNP DPVDTDASFYFGNLESECGYTGVLLYHHSITPNQDVPAWTPIVLILSSLEEEASPGAAVD PGSLPNDPGSCGPLVPQESGSCLEAPTKSWPGSRLPSQLAEDGAIWLRF >gi568815578f:56529545_56738010|GENSCAN_predicted_CDS_2|690_bp atggagggccagaaagacctgggaagtattttgaaggcggagctgacagaacttgctgat gagttggatgggctcaggagcaggacactgttctgggtgctgaggacacagcagaaagag attggccctcaggcggttcatcctggtgaggacccagacagctcacacaacggacgtgta aaatgcactgagtgttcccaggtgtgccctgtgacaaggcttcacccttgctctgttcac agcaaagcacgcacgggcatcctgactgatcctggaggtttgaaatcaccttttagtgtg ggctcttattgccaggtgatgctgcctctctgtggatccctggctcaggtgcagaatcct gacccagtggacactgacgcaagcttctattttggaaatctggaatctgaatgtggatac actggggttcttctgtaccatcactcaataactcccaaccaggatgtgcctgcctggacc cccatcgtgctcatcctgagcagcttggaggaagaggcctcccctggggcagcagtcgac ccagggtctctgccaaatgaccctggcagctgcggccccctcgtgccccaggagtcaggg tcctgtctggaggctccaaccaaaagctggcccgggtcccgactgccctctcagctggca gaggatggagccatttggctccgcttctga >gi568815578f:56529545_56738010|GENSCAN_predicted_peptide_3|454_aa MEETLPAFWVRDPSPEDLPLPPCLAVTPESKRGGGGSSLRQRSWPPSHRMLHSAMSPAIP LVRGPDPCGILRAGAKQIVLFHELGDKHVTQSNPKTHKETDTGIPGRFLLLPSSGIRKPM WPWSSWPPSCEMGKCGANTEERETSSFCSARALHPAFPKARFDSPLAYDSVTSSTERSER TLPEVKLFDQILRMLHNTEEGSGGDQKRKLMLVEALGKTSGMSTKWEHCYSKGLLLSLPL LITCVPHVRRSRAHQLLDHHQDLKEKEEVVEEVDNERDAPTNRNTNEGNGEEEADHEVDK EGKKKRKKVERKGEEEEVDGEEEDGDKDEEAEAAEGKLAAENDQDDNIVTKKQKTNEDDQ AAKKENSNFKKTLPASWVLYQVLSTDNLPSLYPETGSSKDEVTLPKSCSPKYQETFLILP DKVSSTGAAACLTFNDFSPKLPAGPVCHGGRPTP 
>gi568815578f:56529545_56738010|GENSCAN_predicted_CDS_3|1365_bp atggaggaaaccctcccagcattttgggtcagagaccctagccctgaagacctgcccctg cccccgtgtctggctgtgaccccagagagcaagcgaggaggaggaggctcatcactgcgc cagcggtcctggcccccgagccaccggatgctacattcggctatgtccccagcaatcccc ctggtccgtgggccagacccatgcggaatattgagagccggagccaagcagatcgttctg ttccatgagctcggggataagcacgtgacccagtctaacccaaagactcacaaggagact gacactgggatccccggaaggttcttgctgcttccttcttctggaatcaggaagcccatg tggccctggagctcctggccgccatcttgtgagatgggaaaatgtggggccaatacagag gagagagaaacctcatccttttgttcagctagagccctgcatccagctttccctaaagcc agatttgattcacctctggcttatgattctgtcacctcctccacagagagaagtgaaagg acactccccgaggtgaagctctttgaccagattttgcggatgctgcacaacactgaggaa gggagtgggggagaccagaaaaggaaactaatgctggtggaggctttaggaaaaacgtcg gggatgtccactaaatgggaacattgctattcgaagggccttctgttgtcgttgccactc ctgatcacctgtgtgccccatgtcagacgcagccgtgcacaccagctcctagatcaccac caagacttaaaggagaaggaggaagttgtggaggaggtggacaatgaaagagatgcccct actaacaggaatactaatgagggcaatggggaggaggaggctgaccatgaggtagacaaa gaagggaagaagaagaggaagaaagtggagaggaagggggaggaggaggaagttgatggt gaggaagaagatggagataaagatgaggaagctgaggctgctgagggcaaactggctgct gaaaatgatcaggatgacaatattgttaccaagaagcagaagaccaatgaggatgaccag gcagccaaaaaggaaaattcaaacttcaagaaaaccttgccggccagctgggtgctgtac caggtactttccacagacaaccttccgagcctttacccagaaactgggtcttcgaaagat gaagtgactttgccaaagtcatgcagccctaagtaccaggaaactttcctgatcctccca gacaaggtcagcagcaccggggccgctgcttgcctgacattcaacgacttttcacccaaa ctccccgcggggcctgtctgccacggtggacgtccaactccataa >gi568815578f:56529545_56738010|GENSCAN_predicted_peptide_4|74_aa MWTLIVSPYLGVTTVLDKIMPVKGVPVSEKMAITQRLALANVKLQTEISALKESLRLSPA QIGSTGFEGHVYSS >gi568815578f:56529545_56738010|GENSCAN_predicted_CDS_4|222_bp atgtggacgttaatagtatctccttacctcggggtcaccacggtattggataaaataatg ccagtaaaaggtgtcccagtatcagagaagatggccattactcaaaggcttgctctagca aatgtcaagctacagacagaaatcagcgctctcaaggagagcttgaggctgagtcccgca cagatcggcagcaccggttttgaaggccacgtgtactccagn