GENSCAN 1.0 Date run: 7-Nov-116 Time: 00:09:05 Sequence gi568815593f:62247147_62485566 : 238420 bp : 39.63% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 6290 6359 70 1 1 67 99 49 0.418 1.74 1.02 Intr + 6684 6868 185 1 2 83 83 49 0.490 2.49 1.03 Intr + 8006 8037 32 1 2 54 115 35 0.022 -1.19 1.04 Intr + 25471 25691 221 1 2 89 86 130 0.064 9.92 1.05 Intr + 35022 35155 134 1 2 -20 73 110 0.002 -1.86 1.06 Intr + 36236 36358 123 1 0 110 37 60 0.033 2.96 1.07 Intr + 45238 45400 163 0 1 52 71 110 0.456 4.33 1.08 Term + 47792 48417 626 0 2 52 42 307 0.714 16.06 1.09 PlyA + 49139 49144 6 1.05 2.00 Prom + 56273 56312 40 -4.15 2.01 Init + 56669 56712 44 1 2 61 80 18 0.238 -1.76 2.02 Intr + 58556 58616 61 2 1 96 95 14 0.294 0.72 2.03 Intr + 59171 59390 220 1 1 108 89 132 0.701 12.25 2.04 Term + 59937 60028 92 1 2 67 49 92 0.793 0.10 2.05 PlyA + 60360 60365 6 1.05 3.00 Prom + 77596 77635 40 -3.35 3.01 Init + 78165 78213 49 2 1 86 58 63 0.328 2.26 3.02 Intr + 86338 86578 241 2 1 47 32 159 0.454 1.89 3.03 Intr + 87502 87707 206 1 2 106 94 125 0.539 12.82 3.04 Intr + 99984 100078 95 1 2 70 94 67 0.039 4.26 3.05 Intr + 100902 101021 120 2 0 36 96 113 0.854 6.67 3.06 Intr + 102920 102974 55 1 1 77 88 33 0.955 -0.17 3.07 Intr + 105442 105564 123 2 0 86 36 95 0.919 3.74 3.08 Intr + 106129 106229 101 2 2 82 79 105 0.969 7.91 3.09 Intr + 108013 108108 96 0 0 93 58 65 0.826 3.29 3.10 Intr + 110545 110599 55 0 1 72 86 -14 0.893 -5.57 3.11 Intr + 110991 111153 163 1 1 64 96 93 0.941 5.81 3.12 Intr + 114096 114186 91 0 1 84 103 67 0.974 6.88 3.13 Intr + 116032 116174 143 0 2 65 99 75 0.997 4.53 3.14 Intr + 116549 116753 205 2 1 31 94 125 0.990 5.68 3.15 Intr + 118097 118207 111 1 0 88 115 85 0.999 10.86 3.16 Intr + 119268 119335 68 2 2 99 52 14 0.597 -4.32 3.17 Intr + 126541 126691 151 1 1 64 82 177 0.936 13.94 3.18 Intr + 130515 130616 102 2 0 65 103 91 0.994 7.75 3.19 Intr + 133972 134107 136 0 1 78 43 176 0.894 11.32 3.20 Term + 138338 138423 86 0 2 81 40 91 0.804 0.44 3.21 PlyA + 139215 139220 6 1.05 4.12 PlyA - 139829 139824 6 1.05 4.11 Term - 143673 143613 61 2 1 49 42 89 0.743 -3.30 4.10 Intr - 143836 143730 107 1 2 52 91 126 0.903 7.39 4.09 Intr - 146901 146809 93 0 0 85 94 84 0.990 7.94 4.08 Intr - 147461 147338 124 1 1 127 106 61 0.997 11.47 4.07 Intr - 149781 149646 136 1 1 -12 35 102 0.227 -6.39 4.06 Intr - 151364 151283 82 2 1 83 82 73 0.252 4.49 4.05 Intr - 151593 151500 94 2 1 91 91 56 0.941 5.25 4.04 Intr - 151735 151674 62 1 2 83 94 39 0.915 0.61 4.03 Intr - 154976 154890 87 2 0 53 70 76 0.719 1.55 4.02 Intr - 156200 156127 74 0 2 105 98 9 0.849 1.81 4.01 Init - 156626 156548 79 2 1 101 64 119 0.446 10.11 4.00 Prom - 161135 161096 40 -4.25 5.04 PlyA - 161160 161155 6 1.05 5.03 Term - 165941 165310 632 0 2 52 37 233 0.490 8.09 5.02 Intr - 180379 180276 104 0 2 101 18 70 0.054 0.20 5.01 Init - 185271 185114 158 0 2 73 47 124 0.232 6.23 5.00 Prom - 186654 186615 40 -5.85 6.00 Prom + 188049 188088 40 -5.75 6.01 Init + 190134 190271 138 2 0 49 119 95 0.504 8.89 6.02 Intr + 195837 195937 101 2 2 82 89 41 0.953 1.59 6.03 Intr + 202781 202853 73 2 1 108 92 63 0.996 7.19 6.04 Intr + 204584 204787 204 1 0 102 95 150 0.827 15.57 6.05 Intr + 218852 218959 108 1 0 49 89 57 0.051 1.46 6.06 Intr + 219985 220117 133 0 1 107 74 21 0.072 1.90 6.07 Intr + 223104 223162 59 1 2 77 115 52 0.045 4.48 6.08 Intr + 235955 236147 193 1 1 94 86 71 0.008 5.64 6.09 Term + 236864 237027 164 0 2 84 28 72 0.008 -2.08 6.10 PlyA + 237523 237528 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 35038 35155 118 0 1 88 73 107 0.901 9.61 S.002 Init + 100001 100078 78 1 0 51 94 73 0.899 5.31 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:62247147_62485566|GENSCAN_predicted_peptide_1|517_aa SSKCTAILYCPVQHLQALSSMQGECIGIDTNKWKHHYKAPAHTSEFRLHYLLREVFLNHS LLESHLDDLLCACISSVHTSVSRGVGGPSSKALPPRLCPYIPRPRDIIKKPDSVKRPAEI LGAGSGMFGQEIKGSLTPKTWSRNWKAGSSKHAPIAPQTSHPMGKDRLEAWSQETWKSRE KNEQAVPIGSGGGWNMFLPLSAHADTNGQKTLMLECKTGMRNDHLSLGDQSINANFRTVS ATCGLQTRKLICIEKPPLRIPRQTWTGLDLQQTPTDLQLRVLTVRRKTNKQKGHPHQKPI CTSPSSKTKEIQSTIREYYKHLYVNKLENLEEMDKFLDTYTLPRQNQEEVESLNRPITGS EIEAIINSLPTKKSPGPDGFTAEFYQRYKEELLPFLLKLFQSIEKEGILPNSFYEASIIL IPKPGRDTTKKENFTPISLMNITAKMLNEILANRIQQHIKKLIHHDQMGFIPGMQGWFNL CKSVNIIQHINRINDKNHMIISIDAEKVFDKIQQTSC >gi568815593f:62247147_62485566|GENSCAN_predicted_CDS_1|1554_bp tcttcaaagtgcacagccatcctttactgccccgtgcagcaccttcaggccttgagctct atgcagggagaatgcatcgggatagacacaaataaatggaaacaccattacaaagcccct gctcacacttcagaattcagattgcattacctcctccgggaagtcttccttaatcactct ctgctcgagtctcatttagatgatctcctgtgtgcttgcatatcgtctgtgcatacctct gtgtcgagaggagtgggtggcccctcttcaaaggctctgccccccaggctctgtccatat atccctaggcctagggatattatcaagaaaccagactcagtaaagaggcctgcagagatc cttggagcaggctcagggatgtttgggcaggaaattaaagggtccttgacacccaaaaca tggtcaagaaattggaaagcaggctccagtaagcatgcccccatcgccccgcagacttct caccctatggggaaagataggctggaagcctggagccaggagacatggaagtccagggag aaaaacgagcaagccgtcccaataggcagtggaggaggttggaatatgtttcttcctctt tcagcccacgcagacacaaatgggcagaaaacactgatgttggaatgtaaaactggaatg agaaacgatcatctctctctaggtgaccagtctataaatgccaacttcagaactgtttct gctacgtgtggactgcaaacaagaaagctgatctgcattgagaaacctccgctgcggata cccaggcaaacatggactggactggacctccagcaaactccaacagacctgcagctgagg gtcctgactgttagaaggaaaactaacaaacagaaaggacatccacaccaaaaacccatc tgtacgtcaccatcatcaaagaccaaagaaatacaatctaccatcagagaatactataaa cacctctacgtaaataaactagaaaatctagaagaaatggataaattcctcgacacatac acgctcccaagacaaaaccaggaagaagttgaatctctgaatagaccaataacaggctct gaaattgaggcaataattaatagcttaccaaccaaaaaaagtccaggaccagatggattc acagccgaattctaccagaggtacaaggaggagctgttaccattccttctgaaactattc caatcaatagaaaaagagggaatcctccctaactcattttatgaggccagcatcatcctg ataccaaagcctggcagagacacaacaaaaaaagagaattttacaccaatatccctgatg aacatcactgcaaaaatgctcaatgaaatattggcaaaccgaatccagcagcacatcaaa aagcttatccaccatgatcaaatgggcttcatccctgggatgcaaggctggttcaaccta tgcaaatcagtaaacataatccagcatataaacagaatcaatgacaaaaaccacatgatt atctcaatagatgcagaaaaggtctttgacaaaattcaacaaacttcatgctaa >gi568815593f:62247147_62485566|GENSCAN_predicted_peptide_2|138_aa MKVLVNTYADVYQKNVTGWETYTKCAWEGCLGQLPLLLAAPLAFTLSPGPARPRATAPPP TPTPPPPRLFRPPVPLPRPAAAAPDEVMATANFGKIQIGIYVEIKRSDGLPGFVVDHAAF FFATSWEKNVENIALSHE >gi568815593f:62247147_62485566|GENSCAN_predicted_CDS_2|417_bp atgaaagtcttggtaaatacatatgctgatgtgtaccaaaaaaacgtaactggatgggaa acctacaccaagtgtgcttgggagggctgcttgggtcagctacctctgctccttgcggcc ccgcttgcgttcacgctgtcgcccgggccggcgcggccgcgggcaaccgctccccctccc acacctaccccgccccctccccgccttttccgccctccggtccccctccctcggcccgct gctgctgctccagatgaggtgatggcaacggccaacttcggcaagatccagatcgggatt tacgtggagatcaagcgcagcgatggcctccccggtttcgtcgtcgaccatgctgctttc ttttttgccacttcttgggagaaaaatgtggagaatatcgcgctgagccatgaataa >gi568815593f:62247147_62485566|GENSCAN_predicted_peptide_3|798_aa MGFRHVGQAGLELLTSALAFNLCKFSEITLQKQCQAEDTVALEEMIMDISSTQLHTQKLQ LPYTRSSRYGGSLPTPYHTDSSPYSPAYLSPPQVSSCPLSLLTGPADARRSQQQLPKQFL PVSPTLSSIVQGVPLETSNLHTQPHTPKSLQQPELPSQACSAQPSGRIHQAMVTSLNEDN ESVTVEWIENGDTKGKEIDLESIFSLNPDLVPDEEIEPSPETPPPPASSAKVNKIVKNRR TVASIKNDPPSRDNRVVGSARARPSQFPEQSSSAQQNGSVSDISPVQAAKKEFGPPSRRK SNCVKEVEKLQEKREKRRLQQQELREKRAQDVDATNPNYEIMCMIRDFRGSLDYRPLTTA DPIDEHRICVCVRKRPLNKKETQMKDLDVITIPSKDVVMVHEPKQKVDLTRYLENQTFRF DYAFDDSAPNEMVYRFTARPLVETIFERGMATCFAYGQTGSGKTHVFDLLNRKTKLRVLE DGKQQVQVVGLQEREVKCVEDVLKLIDIGNSCRTSGQTSANAHSSRSHAVFQIILRRKGK LHGKFSLIDLAGNERGADTSSADRQTRLEGAEINKSLLALKECIRALGRNKPHTPFRASK LTQVLRDSFIGENSRTCMIATISPGMASCENTLNTLRYANRVKELTVDPTAAGDVRPIMH HPPNQIDDLETQWGVGSSPQRDDLKLLCEQNEEEVSPQLFTFHEAVSQMVEMEEQVVEDH RAVFQESIRWLEDEKALLEMTEEVDYDVDSYATQLEAILEQKIDILTELRDKVKSFRAAL QEEEQASKQINPKRPRAL >gi568815593f:62247147_62485566|GENSCAN_predicted_CDS_3|2397_bp atggggtttcgccatgttggccaggctggtctcgaactcctgacctcagccttggctttc aatctgtgcaagtttagtgagatcacgctgcagaagcagtgtcaggctgaggacacagtg gccctcgaggaaatgataatggacatcagctccacccagttacacacccaaaaactgcaa ctgccatacacaaggagctcccgttatggtggttctctgccaacaccctaccacactgac agctctccctatagtcctgcctacttatctcctccccaagtgtccagctgccccctaagt ttgctcacaggtccagccgatgccagaaggtcgcaacagcagctacccaaacagtttttg ccagtgtcacccaccctgtcttccatcgttcagggtgtccccctggagaccagtaatctg cacacccagccacacaccccaaagtctctacagcagccagagctgccctctcaggcctgc tcagcgcagccctcaggccgaatacatcaagcaatggtaacatctttaaatgaagataat gaaagtgtaactgttgaatggatagaaaatggagatacaaaaggcaaagagattgacctg gagagcatcttttcacttaaccctgaccttgttcctgatgaagaaattgaacccagtcca gaaacacctccacctccagcatcctcagccaaagtaaacaaaattgtaaagaatcgacgg actgtagcttctattaagaatgaccctccttcaagagataatagagtggttggttcagca cgtgcacggcccagtcaatttcctgaacagtcttcctctgcacaacagaatggtagtgtt tcagatatatctccagttcaagctgcaaaaaaggaatttggacccccttcacgtagaaaa tctaattgtgtgaaagaagtagaaaaactgcaagaaaaacgagagaaaaggagattgcaa cagcaagaacttagagaaaaaagagcccaggacgttgatgctacaaacccaaattatgaa attatgtgtatgatcagagactttagaggaagtttggattatagaccattaacaacagca gatcctattgatgaacataggatatgtgtgtgtgtaagaaaacgaccactcaataaaaaa gaaactcaaatgaaagatcttgatgtaatcacaattcctagtaaagatgttgtgatggta catgaaccaaaacaaaaagtagatttaacaaggtacctagaaaaccaaacatttcgtttt gattatgcctttgatgactcagctcctaatgaaatggtttacaggtttactgctagacca ctagtggaaactatatttgaaaggggaatggctacatgctttgcttatgggcagactgga agtggaaaaactcatgtgtttgacttgctaaacaggaaaacaaaattaagagttctagaa gatggaaaacagcaggttcaagtggtgggattacaggaacgggaggtcaaatgtgttgaa gatgtactgaaactcattgacataggcaacagttgcagaacatccggtcaaacatctgca aatgcacattcatctcggagccatgcagtgtttcagattattcttagaaggaaaggaaaa ctacatggcaaattttctctcattgatttggctggaaatgaaagaggagctgatacttcc agtgcggacaggcaaactaggcttgaaggtgctgaaattaataaaagccttttagcactc aaggagtgcatcagagccttaggtagaaataaacctcatactcctttccgtgcaagtaaa ctcactcaggtgttaagagattctttcataggtgaaaactctcgtacctgcatgattgcc acaatctctccaggaatggcatcctgtgaaaatactcttaatacattaagatatgcaaat agggtcaaagaattgactgtagatccaactgctgctggtgatgttcgtccaataatgcac catccaccaaaccagattgatgacttagagacacagtggggtgtggggagttcccctcag agagatgatctaaaacttctttgtgaacaaaatgaagaagaagtctctccacagttgttt actttccacgaagctgtttcacaaatggtagaaatggaagaacaagttgtagaagatcac agggcagtgttccaggaatctattcggtggttagaagatgaaaaggccctcttagagatg actgaagaagtagattatgatgtcgattcatatgctacacaacttgaagctattcttgag caaaaaatagacattttaactgaactgcgggataaagtgaaatctttccgtgcagctcta caagaggaggaacaagccagcaagcaaatcaacccgaagagaccccgtgccctttaa >gi568815593f:62247147_62485566|GENSCAN_predicted_peptide_4|332_aa MPKVKSGAIGRRRGRQEQRRELKSAGGLMFNTGIGQHILKNPLIINSIIDKAALRPTDVV LEVGPGTGNMTVKLLEKAKKVVACELDPRLVAELHKRVQGTPVASKLQVLVGDVLKTDLP FFDTCVANLPYQVCDKRLELLPNFERLTVQVVNIAKNESELQRIVMEMKYGFTSVILKTK QSESDAYQETEWLQSKQKQTSQEQRCAILMFQREFALRLVAKPGDKLYCRLSINTQLLAR VDHLMKVGKNNFRPPPKVESSVVRIEPKNPPPPINFQIIPEDFSIADKIQQILTSTGFSD KRARSMDIDDFIRLEILKAFKRYKVLPGRVIA >gi568815593f:62247147_62485566|GENSCAN_predicted_CDS_4|999_bp atgccgaaggtcaagtcgggggccatcggccgccgccgcgggcggcaggagcagcgccgg gagctgaagagcgctggaggactcatgttcaacacggggattgggcagcacattttgaaa aatcctctcattattaacagcattatcgataaggctgccttaagaccaactgatgtagtg ctggaagttggacctggaactggcaacatgactgtaaagttgttagaaaaggcaaaaaag gttgttgcttgtgaacttgacccaaggctagtagctgaacttcacaaaagagttcagggc acgcctgtggccagcaaacttcaagtactggtgggtgatgtgctgaaaacagatttgcca ttctttgatacttgtgtggcaaatttgccttatcaggtttgtgacaaaagactagagtta ctgccaaattttgaaagacttacagttcaggtggtgaacatagccaaaaatgagtcagaa ttacaaagaattgtaatggagatgaaatatggctttaccagtgtgatcctaaagacaaag cagagtgaaagtgatgcctaccaagaaacggagtggctccagtcaaagcaaaagcagacc agtcaagagcaaaggtgtgctatacttatgtttcaaagagaatttgccctccgactggtt gcaaaacctggagataagttatactgcagactctcaattaatacacagctgttggcacgt gtggaccatctaatgaaagtgggaaagaataacttcagaccaccgcccaaggtggaatcc agtgttgtaaggatagaacctaagaatccaccaccacccatcaattttcagataatacca gaagatttcagcatagcagataaaatacagcaaatcctaaccagcacaggttttagtgac aaacgggcccgttccatggacatagatgacttcatcagattggagattttaaaggctttt aagcggtataaggtgctacctgggagagttattgcatag >gi568815593f:62247147_62485566|GENSCAN_predicted_peptide_5|297_aa MGSQPASISHTPQLYAAEILYQANTARGLGLSSSIQAPLLRQKLYHRAAGKEYVAQFLAG HGLTDTGPWPEGWGHLPYDTSCRLSKTGCANSQVCRPPLPRYRYPSSPGSPTFLPPLGTR VPNPLPSTPARSVALGPGPHRLVRSGIARPSRSHFPTRSGVGWSLSPRCQTPRRRLLMPK AASRNTPCLASQLQTTRTRENYVTGDQRACDRQREKAIGERREGEGGALSLNQKARGIQI SCVRLRSALAQYKEPGVRTSRICHHGNHRSGFSDSQGAQLFPALIEASIRTISRFIF >gi568815593f:62247147_62485566|GENSCAN_predicted_CDS_5|894_bp atgggtagccagccagccagtatttctcatacaccccagctatatgctgcagaaattctg taccaggcaaacacagctagaggtctagggctctcttcttccatccaggcaccactcctg cggcagaagctctaccacagggcagcaggcaaagaatatgtggcccagttcctagcaggc cacggactgactgatactggtccatggcctgagggctggggacacctgccctatgacact tcatgcagactgtcaaaaactggttgcgcaaacagccaagtctgcaggccgcccttgcct cgctaccgttatccctcatctcctggtagccccactttcctcccgcctctcgggactcgg gtgcccaacccactcccttccactcccgcgcgctcggtggctctcgggccggggcctcac cgtttggttcggtcgggaattgcccggccctcccgctcccacttcccaacacgaagtggc gtcggttggtccctctccccacgctgccagaccccacgccgccgcctactgatgccgaaa gcggcttctaggaacacgccatgtttggcgtcgcagctccaaacgacgcggacgcgcgaa aactacgtcacaggagaccagcgcgcatgcgaccggcagagagagaaggcgataggcgaa cggcgggaaggggaaggaggggctttgtctttgaaccaaaaggcgcggggtatccaaatc agttgtgtgcgcttgcgcagtgcgcttgcgcagtataaagagccaggagtccggactagc cggatctgtcaccatggaaaccataggtctggtttctccgactcccagggagctcaattg tttcctgcgttgattgaagcttcaatcagaaccatttcacgctttatattctag >gi568815593f:62247147_62485566|GENSCAN_predicted_peptide_6|390_aa MDLNSASTVVLQVLTQATSQDTAVLKPAEEQLKQWETQPGFYSVLLNIFTNHTLDINVRW LAVLYFKHGIDRYWRRVAPHALSEEEKTTLRAGLITNFNEPINQIATQIAVLIAKVARLD CPRQWPELIPTLIESVKVQDDLRQHRALLTFYHVTKTLASKRLAADRKLFYDGSRAENLI PLQNLRGDEKLSHDSLMVQIKRRVGDGLLASGIYNFACSLWNHHTDTFLQEVSSGNEAAI LSSLERTLLSLKVLRKLTVNGFVEPHKNMEVMLLDFLDQHPFSFTPLIQRSLEFSVSYVF TEVGEGVTFERFIVQCMNLIKMIVKNYAYKPSKNFEDSSPETLEAHKIKMAFFTYPTLTE ICRRLVSHYFLLTEEELTMWEEDPEGFSKN >gi568815593f:62247147_62485566|GENSCAN_predicted_CDS_6|1173_bp atggatctcaatagtgccagcactgttgttcttcaggtgttaacacaggccaccagtcag gatactgctgtgttaaaaccagctgaggagcagttgaagcagtgggagacacagccaggt ttctattcagtgttgctgaatattttcaccaaccacactttggatataaatgtaaggtgg cttgctgtactgtattttaaacatggaattgatcgctactggagacgtgtagcacctcat gctctctcagaggaggagaaaactactctgcgtgcagggctcatcaccaacttcaatgaa ccaataaaccagattgcaactcagattgcagtgctcattgcaaaagttgctagattggat tgtcccagacagtggcctgaactaattcccactcttatagagtctgttaaagtccaggat gatcttcgacagcacagagcattacttaccttctatcatgttaccaagacactggcatct aaacgacttgctgctgatagaaaactattttatgatgggagtagggcagaaaaccttata cctcttcagaacctgagaggtgatgagaaactgtctcatgatagcctaatggtgcagata aagagacgagtaggagatggtcttttagcttctggaatttataattttgcctgctctctg tggaatcaccacacagacacattcctgcaagaagtttcttctggcaatgaagctgcaatt ttgagttcactagaacgaacactgctatcattgaaagtgctgcgtaagttaactgttaat ggatttgtggaacctcataagaatatggaggtgatgcttttggacttcttggatcagcat cctttttcatttactcctctaattcagagatcactggaattttctgtaagctatgttttt acagaagttggtgaaggcgttacatttgaacgattcattgtccaatgtatgaatcttatt aagatgattgtcaaaaattatgcttataagccatccaaaaattttgaagatagcagccct gaaactcttgaagcccataagattaagatggcattcttcacatatcctactttgacagag atatgtagaagattagtctctcattatttcctattaactgaagaagaactgacaatgtgg gaagaagacccagaaggctttagtaagaattaa