GENSCAN 1.0    Date run: 4-Nov-116    Time: 22:33:20

Sequence gi568815586f:70178227_70453912 : 275686 bp : 37.39% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -    927    922    6                               1.05
 1.03 Term -   2061   1950  112  2  1  109   49    61 0.280   1.35
 1.02 Intr -   4439   4356   84  1  0   69   66    72 0.194   1.12
 1.01 Init -  25356  24986  371  0  2   68   22   162 0.015   4.11
 1.00 Prom -  27965  27926   40                              -5.75

 2.04 PlyA -  28623  28618    6                               1.05
 2.03 Term -  36416  36301  116  0  2   59   52    84 0.841  -0.35
 2.02 Intr -  36828  36601  228  1  0   83   50   158 0.104   8.42
 2.01 Init -  51528  51387  142  0  1   52   71   108 0.805   5.84
 2.00 Prom -  59923  59884   40                              -6.45

 3.00 Prom +  61018  61057   40                              -6.35
 3.01 Init +  64554  64706  153  2  0   71   43   170 0.497  10.84
 3.02 Intr +  64865  64980  116  1  2   70   76   194 0.999  14.73
 3.03 Intr +  65107  65254  148  1  1   79  101   103 0.799  10.02
 3.04 Intr +  65546  65786  241  1  1   35   28   361 0.641  21.00
 3.05 Intr +  65803  65963  161  2  2   31   67   129 0.054   3.89
 3.06 Term +  93990  94181  192  2  0   62   44   119 0.003   1.34
 3.07 PlyA +  97500  97505    6                               1.05

 4.04 PlyA -  97624  97619    6                               1.05
 4.03 Term -  99998  99861  138  2  0   51   36   102 0.037  -1.72
 4.02 Intr - 114286 114220   67  0  1  117   61    41 0.092   2.29
 4.01 Init - 123255 122375  881  0  2   58   39   388 0.436  24.89
 4.00 Prom - 123552 123513   40                              -7.65

 5.02 PlyA - 123775 123770    6                               1.05
 5.01 Sngl - 125031 123952 1080  0  0   56   39   258 0.860  14.53
 5.00 Prom - 129637 129598   40                              -4.65

 6.00 Prom + 132185 132224   40                              -4.25
 6.01 Init + 132681 132791  111  2  0   47   95    93 0.782   6.16
 6.02 Intr + 141072 141138   67  2  1   77   69    56 0.114   0.16
 6.03 Intr + 151197 151344  148  1  1   66   68    51 0.066  -0.73
 6.04 Intr + 152061 152243  183  0  0   47  110    42 0.098   0.28
 6.05 Intr + 154541 154620   80  2  2   51  115    41 0.406   1.38
 6.06 Intr + 157212 157337  126  1  0   71   46    69 0.456   0.73
 6.07 Intr + 159163 159287  125  2  2   31   84   106 0.759   3.88
 6.08 Intr + 160217 160337  121  1  1   70   44   101 0.989   3.05
 6.09 Intr + 160440 160596  157  1  1   91   80   109 0.765   8.55
 6.10 Intr + 163881 163942   62  0  2   84   63    19 0.229  -3.54
 6.11 Intr + 164032 164081   50  2  2   72  113    22 0.376   0.58
 6.12 Term + 176491 176667  177  0  0   53   47   107 0.386  -0.20
 6.13 PlyA + 176744 176749    6                               1.05

 7.00 Prom + 183501 183540   40                              -2.25
 7.01 Init + 188509 188844  336  0  0  102  116   534 0.977  54.92
 7.02 Term + 189097 189204  108  0  0   83   44    65 0.812  -0.87
 7.03 PlyA + 190059 190064    6                               1.05

 8.03 PlyA - 190603 190598    6                               1.05
 8.02 Term - 230428 230239  190  0  1   61   38   151 0.941   3.34
 8.01 Init - 232161 232112   50  0  2  106   81    32 0.934   4.77
 8.00 Prom - 234438 234399   40                              -5.65

 9.07 PlyA - 236524 236519    6                               1.05
 9.06 Term - 238854 238756   99  0  0   82   49    72 0.633  -0.15
 9.05 Intr - 239169 239046  124  2  1  106   66    81 0.568   7.37
 9.04 Intr - 252422 252261  162  1  0   62   18   203 0.092   8.87
 9.03 Intr - 269370 269184  187  1  1   90   55   109 0.304   5.63
 9.02 Intr - 270048 269768  281  2  2   74   85   254 0.363  19.90
 9.01 Init - 271311 270833  479  0  2   73   -2   216 0.098   6.03

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------
S.001 Sngl -  25356  24982  375  0  0   68   50   158 0.908   5.89
S.002 Term +  65803  65978  176  2  2   31   36   155 0.832   1.54

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586f:70178227_70453912|GENSCAN_predicted_peptide_1|188_aa
MTLNEHAALKHLFNKAHPAPPLIHLTLSGHSTCFREHRVGDKVTDQQDPKAEEFFLVQNK
MKSLPCLLLSTETRQPSDFSIFSPPFPPFYSTKPPLSSWPIPNEPLGTPPRRGRGRAEGL
LTSQDVDEAGDHYSEQTITRTENQTLQVLTHRPSANWMLLIHGGGWIFLTESTNSNANLF
QKQLHRHT
>gi568815586f:70178227_70453912|GENSCAN_predicted_CDS_1|567_bp
atgactctcaacgagcatgctgccctcaagcatctgttcaacaaagcacatcctgcaccg
cccttaatccatttaaccctgagtggacacagcacatgtttcagagagcacagggttggg
gataaggtcacagatcaacaggatcccaaggcagaagaatttttcttagtacagaacaaa
atgaaaagtctcccatgtctacttctatccacagagacccggcaaccatccgatttctca
attttttccccacccttcccgcctttctattccacaaaaccgccattgtcatcatggccc
atccccaatgagccgctgggcacacctcccagacggggtcgtggccgggcagaggggctc
ctcacttcccaggacgtggatgaagctggagaccattattctgagcaaactatcacaagg
acagaaaaccaaacactgcaagttctcactcataggccctcagccaattggatgttgctt
atccatgggggagggtggatcttccttactgagtccactaattcaaatgccaatctcttc
cagaaacagctgcacagacatacttag
>gi568815586f:70178227_70453912|GENSCAN_predicted_peptide_2|161_aa
MTNIRKLHNGKPLQAPSKKPHYRSSFWILSHRGYQNPVNEDCILTQSNSGSSPILAPSQL
PWSQAPDCPLWPYAPEYLGSRPTPVEFDTSPALKDPSSRLDPINPRPKPARMDSASRPTT
MDPGPISRPNPVESSTRLASVNPRSKPNHLDSGSTPAPLDP
>gi568815586f:70178227_70453912|GENSCAN_predicted_CDS_2|486_bp
atgaccaacatcaggaaactacataatgggaagcccctgcaggctccctcaaagaagcct
cattatagatctagcttttggatactgtcacacagaggttatcaaaatccagttaatgag
gactgcatccttacccagtcaaactcaggctctagtcccatcctagcaccaagccagctc
ccatggtcccaggctccagactgccccctgtggccctatgctccagaatatctaggttcc
aggcccaccccagtagaatttgatactagtccagccctcaaagacccaagctccagactg
gaccccatcaacccaagaccaaagcctgcccgcatggactcagcctccaggcccactacc
atggacccaggcccaatctctaggcccaacccagtggagtccagcactaggctggcctct
gtgaacccaagatccaagcctaaccacctagattcgggttcaacgcctgccccactggac
ccatga
>gi568815586f:70178227_70453912|GENSCAN_predicted_peptide_3|336_aa
MVSKHTSGFIRGELTTQEVTVTWCRSLKYAHKLRRLRELREETGTRSDSLQSPSDSGPEP
SAKQPSPRETVAAAEAPLSVTAAEEQQASLRRSHLPLPPGGKPERRGSRRRSEGKGLAPS
SLRGGYVAILDPRDKKIHAQQQLQILGPVPSASGGASPDSTPLGSRQRGGAAAAALLPGG
EEKAAAAVAVADELEPALDGSAAFPAAGAVCGFLPSLNTLMCISVSVFASVRWETLDWLT
GVRILQALGYRRLCMVEGEGMQLRCRLCHRTRKVLVITVISLIDKVLLLKELQGYRQKTN
ENKIQTVGCSVKAFLKRDSQADMLTSRSSLAFGDLG
>gi568815586f:70178227_70453912|GENSCAN_predicted_CDS_3|1011_bp
atggtgagtaaacacaccagtggtttcatcagaggggaactcactactcaggaggtgacg
gtgacgtggtgccggtccctgaagtacgcgcacaagctccggaggttgcgggagctgagg
gaggaaacagggacacgatcagattcgctccagtccccctcggactcagggccagagcct
tctgcgaagcaacctagccccagggaaacggtagcggccgcagaagccccgctctctgta
acggccgcggaggagcagcaagcctcccttcgtcgctcccaccttccgctgccgcctgga
gggaagccggagcgacgggggtcacggcggcggtcagagggtaaaggtcttgctcccagc
agcctccgcggtggatacgtcgccatcttggatccgcgggacaagaaaattcatgcgcag
cagcagctccagatcctaggcccggtcccatccgcgagtggtggggcttccccagactcc
actcccctaggctcccgccagcgcggaggagccgctgccgccgcgctgcttcctggggga
gaagagaaggcggcagcggccgtggccgtggccgacgagctcgagcccgctttagacggc
tctgccgccttcccggcagctggtgctgtctgtggcttcttaccttctctcaatacactt
atgtgtatttctgttagtgtgtttgcaagtgtcagatgggagacactggactggctaacg
ggtgtcaggatcctgcaggccctgggataccgaaggctgtgcatggtggagggagaagga
atgcaactgcgatgccgcctctgccaccgcaccaggaaggtacttgtgataacagtgata
agcttaatagacaaggttcttctgcttaaggaacttcagggatacagacagaaaacaaat
gagaacaaaattcagactgttgggtgttcagtgaaggcctttctgaaaagagatagtcaa
gcagacatgttaacgtcaagaagcagtttggcatttggagatttggggtaa
>gi568815586f:70178227_70453912|GENSCAN_predicted_peptide_4|361_aa
MIISIDAEKAFDKIQQLFMLKTLNKLGIDGTYLKIIRAIYDKPTASITVNVQKLEAFPLK
TGTREGCPLSPLLFNTVLEVLARAIRQEKEIKGIQLGKEEVKLTLFAGDMTVYLENSSIS
AQNLLKLIINFSKVSGYKINVQKSQAFLYTNNRQTESQIMKELPFTIASKRIKYLGIQLT
RDVKDLFKENYKPLLNEIKEDTNKWKNIPCSYVGRINIVKMAILPKVIYRFNAIPIKLPT
TFFTELEKTTLKFIWHQKRALIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWNALPLD
IQMASYFTYFRCEVISSRVLSSQYRRQSNRTVTQGSCHRKDRPPRLPLEWNYPKAIKITN
M
>gi568815586f:70178227_70453912|GENSCAN_predicted_CDS_4|1086_bp
atgattatctcaatagatgcagaaaaggcctttgacaaaattcaacagctcttcatgcta
aaaactctcaataaattaggtattgatgggacatatctcaaaataataagagctatctat
gacaaacccacagccagtatcacagtgaatgtacaaaaactggaagcattccctttgaaa
actggcacaagagaaggatgccctctctcaccactcctattcaacacagtgttggaagtt
ctggccagggcaatcaggcaggagaaggaaataaagggcattcaattaggaaaagaggaa
gtcaaattgaccctgtttgcaggtgacatgactgtatatctagaaaactccagcatctca
gcccaaaatctccttaagctgataatcaacttcagcaaagtctcgggatacaagatcaat
gtgcaaaaatcacaagcattcttatacaccaataacagacaaacagagagccaaatcatg
aaagaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttaca
agggatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaagag
gatacaaacaaatggaagaacattccatgctcatacgtaggaagaatcaatatcgtgaaa
atggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaacg
actttcttcacagaattggaaaaaactactttaaagttcatatggcaccaaaaaagggcc
ctcattgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacgctacctgac
ttcaaactatactacaaggctacagtaaccaaaacagcatggaatgctcttcctctagac
atccagatggcttcctacttcacctacttcaggtgtgaagtcatctcaagtcgtgtcctt
tcctcacagtaccggaggcaatcaaatagaactgtcactcaagggtcgtgtcacaggaag
gaccgcccaccacgtctccctctagagtggaattatccaaaagccatcaaaattactaac
atgtaa
>gi568815586f:70178227_70453912|GENSCAN_predicted_peptide_5|359_aa
MTGSNSHITILTLNVNGLYAPIKRHRLANWIKSQDPSVCCIQETHLMCRDTHRLKIKGWR
KIYQANGKQKKAGVAILVSDKTDFKPTKIKRDKEGHYTMVKGSIQQEELTILNIYTPNTG
APKFIKQVLSDLQRDLDSHTIIMGDFNTPLSTLDRSTRQKVNKDIQELNSALHQVDLIDI
YRTLHPKSTEYTFFSAPHHTYSKVDHIVGSKALLSKCKRSEIITNCLSDHSAIKLELGIK
KLTQNLSTTWKLNNLLLNDYWVHNEMKAEIKMFFETNENKDTTYQNLWDTFKAVCRGKFI
ALNAHKRKQERSKIDTLTSQLKELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKN
>gi568815586f:70178227_70453912|GENSCAN_predicted_CDS_5|1080_bp
atgacaggatcaaattcacacataacaatactaaccttaaatgtgaacgggctatatgct
ccaattaaaaggcacagactggcaaattggataaagagtcaagacccatcagtgtgctgt
attcaggaaacccatctcatgtgcagagacacacataggctcaaaataaagggatggagg
aagatctaccaagcaaatggaaaacaaaaaaaggcaggggttgcaatcctagtctctgat
aaaacagactttaaaccaacaaagatcaaaagagacaaagaaggccattacacaatggta
aagggatcaattcaacaagaagaactaactatcctaaatatatatacacccaatacagga
gcacccaaattcataaagcaagtccttagtgacctacaaagagacttagactcccacaca
ataataatgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaa
gttaacaaggatatccaggaattgaactcagctctgcaccaagtggacctaatagacatc
tacagaactctccaccccaaatcaacagaatatacattcttttcagcaccacaccacacc
tattccaaagttgaccacatagttggaagtaaagcactcctcagcaaatgtaaaagatca
gaaattataacaaactgtctctcagaccacagtgcaatcaaactagaactcgggattaag
aaactcactcaaaacctctcaactacatggaaactgaacaacctgctcctgaatgactac
tgggtacataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaa
gacacaacataccagaatctctgggacacattcaaagcagtgtgtagagggaaatttata
gcactaaatgcccacaagagaaagcaggaaagatctaaaattgacaccctaacatcacaa
ttaaaagaactagagaagcaagagcaaacacattcaaaagctagcagaaggcaagaaata
actaaaatcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaaattaa
>gi568815586f:70178227_70453912|GENSCAN_predicted_peptide_6|468_aa
MFGASRKKFVEGVDSDYHDENMYYSQSSMFPHRSEKDMLASPSTSGQLSQFGASLYGQQS
ALGLPMRGMSNNTPQLNRSLSQGTQLPSHVTPTTGVPTMSLHTPPSPSRGILPMNPRNMM
NHSQVGQGIGIPSRTNSMSSSGLGSPNRSSPSIICMPKQQPSRQPFTVNSMSGFGMNRNQ
AFGMNNSLSSNIFNGTDGSENVTGLDLSDFPALADRNRREGSGNPTPLINPLAGRAPYVG
MVTKPANEQSQDFSIHNEDFPALPGSSYKDPTSSNDDSKSNLNTSGKTTSSTDGPKFPGD
KSSTTQNNNQQKKGIQVLPDGRVTNIPQGMVTDQFGMIGLLTFIRAAETDPGMVHLALGS
DLTTLGLNLNSPENLYPKFASPWASSPCRPQDIDFHVPSEYLTNIHIRDKEVCMDAIYEN
GTFWNCWSGDFVALCTKRARFSAAKDIHTQVNVMGLKRSELRQFTLAV
>gi568815586f:70178227_70453912|GENSCAN_predicted_CDS_6|1407_bp
atgtttggtgcttcaagaaagaagtttgtagagggggtcgacagtgactaccatgacgaa
aacatgtactacagccagtcttctatgtttccacatcggtcagaaaaagatatgctggca
tcaccatctacatcaggtcagctgtctcagtttggggcaagtttatacgggcaacaaagt
gcactaggccttccaatgagggggatgagcaacaatacccctcagttaaatcgcagctta
tcacaaggcactcagttaccgagccacgtcacgccaacaacaggggtaccaacaatgtca
cttcacacgcctccatctccaagcaggggtattttgcctatgaatcctaggaatatgatg
aaccactcccaggttggtcagggcattggaattcctagcaggacaaatagcatgagcagt
tcagggttaggtagccccaacagaagctcgccaagcataatatgtatgccaaagcagcag
ccttctcgacagccttttactgtgaacagtatgtctggatttggaatgaacaggaatcag
gcatttggaatgaataactccttatcaagtaacatttttaatggaacagacggaagtgaa
aatgtgacaggattggacctttcagatttcccagcattagcagaccgaaacaggagggaa
ggaagtggtaacccaactccattaataaaccccttggctggaagagctccttatgttgga
atggtaacaaaaccagcaaatgaacaatcccaggacttctcaatacacaatgaagatttt
ccagcattaccaggctccagctataaagatccaacatcaagtaatgatgacagtaaatct
aatttgaatacatctggcaagacaacttcaagtacagatggacccaaattccctggagat
aaaagttcaacaacacaaaataataaccagcagaaaaaagggatccaggtgttacctgat
ggtcgggttactaacattcctcaagggatggtgacggaccaatttggaatgattggcctg
ttaacatttatcagggcagcagagacagacccaggaatggtacatcttgcattaggaagt
gacttaacaacattaggcctcaatctgaactctcctgaaaatctctaccccaaatttgcg
tcaccctgggcatcttcaccttgtcgacctcaagacatagacttccatgttccatctgag
tacttaacgaacattcacattagggataaggaggtgtgcatggatgcaatatatgaaaat
gggacattctggaactgctggtcaggggactttgtcgccctgtgcactaaaagggccaga
ttttcagcagccaaggacatccatacccaagtgaatgtgatgggacttaaaagaagtgaa
ctgagacaattcactctggctgtttga
>gi568815586f:70178227_70453912|GENSCAN_predicted_peptide_7|147_aa
MAKLRVAYEYTEAEDKSIRLGLFLIISGVVSLFIFGFCWLSPALQDLQATEANCTVLSVQ
QIGEVFECTFTCGADCRGTSQYPCVQVYVNNSESNSRALLHSDEHQLLTNPKILRPRPSP
GPAPTGLGLEKCKEPRKLTFPPFKASG
>gi568815586f:70178227_70453912|GENSCAN_predicted_CDS_7|444_bp
atggcgaagctccgggtggcttacgagtacacggaagccgaggacaagagcatccggctc
ggcttgtttctcatcatctccggcgtcgtgtcgctcttcatcttcggcttctgctggctg
agtcccgcgctgcaggatctgcaagccacggaggccaattgcacggtgctgtcggtgcag
cagatcggcgaggtgttcgagtgcaccttcacctgtggcgccgactgcaggggcacctcg
cagtacccctgcgtccaggtctacgtgaacaactctgagtccaactctagggcgctgctg
cacagcgacgagcaccagctcctgaccaaccccaagattttgaggcctcgacccagtccg
ggtcctgcgcctaccggactgggtttggagaagtgcaaggagccaagaaagttaactttt
cctccatttaaagccagtggttag
>gi568815586f:70178227_70453912|GENSCAN_predicted_peptide_8|79_aa
MAVWDCRRGSNELYLKSDASAESFLGVERDGRTETDCPRQAQLLRGCFGRCRLNTGIQLA
STGLNQWKAKTPLIFVLDF
>gi568815586f:70178227_70453912|GENSCAN_predicted_CDS_8|240_bp
atggcagtgtgggactgcagaagaggaagtaatgaattatacctgaaaagtgacgcctct
gcagaatccttcctgggtgtagagagggatggccgcaccgagactgactgcccacggcag
gcacagctgcttcggggctgcttcggccgctgcagactcaacactggaatacagcttgcc
tcaacaggacttaaccagtggaaagctaagactcctctaatctttgtcttagatttttaa
>gi568815586f:70178227_70453912|GENSCAN_predicted_peptide_9|443_aa
MSQELTKEQKVFYKMVQQLLKAIQCTVESEALHKLMLLIWQECPWLHDQGTLDLKLWEQV
RCCLKRGLEQGHFANVTVLTTWSWLHSALYPFDMPDCDESHHPFSSPEGNSEDLKGKGTQ
EPEQQRGAPRPTSLPSVGHKEDTLVKVLVSAAEYTVWGSDSEGLDLAVITVMVLKEQDAV
QLISTGICDRLPRGMFGLIIGRSSNTLKGIQILPGVIDSDYLGEIKLMAQVSGVHTISKG
TRLAQTILIPYLQGATAPLKIIKIKRKTTSPVWVEQWPIKKEKLEHIQRLVQEQHAGHIE
PPASPWNTPIFTIPKRELALLHGFRLDRQALGTDGQDHNENAHHKCHQGPEEAMQEDNLI
MSAMQKHIIWCFVLLEEPPMLLLLYMSQFQTQHARNCCQRLVKPILNGAKQLQHQYCTII
LAALEKGFEKYILGEGKVSCMSF
>gi568815586f:70178227_70453912|GENSCAN_predicted_CDS_9|1332_bp
atgagtcaggagttgaccaaagagcagaaagttttttataaaatggtgcaacaattactt
aaggctatccagtgcactgtagagtccgaagctttgcataagcttatgcttttaatttgg
caggaatgcccttggttacatgatcaaggaacattagatttaaagttatgggagcaggtg
cgttgctgcctgaaaagaggattggagcagggccattttgctaatgttaccgttttgacc
acctggagttggttacactctgcgttatacccttttgatatgccagactgtgatgagtca
catcacccattttcttcacctgaagggaattcagaagatttaaagggaaagggtacacag
gaaccagaacaacaaagaggggctcctcgccccacttcactcccttcagtagggcataaa
gaagacactctggtaaaagttttggtttcagctgccgaatatactgtgtgggggagcgac
agcgaagggctggacctcgcagtcataacagtcatggtgcttaaagagcaagatgcggtc
cagttgatttcaactggaatctgtgaccggttacccagaggaatgtttggattgattatt
ggaaggtcttctaatacacttaaagggattcaaattctccctggagtgatagattcagac
tatttaggagaaataaaacttatggcacaggtgtcaggggtccacaccatctctaaagga
actcgccttgctcagactattcttattccttaccttcaaggggccactgctcctttgaaa
attatcaaaattaagaggaaaactactagcccagtatgggtggagcagtggcccattaag
aaggaaaaattggaacatattcaacgtctagtacaagaacaacatgctggccacattgag
cctcctgctagtccctggaacactcctatttttactattccaaagagagaacttgcgctt
cttcatggcttccgccttgaccgccaagctcttggcacagatggtcaggaccacaatgag
aacgcccaccacaaatgtcaccaggggccagaggaagcaatgcaggaggacaatctcatc
atgagtgcgatgcagaagcacatcatctggtgctttgtcctccttgaggagcctccaatg
ctgctgctcctatacatgtcacaatttcagacccagcatgctaggaactgctgccagcgc
ctggttaagccaatactaaatggggccaaacagctccagcatcagtattgtaccattatt
ttggcagctctggagaaaggatttgaaaaatatatcctgggtgaagggaaagtcagctgc
atgtctttctga