GENSCAN 1.0    Date run:  8-Nov-116    Time: 05:52:59

Sequence gi568815584f:28667280_28868746 : 201467 bp : 37.05% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -    310    305    6                               1.05
 1.02 Term -   6469   6348  122  2  2   84   42   150 0.490   7.66
 1.01 Init -  10184  10094   91  2  1   59   17    81 0.322  -1.20
 1.00 Prom -  10362  10323   40                              -6.25

 2.00 Prom +  11214  11253   40                              -1.85
 2.01 Sngl +  16653  17105  453  2  0   87   54   232 0.915  15.84
 2.02 PlyA +  21700  21705    6                               1.05

 3.03 PlyA -  22026  22021    6                               1.05
 3.02 Term -  25379  25191  189  2  0   11   47   135 0.673  -1.83
 3.01 Init -  28676  28593   84  2  0   85   90    87 0.863   9.47
 3.00 Prom -  81645  81606   40                              -1.85

 4.02 PlyA -  82051  82046    6                               1.05
 4.01 Sngl -  94586  94335  252  2  0   44   42   282 0.749  14.04
 4.00 Prom -  96055  96016   40                              -6.15

 5.00 Prom +  96275  96314   40                              -7.35
 5.01 Init +  98037  98209  173  2  2   53   -5   157 0.763   1.87
 5.02 Intr +  98613  98734  122  0  2   23   91   142 0.547   7.22
 5.03 Intr +  99497  99680  184  0  1   17   23    92 0.452  -6.48
 5.04 Term +  99838 101470 1633  1  1   11   38  1341 0.797 109.65
 5.05 PlyA + 102365 102370    6                               1.05

 6.04 PlyA - 102417 102412    6                               1.05
 6.03 Term - 106705 106572  134  2  2   61   44   105 0.650   0.67
 6.02 Intr - 106806 106717   90  1  0   39   50   165 0.404   6.85
 6.01 Init - 107294 106931  364  2  1   53   -7   332 0.574  15.47
 6.00 Prom - 109594 109555   40                              -6.75

 7.00 Prom + 110121 110160   40                              -8.45
 7.01 Init + 110598 110712  115  2  1   61   86    74 0.495   4.92
 7.02 Intr + 111000 111123  124  1  1   30    7   171 0.177   1.92
 7.03 Term + 118069 118432  364  1  1   58   54   226 0.608   9.25
 7.04 PlyA + 120567 120572    6                               1.05

 8.00 Prom + 128964 129003   40                              -3.05
 8.01 Init + 161892 162005  114  2  0   35  110   109 0.231   8.06
 8.02 Term + 179047 179346  300  0  0   76   34   177 0.044   5.34
 8.03 PlyA + 179366 179371    6                               1.05

 9.00 Prom + 180395 180434   40                              -6.15
 9.01 Sngl + 180556 181050  495  0  0   44   41   241 0.492  10.80
 9.02 PlyA + 181204 181209    6                               1.05

10.00 Prom + 181271 181310   40                              -8.05
10.01 Init + 182315 182738  424  1  1   60   65   208 0.227  12.25
10.02 Term + 182988 183382  395  1  2   -5   50   231 0.204   4.21
10.03 PlyA + 183429 183434    6                              -0.45

11.00 Prom + 183612 183651   40                              -2.75
11.01 Init + 190183 190341  159  0  0   58   54   106 0.357   4.09
11.02 Term + 190806 190964  159  2  0   -4   42   211 0.731   4.26
11.03 PlyA + 192473 192478    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 175700 175808  109  1  1   92   66    50 0.883   3.63

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815584f:28667280_28868746|GENSCAN_predicted_peptide_1|70_aa
MGPDSYNSKEVNFVYTKNDHEMRFFPDPQTNPYWLREDERRNYEPRKTRQTAGISAYWWE
RNCSQKEEGS

>gi568815584f:28667280_28868746|GENSCAN_predicted_CDS_1|213_bp
atgggtcctgactcctacaacagcaaggaggtgaattttgtctacacaaagaatgaccat
gaaatgagatttttcccagatcctcagacaaacccttactggctccgagaagatgaaaga
cgcaattatgaaccaagaaaaactcgccaaactgcaggcataagtgcgtattggtgggaa
aggaactgctcgcagaaagaagaaggttcatag

>gi568815584f:28667280_28868746|GENSCAN_predicted_peptide_2|150_aa
MGKYLQAMSETFPAAPHITGLQAQEEKVVLCARPRVPVQPRDLVPCVPDAPAMAERGQCR
AQTVASEGGSPKPWQLPHGVECVGAQKSRIGFGNLHLDFRRCTEMPGCPGKSLLQGQGPH
GEPLLEQCGKEMWSQSPHTESLLGHILLEL

>gi568815584f:28667280_28868746|GENSCAN_predicted_CDS_2|453_bp
atggggaaatatctccaggccatgtcagagaccttcccagcagcccctcacatcacaggc
ctgcaggcccaggaggaaaaagtggttttgtgcgccaggccaagggtccccgtgcagcct
agggacttggtgccctgtgtcccagatgctccagccatggcagaaaggggacaatgtaga
gctcagaccgtggcttcagagggtggaagccccaagccttggcagcttccacatggtgtt
gagtgtgtgggtgcacagaagtcaagaattgggtttggaaatctccacctagatttcaga
aggtgtacagaaatgcctggatgcccaggcaaaagtttgctgcaggggcagggccctcat
ggagaacctctgctagagcagtgtggaaaggaaatgtggagtcagagcccgcacacagag
tccctactggggcatatcctactggagctgtga

>gi568815584f:28667280_28868746|GENSCAN_predicted_peptide_3|90_aa
MAAHEMLLVQYLVDVIKPHYLGKSDSEKPGQQEQNSISKQQQQQQTKQQQQQTKQQQQQK
TLNDITQIVRMMEMVMVMMRMMLMIIANTY

>gi568815584f:28667280_28868746|GENSCAN_predicted_CDS_3|273_bp
atggcggcacatgagatgcttcttgtccagtaccttgtggatgtgataaaacctcattat
ctggggaaatctgactctgaaaagcctggtcaacaagagcaaaactccatctccaaacaa
caacaacagcaacaaaccaaacaacaacaacaacaaaccaaacaacaacagcaacagaaa
actcttaatgatataacacaaatagtgaggatgatggagatggtgatggtgatgatgagg
atgatgttaatgatcatagctaacacttactaa

>gi568815584f:28667280_28868746|GENSCAN_predicted_peptide_4|83_aa
MFSLCKLGSHFKREKVQGEEEEQEEEEDKDEEKALTETPGTLQASLCDAARCSRLSEAKY
VRSQKKSPVIFSKMCMGEALSRH

>gi568815584f:28667280_28868746|GENSCAN_predicted_CDS_4|252_bp
atgtttagcttgtgtaagctagggagccactttaaaagggagaaagtccaaggcgaagaa
gaggagcaggaggaggaagaggacaaggatgaggagaaggccttgacagagacccctggg
actctccaggcaagtctctgtgacgctgccaggtgttccaggctctctgaagccaagtac
gtacgcagccaaaagaaaagtcccgtgatcttcagcaaaatgtgcatgggagaagctttg
tcccgacattag

>gi568815584f:28667280_28868746|GENSCAN_predicted_peptide_5|703_aa
MLGLSPAPGHGILLKPPIRNFPIHFNLQPNLPSSKAQEAGTVRSTFPRLPGAFQQHSGPS
EAPTTERPRPPQQHPEPQSRFPRGAGARSSPPPRPQESTRKINCGYSRLDRCLPSLAAAA
PGRARPPPGSPPLWAAAAAAVTAAARGGGGGGSSGGGSGGARACGPTALLPERSVPPLPP
APPAPRSAPTGRRRPPRRCPAPAPPPPFPPDDWVMLDMGDRKEVKMIPKSSFSINSLVPE
AVQNDNHHASHGHHNSHHPQHHHHHHHHHHHPPPPAPQPPPPPQQQQPPPPPPPAPQPPQ
TRGAPAADDDKGPQQLLLPPPPPPPPAAALDGAKADGLGGKGEPGGGPGELAPVGPDEKE
KGAGAGGEEKKGAGEGGKDGEGGKEGEKKNGKYEKPPFSYNALIMMAIRQSPEKRLTLNG
IYEFIMKNFPYYRENKQGWQNSIRHNLSLNKCFVKVPRHYDDPGKGNYWMLDPSSDDVFI
GGTTGKLRRRSTTSRAKLAFKRGARLTSTGLTFMDRAGSLYWPMSPFLSLHHPRASSTLS
YNGTTSAYPSHPMPYSSVLTQNSLGNNHSFSTANGLSVDRLVNGEIPYATHHLTAAALAA
SVPCGLSVPCSGTYSLNPCSVNLLAGQTSYFFPHVPHPSMTSQSSTSMSARAASSSTSPQ
APSTLPCESLRPSLPSFTTGLSGGLSDYFTHQNQGSSSNPLIH

>gi568815584f:28667280_28868746|GENSCAN_predicted_CDS_5|2112_bp
atgctgggccttagccctgcccctggccacgggatccttttaaagcccccgattcgcaat
ttccccattcacttcaacctccaaccgaaccttcccagttccaaagcccaagaagctggg
actgtgagatccacgttcccaaggctccccggtgctttccagcagcactcagggccatca
gaggcgcccactactgagcggccccggccgccgcagcagcacccggagccccagtcccgg
tttccccgcggtgccggagcccggagctcgccgccgcccaggcctcaggaatcgactcgg
aaaattaattgtggctatagccgcctcgatcgctgtctccccagcctcgccgcggccgct
ccgggacgcgcccgcccgccgcccggctctccccccctttgggctgctgctgctgctgct
gtgactgctgctgcgagaggaggaggaggaggaggaagcagcgggggggggagcgggggc
gcccgagcctgcggtccaactgcgctgctgccggagcgctcagtgccgccgctgccgccc
gcgccccccgcgccccgttcggcacccaccggtcgccgccgcccgccgcgccgctgtccc
gctcccgcgccgccgccgccgtttccccccgacgactgggtgatgctggacatgggagat
aggaaagaggtgaaaatgatccccaagtcctcgttcagcatcaacagcctggtgcccgag
gcggtccagaacgacaaccaccacgcgagccacggccaccacaacagccaccacccccag
caccaccaccaccaccaccaccatcaccaccacccgccgccgcccgccccgcaaccgccg
ccgccgccgcagcagcagcagccgccgccgccgccgcccccggcaccgcagcccccccag
acgcggggcgccccggccgccgacgacgacaagggcccccagcagctgctgctcccgccg
ccgccaccgccaccaccggccgccgccctggacggggctaaagcggacgggctgggcggc
aagggcgagccgggcggcgggccgggggagctggcgcccgtcgggccggacgagaaggag
aagggcgccggcgccgggggggaggagaagaagggggcgggcgagggcggcaaggacggg
gaggggggcaaggagggcgagaagaagaacggcaagtacgagaagccgccgttcagctac
aacgcgctcatcatgatggccatccggcagagccccgagaagcggctcacgctcaacggc
atctacgagttcatcatgaagaacttcccttactaccgcgagaacaagcagggctggcag
aactccatccgccacaatctgtccctcaacaagtgcttcgtgaaggtgccgcgccactac
gacgacccgggcaagggcaactactggatgctggacccgtcgagcgacgacgtgttcatc
ggcggcaccacgggcaagctgcggcgccgctccaccacctcgcgggccaagctggccttc
aagcgcggtgcgcgcctcacctccaccggcctcaccttcatggaccgcgccggctccctc
tactggcccatgtcgcccttcctgtccctgcaccacccccgcgccagcagcactttgagt
tacaacggcaccacgtcggcctaccccagccaccccatgccctacagctccgtgttgact
cagaactcgctgggcaacaaccactccttctccaccgccaacggcctgagcgtggaccgg
ctggtcaacggggagatcccgtacgccacgcaccacctcacggccgccgcgctagccgcc
tcggtgccctgcggcctgtcggtgccctgctctgggacctactccctcaacccctgctcc
gtcaacctgctcgcgggccagaccagttactttttcccccacgtcccgcacccgtcaatg
acttcgcagagcagcacgtccatgagcgccagggccgcgtcctcctccacgtcgccgcag
gccccctcgaccctgccctgtgagtctttaagaccctctttgccaagttttacgacggga
ctgtctgggggactgtctgattatttcacacatcaaaatcaggggtcttcttccaaccct
ttaatacattaa

>gi568815584f:28667280_28868746|GENSCAN_predicted_peptide_6|195_aa
MLLLPAQRPASPSAGIAAAMSRERFSSPPPPPRSQGGPNSAAEVTSPHRFKTRENNAARS
LPQRLQVPPLCHLSIFTPESLRRREKLINRTPRRESQIRFGPSQPVSGACSQPLWDREAL
ELFRRDRREDSHTHPVPHPTRSSPAKSDRDPGQGAGGGGPALGKVGYVSVQIEKGDPKKN
FREKQVLGSGAKSFN

>gi568815584f:28667280_28868746|GENSCAN_predicted_CDS_6|588_bp
atgttgctgctccctgcgcagcgcccggcttcaccctctgcggggatcgccgccgcaatg
tcccgagagcgcttctcctctcctcctccgccaccgcgcagccagggcggccccaactca
gcagctgaagttacttctccacaccggttcaaaacccgagaaaacaacgctgccaggtcc
ctgccacaacgacttcaagtcccgccattatgccacctctccatcttcaccccggagtcc
ctgaggcgccgggaaaagctaattaataggacaccgcgacgagagtcccaaattcgcttc
gggccgagccaacctgtgtctggcgcgtgctctcagccactctgggacagagaggctctc
gagctctttcgaagagacagaagagaagactcccacacgcacccagtgcctcacccgacc
cggagcagcccggctaaatccgaccgggaccctggccaaggagctgggggcggagggcca
gctctggggaaggttggctatgttagtgtgcaaatagagaagggcgatccaaagaagaac
tttagagagaagcaggttcttggctctggagcaaagagctttaactga

>gi568815584f:28667280_28868746|GENSCAN_predicted_peptide_7|200_aa
MEKELIFSPWRVFLKLLILSVSAVRGGAAQSRGWLQTEGVVDSVDLNCCPSFQAFPVNEP
ENTRQVVNNRFNECAKRATGHTFGPPPELRCPRVTAFCDRVRRDPVPLSPSVFREGEALR
ISGAVQQPRPHPRGSRPRGPFTSPSGLGDANSPKKTLAEERGPFTTNLTSGLQPHLGTSS
EKLRNHCFAKSLLYCDGALW

>gi568815584f:28667280_28868746|GENSCAN_predicted_CDS_7|603_bp
atggaaaaggaattaatcttctcaccctggcgagttttcctgaaactacttatccttagc
gtcagcgctgtaagaggtggagccgctcagtcccgcgggtggctgcagacagaaggggta
gtggacagtgttgacttgaattgctgtccctcgttccaagcctttcctgtgaatgaaccc
gaaaacactcgacaggtcgtgaataatcgttttaatgagtgtgcaaagcgtgcgacggga
cacactttcggtcccccgccagagctccggtgcccccgagtgaccgctttctgcgatcgc
gtccgccgggaccccgtccctctttccccttcagtcttcagggagggggaggcgctccgc
attagcggggcagttcagcaaccccgaccccacccgcgtggctccaggcccaggggtccg
ttcacttccccgtccggtttgggggacgccaattcgcctaagaaaaccctggcagaagag
cgcggacccttcactacaaacctcacgtcagggttacagccacatttaggaacctcttcg
gaaaagctgagaaatcactgttttgcaaaaagccttctgtactgtgatggggctttgtgg
tga

>gi568815584f:28667280_28868746|GENSCAN_predicted_peptide_8|137_aa
MLRILVKEEAEVSELSDCLETEKHAETSYSDGAINPGQDPSSLPATEQSWTENDFDELTE
VDFRRLVIRNFSELKERVLTHHKEAKNLEKRLDEWLTRINSVEKTLNDLMELTWQENFMT
HTQASIADLIKWKKGYQ

>gi568815584f:28667280_28868746|GENSCAN_predicted_CDS_8|414_bp
atgttaaggatactggttaaagaggaagcagaagtgtcagagttgagtgactgcttagaa
actgagaagcatgcagaaaccagttattcagatggagcaattaatccagggcaggatccc
agctccttaccagcaacagaacaaagctggacggagaatgactttgacgagttgacagaa
gtagacttcagaaggttggtaataagaaacttctctgagctaaaggagcgtgttctaacc
catcacaaggaagctaaaaaccttgaaaaaaggttagacgaatggctaactagaataaac
agtgtagagaagaccttaaatgacctgatggagctgacatggcaggagaacttcatgacg
catacacaggcttcaatagccgatttgatcaagtggaagaaaggatatcagtga

>gi568815584f:28667280_28868746|GENSCAN_predicted_peptide_9|164_aa
MQDLNSSWHQVDLIDIYRTLHPKSTEYTFFSAPHHTYSKIDHIVGSKALLSKYKRSEITT
NSLLDHSAIKLESRIKKLTQNCTTTWKLNNLLLNDYWVNNEMKAEIKVFFETNENKDTMY
QNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELEKQE

>gi568815584f:28667280_28868746|GENSCAN_predicted_CDS_9|495_bp
atgcaggacttgaactcatcctggcaccaagtggacctaatagacatctacagaactctc
caccccaaatcaacagaatatactttcttctcagcaccacatcacacttattctaaaatt
gaccacatagttggaagtaaagcactcctcagcaaatataaaagatcagaaatcacaaca
aactctctcctggaccacagtgcaatcaagttagaatccaggattaagaaactcactcaa
aactgcacaactacatggaaactgaacaacctgctcctgaatgattactgggtaaataat
gaaatgaaggcagaaataaaggtgttctttgaaaccaatgagaacaaagacacaatgtac
cagaatctctgggacacatttaaagcagtgtgtagagggaaatttatagcactaaatgcc
cacaagagaaagcaggaaagatctaaaatcgacaccctaacatcacaattaaaagaacta
gagaagcaagagtaa

>gi568815584f:28667280_28868746|GENSCAN_predicted_peptide_10|272_aa
MSELPFTIATKKIKYLGIQLTTDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGGINIM
KMAILPKVIYRFNALPIKIPMTFFTELEKTTLNFIWNQKRARIAKTILSTKNKAGGITLA
DFKLYYKPTLVKTAWYWYQNREENLGNTIQDIGMGKNFMTKTPKAMATKAKIDKRDLIKP
KSFCIAKETTIRMNRQPTEWEKLFAIYPSDKALISGIYKDLKQIYKKKNNPIKKWAKEMN
RHFSKEDIYAANRHVKNAHHHWSSEKCKSKPQ

>gi568815584f:28667280_28868746|GENSCAN_predicted_CDS_10|819_bp
atgagtgaactcccattcacaattgctacaaagaaaataaaatacctaggaatacaactt
acaacggatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaa
gaggacacaaacaaatggaagaacattccatgctcatggataggaggaatcaatatcatg
aaaatggccattctgcccaaggtaatttatagattcaatgctctccccatcaagatacca
atgactttcttcacagaattggaaaaaactactttaaatttcatatggaaccaaaaacga
gcccgcatagccaaaacaatcctaagcacaaagaacaaagctggaggcatcacactagct
gacttcaaactgtactacaagcctacactagtcaaaacagcatggtactggtaccaaaac
agagaagaaaacctaggcaataccattcaggacataggcatgggcaaaaacttcatgact
aaaacaccaaaagcaatggcaacaaaagccaaaatagacaaaagggatctaattaaacca
aagagcttctgcatagcaaaagaaactaccatccgtatgaacaggcagcctacagaatgg
gagaaactttttgcaatctacccgtctgacaaagcgctaatatccggaatctacaaagat
cttaagcaaatttacaagaaaaaaaacaaccccatcaaaaagtgggcaaaggagatgaac
agacacttctcgaaagaagacatttatgcagccaacagacacgtgaagaatgctcatcat
cactggtcatcagagaaatgcaaatcaaaaccacaatga

>gi568815584f:28667280_28868746|GENSCAN_predicted_peptide_11|105_aa
MAILIIISMAMSQRRYSELAQPCGGEGMKSLCHWDPFSAGWSEDTDAFTETRVVIQQVKI
WYRLQNQHPSDDELTGLLVVGPNPLRQQKIGAYYVKLVFLNHNVL

>gi568815584f:28667280_28868746|GENSCAN_predicted_CDS_11|318_bp
atggcaatcttgatcattatctcaatggccatgtcacagaggcgatattcagaacttgcc
caaccctgtggtggtgaagggatgaagtcactgtgtcactgggaccctttcagtgctggc
tggagtgaagacacagatgcctttactgaaactagggtcgtaatccagcaggttaaaatc
tggtacaggctccagaaccagcatccaagtgatgatgaactgacaggcctactggttgtt
ggaccaaatcctctccggcaacagaagattggagcctactatgtcaaacttgtttttctc
aatcacaacgtgctttaa