GENSCAN 1.0 Date run: 16-Jul-119 Time: 15:58:28 Sequence gi568815583r:51838159_52071745 : 233587 bp : 42.16% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 1337 1332 6 1.05 1.03 Term - 10925 10818 108 2 0 32 38 148 0.198 1.73 1.02 Intr - 22408 22142 267 1 0 10 58 209 0.385 6.91 1.01 Init - 23134 22799 336 1 0 59 42 365 0.576 26.32 1.00 Prom - 23711 23672 40 -12.52 2.00 Prom + 24114 24153 40 -5.05 2.01 Init + 24727 24852 126 0 0 78 92 180 0.962 17.61 2.02 Intr + 31059 31215 157 2 1 65 121 126 0.835 12.26 2.03 Intr + 49431 49553 123 1 0 53 69 113 0.352 5.54 2.04 Intr + 55657 55787 131 2 2 70 115 113 0.994 11.69 2.05 Intr + 58261 58368 108 0 0 73 93 59 0.747 4.46 2.06 Intr + 61997 62140 144 1 0 97 49 158 0.989 12.36 2.07 Intr + 63734 63878 145 1 1 54 82 133 0.963 8.23 2.08 Term + 69538 69608 71 2 2 101 54 66 0.732 1.62 2.09 PlyA + 70259 70264 6 1.05 3.09 PlyA - 70607 70602 6 1.05 3.08 Term - 80985 80924 62 1 2 71 38 82 0.516 -1.51 3.07 Intr - 84750 84567 184 0 1 55 87 192 0.758 14.24 3.06 Intr - 86020 85882 139 0 1 59 95 2 0.677 -2.45 3.05 Intr - 87721 87587 135 0 0 77 61 63 0.662 1.26 3.04 Intr - 88714 88620 95 1 2 70 105 63 0.931 4.04 3.03 Intr - 90135 90051 85 2 1 33 76 123 0.953 4.50 3.02 Intr - 91320 91175 146 0 2 79 71 108 0.991 6.36 3.01 Init - 92451 92371 81 0 0 101 87 115 0.922 13.72 3.00 Prom - 93597 93558 40 -6.45 4.13 PlyA - 94394 94389 6 1.05 4.12 Term - 100102 99998 105 1 0 21 45 235 0.998 9.93 4.11 Intr - 109231 109134 98 2 2 79 94 118 0.965 10.31 4.10 Intr - 111836 111650 187 2 1 68 105 304 0.824 28.44 4.09 Intr - 113821 113686 136 0 1 90 95 83 0.966 8.85 4.08 Intr - 115105 114971 135 0 0 81 93 55 0.942 4.06 4.07 Intr - 116417 116323 95 2 2 68 105 102 0.997 7.74 4.06 Intr - 120668 120584 85 1 1 17 76 149 0.618 5.50 4.05 Intr - 121886 121741 146 2 2 72 110 147 0.991 13.46 4.04 Intr - 122575 122481 95 2 2 52 81 129 0.998 7.36 4.03 Intr - 124335 124231 105 1 0 62 63 165 0.998 10.67 4.02 Intr - 128346 127591 756 1 0 51 90 1340 0.995 121.01 4.01 Init - 133587 133239 349 0 1 92 57 332 0.712 27.89 4.00 Prom - 153730 153691 40 -1.95 5.05 PlyA - 153800 153795 6 1.05 5.04 Term - 168592 168454 139 0 1 68 47 98 0.732 0.15 5.03 Intr - 170564 170426 139 0 1 62 99 117 0.378 8.80 5.02 Intr - 170969 170938 32 1 2 59 84 46 0.048 -1.84 5.01 Init - 181335 181043 293 0 2 83 -8 327 0.292 17.27 5.00 Prom - 206558 206519 40 -1.15 6.00 Prom + 207368 207407 40 -6.65 6.01 Init + 208303 208857 555 0 0 68 95 468 0.978 40.51 6.02 Intr + 211835 211979 145 1 1 88 92 99 0.966 9.23 6.03 Intr + 220475 220639 165 0 0 87 67 132 0.992 10.01 6.04 Intr + 223141 223342 202 2 1 101 94 183 0.994 17.62 6.05 Term + 225744 226842 1099 0 1 64 41 888 0.999 72.26 6.06 PlyA + 227047 227052 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:51838159_52071745|GENSCAN_predicted_peptide_1|236_aa MAEIIQEHKEDQLPELEQLEHIGLFSHAEINAIIKKASDLQYRIQERALFKDDFINYVQH EINLFEQIQRRTRIGYSFKKDEIENSIVHQVQGVFRSASAKRKDDVQLWLSYPALWIMAA KWEMEDCLSSESARQLFLWALHFHPEGPKLYQDYFRMELMHAEKLRKEKQEFQKANMDVE NPDYSEEILKGKLARIIYENSPLSPLIWAIAIASQLPKEADREHTPINTPLSASQS >gi568815583r:51838159_52071745|GENSCAN_predicted_CDS_1|711_bp atggcagagataattcaggaacacaaagaagatcagctccctgaattggaacagctagag cacattggactgttcagtcatgcagagattaacgctatcattaagaaggcttctgatcta cagtacagaatccaggaaagagcccttttcaaggacgactttatcaattatgttcaacat gaaattaatcttttcgaacagatccagagaagaacacgcattggatattcatttaagaag gatgagattgagaattctattgtacaccaggtacaaggtgttttccgaagtgcctcagca aagaggaaagacgatgttcaactttggctctcctatccagctttgtggattatggcagcc aaatgggaaatggaagactgcttgtcttcagaaagtgcaaggcaactatttctttgggca ctgcactttcatccagagggcccaaaactttatcaagactactttaggatggagctgatg catgctgaaaaactgaggaaggaaaagcaagaatttcaaaaagccaacatggatgtggag aatcctgattattctgaagaaatccttaagggcaaattggcacggatcatctatgaaaat tctccattatctcctcttatctgggctattgcaatagcttcccaacttccgaaagaagct gatcgggaacatactccaataaacactcctctgagtgccagccaatcatag >gi568815583r:51838159_52071745|GENSCAN_predicted_peptide_2|334_aa MALPFRKDLEKYKDLDEDELLGNLSETELKQLETVLDDLDPENALLPAGFRQKNQTSKST TGPFDREHLLSYLEKEALEHKDREDYVPYTGEKKGKIFIPKQKPVQTFTEEKVSLDPELE EALTSASDTELCDLADVVKGEKILPVFDEPPNPTNVEESLKRTKENDAHLVEVNLNNIKN IPIPTLKDFAKALETNTHVKCFSLAATRSNDPVATAFAEMLKVNKTLKSLNVESNFITGV GILALIDALRDNETLAELKIDNQRQQLGTAVELEMAKMLEENTNILKFGYQFTQQGPRTR AANAITKNNDLGAGHNADSPELVALSECSLVADR >gi568815583r:51838159_52071745|GENSCAN_predicted_CDS_2|1005_bp atggcactgccattccgtaaggacttagaaaagtacaaagaccttgatgaagatgagctc cttgggaatctgtcagaaacagaactgaaacaactggaaactgttttggatgatcttgac cccgagaatgcccttctgcctgcagggttccggcagaagaaccagacatcaaagtccacc acagggccatttgatagagagcatctcctttcatatctggagaaagaagcattggagcat aaagacagggaagactatgtgccctacactggagaaaaaaaagggaaaatatttatcccc aaacagaaacctgtacagacttttacagaagaaaaagtgtctcttgatccagaattagaa gaagctttgacaagtgcttctgatacagaattgtgtgacctcgcagatgtggtcaaaggt gaaaagattcttccggtatttgatgagccaccaaatccaaccaatgtagaagagagtttg aagagaactaaagaaaacgatgctcatcttgttgaagttaatttgaataatataaagaat atcccaattccaaccctaaaagattttgcaaaggctttggaaaccaacacacatgtgaaa tgtttcagtcttgcagccacccggagcaatgaccctgttgctactgcttttgcagaaatg ctgaaagtgaacaaaactttgaagagcttaaatgtggagtccaactttatcacgggagtt gggattctggcactgattgatgcgttaagagataatgaaaccctggcagagctcaagatt gacaatcagaggcagcagttggggacagctgtagaattggaaatggccaagatgcttgag gaaaatacaaatatccttaaatttggatatcagtttacacagcagggaccacgaaccaga gcagctaatgctataacaaaaaacaatgacttaggagccggccacaatgctgatagccct gaattggtggccctcagtgaatgctctctagtggctgataggtga >gi568815583r:51838159_52071745|GENSCAN_predicted_peptide_3|308_aa MDLFGDIDDVSSESDEGNQPPIPGQLIDEHGVPQDQQEEEPISETIIEEEIPSINSDLGN ELYFVKLPKFLSIEPKPFDPQFYEDEFEDEKVLDEEDRIRLKLKVENTIRWRIRRDKEGN KIKESNARMVKWSDRSMSLHLGNEVFDVYKAPLLGNYIHLFIREDTGLQGQAVFKSKLTF RPHSRDSATYRKMTLPLANRSSKTQKIRILPMAGRDPEGQHTQVMKKKEERLRASTQRES QGIHLREKRYQEWPSVSYQDPGSDSAEEEGKHTFSLAAIKNYYQGELQSKPSRKRKAEHE EEEDDIKP >gi568815583r:51838159_52071745|GENSCAN_predicted_CDS_3|927_bp atggatctgtttggagatatagacgacgtttcttctgagagtgatgagggcaatcaacca cctattccaggacagctgattgatgaacatggagtgcctcaggaccagcaggaggaagag ccaatttctgaaaccataatagaagaagaaattcccagtatcaactctgatttaggaaat gaattgtattttgttaaactacccaagtttctcagtatagaacccaaaccttttgatcct cagttttatgaagatgaatttgaagatgagaaagtgcttgacgaggaagacagaatcagg ttaaaattaaaggtagaaaatactataagatggaggatacgccgggataaagaaggaaat aaaattaaagaaagcaatgctcggatggtcaagtggtcagacagaagcatgtccctgcat ttaggcaatgaagtgtttgatgtgtacaaagccccgctgctgggcaattacatccacctg tttataagagaagacactggtctacagggacaagccgtctttaaatccaaacttaccttt agacctcactctagagacagtgccacatacagaaagatgaccctgccacttgctaataga agttcaaagacacagaaaattagaatcttaccaatggcgggtcgtgatcctgaaggccaa cacacacaagtgatgaagaagaaagaagaacgcttgagggcttccactcaacgggagtct cagggaatccatctgcgggagaagcgctaccaggagtggccaagtgtctcctaccaggac cctggcagtgacagtgcagaggaggaaggcaagcacaccttcagcctggctgctattaaa aactattatcaaggtgaactccaaagtaaaccttccagaaagaggaaagcagagcatgaa gaggaagaagatgatataaagccctaa >gi568815583r:51838159_52071745|GENSCAN_predicted_peptide_4|763_aa MADMEDLFGSDADSEAERKGGFAGLRRMGAWQAFRCGSRTTVEAPAPRARGGVSSTPASA PPALPEVPSSSGGQDGESDSLRLEALGVGGADGSWELVERKGRGAWVEVQASLRSADSDS GSDSDSDQENAASGSNASGSESDQDERGDSGQPSNKELFGDDSEDEGASHHSGSDNHSER SDNRSEASERSDHEDNDPSDVDQHSGSEAPNDDEDEGHRSDGGSHHSEAEGSEKAHSDDE KWGREDKSDQSDDEKIQNSDDEERAQGSDEDKLQNSDDDEKMQNTDDEERPQLSDDERQQ LSEEEKANSDDERPVASDNDDEKQNSDDEEQPQLSDEEKMQNSDDERPQASDEEHRHSDD EEEQDHKSESARGSDSEDEVLRMKRKNAIASDSEADSDTEVPKDNSGTMDLFGGADDISS GSDGEDKPPTPGQPVDENGLPQDQQEEEPIPETRIEVEIPKVNTDLGNDLYFVKLPNFLS VEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKVENTIRWRIRRDEEGNEIKESNARIVKW SDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQGTGLQGQAVFKTKLTFRPHSTDSATHRK MTLSLADRCSKTQKIRILPMAGRDPECQRTEMIKKEEERLRASIRRESQQRRMREKQHQR GLSASYLEPDRYDEEEEGEESISLAAIKNRYKGGIREERARIYSSDSDEGSEEDKAQRLL KAKKLTSDEEGEPSGKRKAEDDDKANKKHKKYVISDEEEEDDD >gi568815583r:51838159_52071745|GENSCAN_predicted_CDS_4|2292_bp atggcggatatggaggatctcttcgggagcgacgccgacagcgaagctgagcgtaaaggt gggttcgctggtcttcggaggatgggtgcgtggcaggccttccgctgtgggagccggaca accgtggaggctccagcgccgcgggcaagaggtggcgtttcctccactcctgcctcagcg ccccctgcactcccggaggtgccatcctcctccggcgggcaggatggagagtcagacagt ctgcgcctcgaggctctgggtgtcggtggagcagacggttcctgggagcttgtggagcga aaggggcgcggggcatgggtggaagtgcaggcgagcctgcgcagcgcggattctgattct ggatctgactcagattctgatcaagagaatgctgcctctggcagtaatgcctctggaagt gaaagtgatcaggatgaaagaggtgattcaggacaaccaagtaataaggaactgtttgga gatgacagtgaggacgagggagcttcacatcatagtggtagtgataatcactctgaaaga tcagacaatagatcagaagcttctgagcgttctgaccatgaggacaatgacccctcagat gtagatcagcacagtggatcagaagcccctaatgatgatgaagacgaaggtcatagatcg gatggagggagccatcattcagaagcagaaggttctgaaaaagcacattcagatgatgaa aaatggggcagagaagataaaagtgaccagtcagatgatgaaaagatacaaaattctgat gatgaggagagggcacaaggatctgatgaagataagctgcagaattctgacgatgatgag aaaatgcagaacacagatgatgaggagaggcctcagctttccgatgatgagagacaacag ctatctgaggaggaaaaggctaattctgatgatgaacggccggtagcttctgataatgat gatgagaaacagaattctgatgatgaagaacaaccacagctgtctgatgaagagaaaatg caaaattctgatgatgaaaggccacaggcctcagatgaagaacacaggcattcagatgat gaagaggaacaggatcataaatcagaatctgcaagaggcagtgatagtgaagatgaagtt ttacgaatgaaacgcaagaatgcgattgcatctgattcagaagcggatagtgacactgag gtgccaaaagataatagtggaaccatggatttatttggaggtgcagatgatatctcttca gggagtgatggagaagacaaaccacctactccaggacagcctgttgatgaaaatggattg cctcaggatcaacaggaagaggagccaattcctgagaccagaatagaagtagaaataccc aaagtaaacactgatttaggaaacgacttatattttgttaaactgcccaactttctcagt gtagagcccagaccttttgatcctcagtattatgaagatgaatttgaagatgaagaaatg ctggatgaagaaggtagaaccaggttaaaattaaaggtagaaaatactataagatggagg atacgccgagatgaagaaggaaatgaaattaaagaaagcaatgctcggatagtcaagtgg tcagatggaagcatgtccctgcatttaggcaatgaagtgtttgatgtgtacaaagcccca ctgcagggcgaccacaatcatctttttataagacaaggtactggtctacagggacaagca gtctttaaaacgaaactcaccttcagacctcactctacggacagtgccacacatagaaag atgactctgtcacttgcagataggtgttcaaagacacagaagattagaatcttgccaatg gctggtcgtgatcctgaatgccaacgcacagaaatgattaagaaagaagaagaacgtttg agggcttccatacgtagggaatctcagcagcgccgaatgagagagaaacagcaccagcgg gggctgagcgccagttacctggaacctgatcgatacgatgaggaggaggaaggcgaggag tccatcagcttggctgccattaaaaaccgatataaagggggcattcgagaggaacgagcc agaatctattcatcagacagtgatgagggatcagaagaagataaagctcaaagattactc aaagcaaagaaacttaccagtgatgaggaaggtgaaccttccggaaagagaaaagcagaa gatgatgataaagcaaataaaaagcataagaagtatgtgatcagcgatgaagaggaagaa gatgatgattga >gi568815583r:51838159_52071745|GENSCAN_predicted_peptide_5|200_aa MEEGAGSAVAAAAAAAGPAAAPSRAAPPAHPAHVHTHTHGTSPTHRESAGLADCPQCRLR SPAVLAAAASPRLSLCRSRRHVTPTRERGRGRGGGASRSQAILSFEEEGHYTGPCVQKDP MLGLMLCYDHLDIPDDFIFELVLCSEVRCGSGARRYEGSSSSDLLLEELDCRFTRVDWPL ATNTYQPSSRDCTLDQLPIH >gi568815583r:51838159_52071745|GENSCAN_predicted_CDS_5|603_bp atggaggagggagccggctccgccgtcgccgccgccgccgccgccgccgggcccgccgcg gcgccctcccgagccgccccgccggcgcaccccgcacacgtacacacacacactcacggg acttctccgactcatcgagagagtgcggggctcgcggactgcccccagtgtcggctcagg tctccggctgtgctggcggcagcggcgtctccgaggctctcgctttgccgcagtcgccgc catgtaaccccgacccgcgagagagggcgaggaagagggggcggggcttcgcgtagtcaa gctatcctgtcctttgaggaagaagggcattacacagggccctgcgtgcaaaaggacccc atgcttggtttaatgctctgctatgaccatcttgacattcctgatgattttatctttgaa ctcgtgttgtgtagtgaagtcagatgtggcagtggagcacgcagatatgaagggtccagt tcttcagatttgctgcttgaagaacttgattgcaggtttaccagggttgattggcccctg gcaaccaacacataccaaccaagttcaagagattgtaccctagaccagctgccaatccac taa >gi568815583r:51838159_52071745|GENSCAN_predicted_peptide_6|721_aa MAEKFESLMNIHGFDLGSRYMDLKPLGCGGNGLVFSAVDNDCDKRVAIKKIVLTDPQSVK HALREIKIIRRLDHDNIVKVFEILGPSGSQLTDDVGSLTELNSVYIVQEYMETDLANVLE QGPLLEEHARLFMYQLLRGLKYIHSANVLHRDLKPANLFINTEDLVLKIGDFGLARIMDP HYSHKGHLSEGLVTKWYRSPRLLLSPNNYTKAIDMWAAGCIFAEMLTGKTLFAGAHELEQ MQLILESIPVVHEEDRQELLSVIPVYIRNDMTEPHKPLTQLLPGISREALDFLEQILTFS PMDRLTAEEALSHPYMSIYSFPMDEPISSHPFHIEDEVDDILLMDETHSHIYNWERYHDC QFSEHDWPVHNNFDIDEVQLDPRALSDVTDEEEVQVDPRKYLDGDREKYLEDPAFDTNYS TEPCWQYSDHHENKYCDLECSHTCNYKTRSSSYLDNLVWRESEVNHYYEPKLIIDLSNWK EQSKEKSDKKGKSKCERNGLVKAQIALEEASQQLAGKEREKNQGFDFDSFIAGTIQLSSQ HEPTDVVDKLNDLNSSVSQLELKSLISKSVSQEKQEKGMANLAQLEALYQSSWDSQFVSG GEDCFFINQFCEVRKDEQVEKENTYTSYLDKFFSRKEDTEMLETEPVEDGKLGERGHEEG FLNNSGEFLFNKQLESIGIPQFHSPVGSPLKSIQATLTPSAMKSSPQIPHQTYSSILKHL N >gi568815583r:51838159_52071745|GENSCAN_predicted_CDS_6|2166_bp atggcagagaaatttgaaagtctcatgaacattcatggttttgatctgggttctaggtat atggacttaaaaccattgggttgtggaggcaatggcttggttttttctgctgtagacaat gactgtgacaaaagagtagccatcaagaaaattgtccttactgatccccagagtgtcaaa catgctctacgtgaaatcaaaattattagaagacttgaccatgataacattgtgaaagtg tttgagattcttggtcccagtggaagccaattaacagacgatgtgggctctcttacggaa ctgaacagtgtttacattgttcaggagtacatggagacagacttggctaatgtgctggag cagggccctttactggaagagcatgccaggcttttcatgtatcagctgctacgggggctc aagtatattcactctgcaaatgtactgcacagagatctcaaaccagctaatcttttcatt aatacggaagacttggtgctgaagataggtgactttggtcttgcacggatcatggatcct cattattcccataagggtcatctttctgaaggattggttactaaatggtacagatctcca cgtcttttactttctcctaataattatactaaagccattgacatgtgggctgcaggctgc atctttgctgaaatgctgactggtaaaaccctttttgcaggtgcacatgaacttgaacag atgcagctgattttagaatctattcctgttgtacatgaggaagatcgtcaggagcttctc agcgtaattccagtttacattagaaatgacatgactgagccacacaaacctttaactcag ctgcttccaggaattagtcgagaagcactggatttcctggaacaaattttgacatttagc cccatggatcggttaacagcagaagaagcactctcccatccttacatgagcatatattct tttccaatggatgagccaatttcaagccatccttttcatattgaagatgaagttgatgat attttgcttatggatgaaactcacagtcacatttataactgggaaaggtatcatgattgt cagttttcagagcatgattggcctgtacataacaactttgatattgatgaagttcagctt gatccaagagctctgtccgatgtcactgatgaagaagaagtacaagttgatccccgaaaa tatttggatggagatcgggaaaagtatctggaggatcctgcttttgataccaattactct actgagccttgttggcaatactcagatcatcatgaaaacaaatattgtgatctggagtgt agccatacttgtaactacaaaacgaggtcatcatcatatttagataacttagtttggaga gagagtgaagttaaccattactatgaacccaagcttattatagatctttccaattggaaa gaacaaagcaaagaaaaatctgataagaaaggcaaatcaaaatgtgaaaggaatggattg gttaaagcccagatagcgctagaggaagcatcacagcaactggctggaaaagaaagggaa aagaatcagggatttgattttgattcctttattgcaggaactattcagcttagttcccag catgagcctactgatgttgttgataaattaaatgacttgaatagctcagtgtcccaacta gaattgaaaagtttgatatcaaagtcagtaagccaagaaaaacaggaaaaaggaatggca aatctggctcaattagaagccttgtaccagtcttcttgggacagccagtttgtgagtggt ggggaggactgttttttcataaatcagttttgtgaggtaaggaaggatgaacaagttgag aaggaaaacacttacactagttacttggacaagttctttagcaggaaagaagatactgaa atgctagaaactgagccagtagaggatgggaagcttggggagagaggacatgaggaagga tttctgaacaacagtggggagttcctctttaacaagcagctcgagtccataggcatccca cagtttcacagtccagttgggtcaccacttaagtcaatacaggccacattaacaccttct gctatgaaatcttcccctcaaattcctcatcaaacatacagcagcattctgaaacatctg aactaa