GENSCAN 1.0 Date run: 5-Nov-116 Time: 00:25:06 Sequence gi568815586r:14842525_15050712 : 208188 bp : 38.88% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 473 537 65 1 2 93 77 102 0.343 6.20 1.02 Term + 1222 1354 133 1 1 49 38 128 0.772 0.48 1.03 PlyA + 2247 2252 6 1.05 2.03 PlyA - 2284 2279 6 1.05 2.02 Term - 8113 7954 160 0 1 97 42 152 0.570 7.93 2.01 Init - 11778 11762 17 0 2 77 107 4 0.561 0.96 2.00 Prom - 14976 14937 40 -3.05 3.02 PlyA - 15091 15086 6 1.05 3.01 Sngl - 21263 20613 651 2 0 43 45 309 0.974 18.02 3.00 Prom - 21488 21449 40 -13.78 4.02 PlyA - 21596 21591 6 1.05 4.01 Sngl - 22563 21775 789 0 0 44 41 315 0.683 18.07 4.00 Prom - 39255 39216 40 -4.25 5.17 PlyA - 39392 39387 6 1.05 5.16 Term - 39756 39615 142 2 1 99 53 164 0.986 10.42 5.15 Intr - 40523 40448 76 0 1 57 78 31 0.973 -3.35 5.14 Intr - 41721 41689 33 1 0 97 83 35 0.681 1.18 5.13 Intr - 42363 42289 75 1 0 38 84 84 0.623 1.57 5.12 Intr - 43326 43207 120 1 0 64 81 149 0.435 11.35 5.11 Intr - 46803 46720 84 1 0 68 44 105 0.096 2.97 5.10 Intr - 60183 60145 39 1 0 92 88 34 0.001 1.08 5.09 Intr - 62691 62585 107 2 2 48 87 83 0.002 3.14 5.08 Intr - 67998 67913 86 0 2 47 73 58 0.050 -2.10 5.07 Intr - 73162 72965 198 1 0 59 111 169 0.934 14.93 5.06 Intr - 74779 74654 126 1 0 35 116 88 0.857 6.26 5.05 Intr - 74977 74936 42 1 0 54 95 74 0.613 2.22 5.04 Intr - 78524 78408 117 2 0 98 42 116 0.863 7.74 5.03 Intr - 92469 92332 138 0 0 102 119 35 0.983 7.74 5.02 Intr - 95528 95428 101 0 2 142 92 120 0.998 16.71 5.01 Init - 95984 95891 94 2 1 78 98 124 0.998 11.31 5.00 Prom - 98943 98904 40 -5.15 6.07 PlyA - 99524 99519 6 1.05 6.06 Term - 100197 99998 200 1 2 88 42 159 0.140 7.88 6.05 Intr - 102172 102002 171 2 0 33 15 185 0.106 4.69 6.04 Intr - 102315 102252 64 0 1 107 116 47 0.956 6.77 6.03 Intr - 105425 105349 77 0 2 110 106 58 0.999 8.12 6.02 Intr - 107361 107278 84 1 0 93 84 99 0.992 8.87 6.01 Init - 108188 108008 181 2 1 61 91 299 0.963 26.89 6.00 Prom - 110890 110851 40 -5.35 7.00 Prom + 113159 113198 40 -8.05 7.01 Init + 119226 119366 141 2 0 31 94 144 0.531 9.48 7.02 Term + 120680 120733 54 1 0 111 38 54 0.572 -0.72 7.03 PlyA + 120888 120893 6 1.05 8.00 Prom + 124493 124532 40 -3.05 8.01 Init + 131734 131793 60 0 0 88 77 83 0.886 8.50 8.02 Intr + 133681 133801 121 0 1 46 52 104 0.812 1.75 8.03 Intr + 135520 135622 103 2 1 80 90 94 0.977 7.11 8.04 Intr + 136655 136695 41 2 2 112 115 29 0.891 4.95 8.05 Intr + 151079 151222 144 0 0 33 97 78 0.503 2.53 8.06 Intr + 152616 152767 152 1 2 40 69 103 0.481 2.56 8.07 Term + 156139 156237 99 0 0 58 50 101 0.164 0.45 8.08 PlyA + 156477 156482 6 1.05 9.04 PlyA - 156671 156666 6 1.05 9.03 Term - 159347 159269 79 1 1 82 42 60 0.051 -2.94 9.02 Intr - 159729 159623 107 0 2 110 98 38 0.070 5.09 9.01 Init - 174768 174700 69 0 0 61 91 64 0.336 5.10 9.00 Prom - 186469 186430 40 -4.15 10.03 PlyA - 186499 186494 6 1.05 10.02 Term - 194733 194600 134 1 2 30 32 181 0.737 3.97 10.01 Init - 195997 195856 142 1 1 37 91 106 0.774 4.33 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:14842525_15050712|GENSCAN_predicted_peptide_1|65_aa AREEQQEWQQASKEPDSHRCRRDCTSGFKALPPFSLLKTGPLSAQFQSPVNPNDDCQETA YVELT >gi568815586r:14842525_15050712|GENSCAN_predicted_CDS_1|198_bp gccagagaggagcagcaggaatggcagcaggcctccaaggagccagattctcatcgttgc aggagagattgtacctctgggttcaaagctcttcccccattttctcttctgaagacaggc cctctgtccgcacagttccagtctccagtgaatcctaatgacgactgtcaggagactgct tatgtagagctgacataa >gi568815586r:14842525_15050712|GENSCAN_predicted_peptide_2|58_aa MNNKNWLGVTCFKGFYILSGMIRIRLANSNAYKVQAENIHDQTSLRVKGVNYEYMKNA >gi568815586r:14842525_15050712|GENSCAN_predicted_CDS_2|177_bp atgaacaataaaaactggctaggggtgacatgctttaaaggtttctatatcctaagtgga atgattcggatacggctggcaaactcaaatgcctacaaagtccaggcggagaatatacat gatcaaacgagtctaagggtgaaaggtgtgaattatgagtacatgaagaatgcttga >gi568815586r:14842525_15050712|GENSCAN_predicted_peptide_3|216_aa MNIDAKVLNKILANQIQQHIKKLIHHDQVSFIPGMQGWFNIRKSINVIHHINRTNDKNHM IISIDAEKAFDKIQQPFMLKTLNKLRIDGTSLKTIRAIYDKPTANNMLNGKELEAFPLKT GTRQGCPLSPFLINTVLEVLARAIRQEKEIKRIQLGKEEVKLSLFADDMTIYLENPIVSA QNLLKLIRNFSKILGYKINVQKSQAFLYTNNRQRAK >gi568815586r:14842525_15050712|GENSCAN_predicted_CDS_3|651_bp atgaacattgatgcgaaagtcctcaataaaatactggcaaaccaaatccagcagcacatc aaaaagcttatccaccacgatcaagtcagcttcatccctgggatgcaaggctggttcaac atacgcaaatcaataaacgtaatccatcacataaacagaaccaacgacaaaaaccacatg attatctcaatagatgcagaaaaggcctttgacaaaattcaacagcctttcatgttaaaa actctcaataaactacgaattgatggaacgtctctcaaaacaataagagctatttatgac aaacccacagccaataacatgctgaatgggaaagaactggaagcattccctttgaaaact ggcacaagacagggatgccctctctcaccattcctaatcaacacagtgttggaagttctg gccagggcaatcaggcaagagaaagaaataaaacgtattcaattaggaaaagaggaagtc aaattgtccctgtttgcagatgacatgactatatatttagaaaaccccatcgtctcagcc caaaatctccttaagctgataaggaacttcagcaaaatcttgggatacaaaatcaatgtg caaaaatcacaagcattcctatacaccaataacagacagagagccaaatga >gi568815586r:14842525_15050712|GENSCAN_predicted_peptide_4|262_aa MVKGSIQQEELTILNVYAPNTGAARFIKQIFRDLQRDLDSHTIIMGEFNTPLSILDRSTR QKVNKDIQDLNSALQQAYLVDIYRTFHPKSIEYTFFSAPHRTYSKIDHTVGSKALFSKCK RTEITTNCLSDHRAIKLELGIKKITQNCITTWKLNNVLLNDYWGNNEMKAEIKMFIETNE NKDTTYQNLWDTMKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELEKQEQTHSKASRRQ EITKIRAELKETETQKTLQKNQ >gi568815586r:14842525_15050712|GENSCAN_predicted_CDS_4|789_bp atggtaaagggatcaattcaacaagaagagctaactatcctaaatgtatatgcacccaat acaggagcagccagattcataaagcaaatctttagagacctacaaagagacttagattcc cacacaataataatgggagagtttaacaccccactgtcaatattagacagatctacgaga cagaaggttaacaaagatatccaggacttgaactcagctctgcaacaagcatacctagta gacatctacagaactttccaccccaaatcaatagaatatacattcttctcagcaccacat cgcacttattccaaaattgaccacacagttggcagtaaagccctcttcagcaaatgtaaa agaacagaaatcacaacaaactgtctctcagaccacagagcaatcaaattagaactcggg attaagaaaatcactcaaaactgcataactacatggaaactgaacaacgtgctcctgaat gactactggggaaataatgaaatgaaggcagaaataaagatgttcattgaaaccaatgag aacaaagacacaacataccagaatctctgggatacaatgaaagcagtgtgtagagggaaa tttatagcactaaatgcccacaagagaaagcaggaaagatctaaaatcgacaccctaaca tcacaattaaaagaactagagaagcaagagcaaacacattcaaaagctagcagaaggcaa gaaattactaagatcagagcagaactgaaggagacagagacacaaaaaacacttcaaaaa aatcaatga >gi568815586r:14842525_15050712|GENSCAN_predicted_peptide_5|525_aa MEAAPSRFMFLLFLLTCELAAEVAAEVEKSSDGPGAAQEPTWLTDVPAAMEFIAATEVAV IGFFQDLEIPAVPILHSMVQKFPGVSFGISTDSEVLTHYNITGNTICLFRLVDNEQLNLE DEDIESIDATKLSRFIEINSLHMVTEYNPVGGKEGSTQLHENEMTVIGLFNSVIQIHLLL IMNKASPEYEENMHRYQKAAKLFQGKILFILVDSGMKENGKVISFFKLKESQLPALAIYQ TLDDEWDTLPTAEVSVEHVQNFCDGFLSGKLLKPYMARPRELEKEKVKHEHLLSVELLEK SPWNPGLQHAMIAPDLCTTPDHFLKYLKPRIQEVLKASLALGMIAETLRELSELVTSTFL DALETEMLLSAHKNGQPRASLPTAATQDPETDLQDETMKSLILLAILAALAVVTLCYGEW QKEENFGFDIVSVLSLNWHRAQESHESMESYELNPFINRRNANTFISPQQRWRAKVQERI RERSKPVHELNREACDDYRLCERYAMVYGYNAAYNRYFRKRRGTK >gi568815586r:14842525_15050712|GENSCAN_predicted_CDS_5|1578_bp atggaagctgccccgtccaggttcatgttcctcttatttctcctcacgtgtgagctggct gcagaagttgctgcagaagttgagaaatcctcagatggtcctggtgctgcccaggaaccc acgtggctcacagatgtcccagctgccatggaattcattgctgccactgaggtggctgtc ataggcttcttccaggatttagaaataccagcagtgcccatactccatagcatggtgcaa aaattcccaggcgtgtcatttgggatcagcactgattctgaggttctgacacactacaac atcactgggaacaccatctgcctctttcgcctggtagacaatgaacaactgaatttagag gacgaagacattgaaagcattgatgccaccaaattgagccgtttcattgagatcaacagc ctccacatggtgacagagtacaaccctgtgggtgggaaggaaggatctacacagttgcat gaaaacgagatgactgtgattgggttattcaacagcgtaattcagattcatctcctcctg ataatgaacaaggcctccccagagtatgaagagaacatgcacagataccagaaggcagcc aagctcttccaggggaagattctctttattctggtggacagtggtatgaaagaaaatggg aaggtgatatcatttttcaaactaaaggagtctcaactgccagctttggcaatttaccag actctagatgacgagtgggatacactgcccacagcagaagtttccgtagagcatgtgcaa aacttttgtgatggattcctaagtggaaaattgttgaaaccttacatggcgagacctaga gaattggaaaaggagaaagttaaacacgaacaccttttgagtgttgaacttttggaaaaa agcccttggaatccgggactccaacatgcaatgattgccccagatctctgtactacacct gaccactttctgaaatatttgaagccgaggatacaggaagttcttaaagccagtctggct ctggggatgatcgcagaaaccctaagagaattatctgagctggtcactagcacctttctt gatgctttggaaacagagatgctcctgagtgcccataagaatggtcagcccagagcctct ctccctactgctgctacacaagaccctgagactgacctgcaggacgaaaccatgaagagc ctgatccttcttgccatcctggccgccttagcggtagtaactttgtgttatggagagtgg cagaaagaagaaaacttcggctttgatatcgtttcagttctctctctgaactggcatcgt gcccaggaatcacatgaaagcatggaatcttatgaacttaatcccttcattaacaggaga aatgcaaataccttcatatcccctcagcagagatggagagctaaagtccaagagaggatc cgagaacgctctaagcctgtccacgagctcaatagggaagcctgtgatgactacagactt tgcgaacgctacgccatggtttatggatacaatgctgcctataatcgctacttcaggaag cgccgagggaccaaatga >gi568815586r:14842525_15050712|GENSCAN_predicted_peptide_6|258_aa MTEKAPEPHVEEDDDDELDSKLNYKPPPQKSLKELQEMDKDDESLIKYKKTLLGDGPVVT DPKAPNVVVTRLTLVCESAPGPITMDLTGDLEALKKETIVLKEGSEYRVKIHFKVNRDIV SGLKYVQHTYRTGVKAIDSDENTIKAVDAGSRKRHKYTSDIDIDKANAQGTSFRLHQLLT FSSQLFGFNIITLDKATFMVGSYGPRPEEYEFLTPVEEAPKGMLARGTYHNKSFFTDDDK QDHLSWEWNLSIKKEWTE >gi568815586r:14842525_15050712|GENSCAN_predicted_CDS_6|777_bp atgactgaaaaagccccagagccacatgtggaggaggatgacgatgatgagctggacagc aagctcaattataagcctccaccacagaagtccctgaaagagctgcaggaaatggacaaa gatgatgagagtctaattaagtacaagaaaacgctgctgggagatggtcctgtggtgaca gatccgaaagcccccaatgtcgttgtcacccggctcaccctggtttgtgagagtgccccg ggaccaatcaccatggaccttactggagatctggaagccctcaaaaaggaaaccattgtg ttaaaggaaggttctgaatatagagtcaaaattcacttcaaagtgaacagggatattgtg tcaggcctgaaatacgttcagcacacctacaggactggggtgaaagctatagactcagat gagaatacaataaaagctgtggatgccggctctaggaaaaggcacaaatacacctctgat atagacattgacaaggccaatgctcaggggacctcgtttagactgcatcagctgctaacg ttcagcagtcagctttttggattcaatattatcactctggataaagcaacatttatggtt ggcagctatggacctcggcctgaggagtatgagttcctcactccagttgaggaggctccc aagggcatgctggcgcgaggcacgtaccacaacaagtccttcttcaccgacgatgacaag caagaccacctcagctgggagtggaacctgtcgattaagaaggagtggacagaatga >gi568815586r:14842525_15050712|GENSCAN_predicted_peptide_7|64_aa MCEHDSPCDVEMHECTRKSVEPIAMGIMNNCEGISDNQVIAVHKPMESPVYIGGYGSTPS QKNL >gi568815586r:14842525_15050712|GENSCAN_predicted_CDS_7|195_bp atgtgcgagcatgacagcccgtgtgacgtggagatgcatgaatgtacacgcaagagtgtg gagcctatagctatgggtattatgaataattgtgagggtatttctgataatcaggtcatt gcagtgcataagcctatggagagcccagtgtatatcggagggtatgggagcaccccttcc cagaaaaatctgtaa >gi568815586r:14842525_15050712|GENSCAN_predicted_peptide_8|239_aa MRLAFGDVYGQQDWLPMQEKVLRLEGQPHEASPVGDKGRNHSMSRWSRVLANLTIPGTGK ASNQGPTTPRKGPPKFKQRQTRQFKSKPPKKGVKGFGDDIPGMEGLGTVTKLTQEMSVKD IVMVNFRCHFDWLERCQMAGDALFLGVSVRVLPQEIVMASPEGASKQDKADSPQDPPSPP LFASSRITRLKSQQTPKGEIESVIHEENEGWGSGIGISSDILVSSTDVQIGYSTKRKFW >gi568815586r:14842525_15050712|GENSCAN_predicted_CDS_8|720_bp atgaggttagcctttggtgatgtttatggacaacaggactggcttccaatgcaggaaaag gttctgagactggaaggacagcctcatgaagctagtccagtaggggataaaggaaggaat cacagcatgagcaggtggtccagggtcctggccaatctcaccatccctggtactggcaag gcttcaaaccagggtcctaccaccccacgcaaaggccctcccaagttcaagcagaggcag actcgccaattcaagagtaaacctccaaagaaaggtgtgaaaggatttggagatgacatt ccaggaatggaggggctaggaacagtcaccaaacttacacaagagatgtcagttaaagac attgtgatggttaattttagatgtcactttgactggcttgagagatgccagatggctggt gatgcattgtttctgggtgtgtctgtgagggtgcttccacaggagattgtaatggcctcc cctgagggagcttccaagcaagacaaggctgattctcctcaggacccaccttcacctcct ctctttgcttctagccgcataaccagactcaagtcccagcagacccctaaaggtgagata gaaagtgtgatccacgaggagaatgaaggatggggatctggtataggcatctcctctgat atactggtgtccagtactgatgtccagataggctattccaccaagcggaaattctggtag >gi568815586r:14842525_15050712|GENSCAN_predicted_peptide_9|84_aa MVLEKRDKGKSKTLKKCEEFILQPSSKHPVPPSQRSLTGSQNSISYHPSPSAIKLSKERH FLGSGVGITVNNTGKFPVLVEVML >gi568815586r:14842525_15050712|GENSCAN_predicted_CDS_9|255_bp atggtgcttgaaaagagggacaagggaaaatccaagacactgaaaaagtgtgaagagttt atcttacagccttccagcaagcatccagtgcctccctctcagagaagcctcacaggcagc caaaactctatttcctatcatcccagccccagtgccataaagctgagtaaggaaaggcat ttcctaggaagtggagtgggtataacagtgaacaacacaggcaaattccctgtccttgtg gaggtgatgttgtaa >gi568815586r:14842525_15050712|GENSCAN_predicted_peptide_10|91_aa MSTSLSAPSSGVPAWLHLLAAQSQMPTGVLPGGCCHRSFTTRPHLTVAGKVNALGGYKFG VSAGKALVATCLLFGGPRLQRWIQKPAQVAA >gi568815586r:14842525_15050712|GENSCAN_predicted_CDS_10|276_bp atgagcacctccctctctgcaccctcttctggggtccctgcctggttgcatctgcttgca gcccagtctcagatgcccactggggtgcttcctggtggctgctgccataggtcttttact accagacctcacctaactgttgcaggcaaggtgaacgcactgggtggttacaaatttggc gtgtcagccgggaaagctcttgtggctacctgcctgttgttcggtggcccccgactccag cgatggatccagaagccagcccaagtggctgcctag