GENSCAN 1.0 Date run: 3-Nov-116 Time: 23:34:20 Sequence gi568815584f:50487830_50732327 : 244498 bp : 38.65% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 1219 1214 6 1.05 1.02 Term - 28363 28212 152 2 2 92 43 166 0.994 9.59 1.01 Init - 37583 37523 61 2 1 46 85 49 0.177 1.86 1.00 Prom - 39533 39494 40 -10.45 2.03 PlyA - 39825 39820 6 1.05 2.02 Term - 40564 40370 195 1 0 113 36 111 0.958 4.83 2.01 Init - 44220 44113 108 0 0 90 115 135 0.494 16.95 2.00 Prom - 46947 46908 40 -5.75 3.00 Prom + 47450 47489 40 -8.55 3.01 Sngl + 49102 49851 750 0 0 46 37 286 0.462 15.22 3.02 PlyA + 52439 52444 6 1.05 4.00 Prom + 57508 57547 40 -4.35 4.01 Init + 61297 61406 110 0 2 100 61 67 0.416 4.94 4.02 Intr + 61455 61546 92 0 2 3 37 143 0.412 -0.58 4.03 Term + 64020 64450 431 1 2 33 32 238 0.889 7.18 4.04 PlyA + 64454 64459 6 1.05 5.03 PlyA - 68286 68281 6 1.05 5.02 Term - 72533 72253 281 0 2 98 46 209 0.652 12.32 5.01 Init - 77288 77240 49 2 1 86 58 42 0.282 0.16 5.00 Prom - 87122 87083 40 -5.45 6.00 Prom + 88140 88179 40 -4.15 6.01 Init + 94466 94520 55 1 1 8 109 12 0.422 -3.20 6.02 Intr + 100002 100249 248 1 2 85 85 285 0.998 24.16 6.03 Intr + 103112 103246 135 1 0 49 86 100 0.907 5.74 6.04 Intr + 103706 103810 105 1 0 98 80 21 0.388 1.79 6.05 Intr + 107747 107803 57 1 0 84 99 78 0.461 6.66 6.06 Intr + 125430 125522 93 2 0 93 76 59 0.768 4.44 6.07 Intr + 126544 126682 139 0 1 33 76 56 0.487 -1.98 6.08 Intr + 132770 132897 128 0 2 64 64 86 0.506 3.28 6.09 Intr + 134014 134070 57 0 0 112 68 29 0.732 1.46 6.10 Intr + 135348 135419 72 2 0 55 94 112 0.980 7.18 6.11 Intr + 138509 138802 294 1 0 50 16 155 0.626 0.78 6.12 Intr + 140202 140633 432 2 0 51 110 374 0.905 28.82 6.13 Term + 144400 144510 111 0 0 82 38 92 0.923 1.18 6.14 PlyA + 147195 147200 6 1.05 7.06 PlyA - 147272 147267 6 1.05 7.05 Term - 147555 147354 202 2 1 48 49 162 0.963 4.18 7.04 Intr - 153064 152921 144 0 0 60 100 186 0.991 15.48 7.03 Intr - 157185 156915 271 1 1 133 60 114 0.990 8.78 7.02 Intr - 177790 177350 441 2 0 81 70 232 0.013 13.20 7.01 Init - 198623 198383 241 2 1 54 37 158 0.080 5.38 7.00 Prom - 210718 210679 40 -4.05 8.07 PlyA - 211328 211323 6 1.05 8.06 Term - 217477 217097 381 1 0 22 49 445 0.156 27.85 8.05 Intr - 217750 217530 221 2 2 48 7 236 0.087 8.60 8.04 Intr - 234311 234163 149 1 2 31 61 118 0.184 2.36 8.03 Intr - 235843 235641 203 1 2 79 25 198 0.831 9.76 8.02 Intr - 238237 238124 114 1 0 109 63 169 0.998 16.22 8.01 Intr - 241894 241694 201 1 0 81 80 209 0.917 17.96 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 119239 119340 102 0 0 47 95 84 0.827 5.29 S.002 Intr + 179904 180016 113 2 2 47 64 139 0.916 5.66 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:50487830_50732327|GENSCAN_predicted_peptide_1|70_aa MKGRDGFESYLEDDQKHLVIDSLPVPNGDKLGLQYGTGNRRKCVDLAYKFKLSGFADMSD VKSVKERNKE >gi568815584f:50487830_50732327|GENSCAN_predicted_CDS_1|213_bp atgaaaggaagggatggatttgagtcatatttagaagatgatcaaaaacacctggtaatc gatagtttaccagttcctaatggagacaagcttggattacagtatggcactggaaataga agaaagtgtgtggatttggcttataagtttaagctgtcaggatttgctgatatgtcagat gtgaagagtgtgaaggaaagaaataaagaatga >gi568815584f:50487830_50732327|GENSCAN_predicted_peptide_2|100_aa MEAPLRPAADILRRNPQQDYELVQRVGSGTYGDVYKVSFEAEALKDSGFEILRNLLLCSS IPQIPSEASIFPENPQEGEFNLILAVLWLKEEEEHKDSFR >gi568815584f:50487830_50732327|GENSCAN_predicted_CDS_2|303_bp atggaggccccgctgcggcctgccgcggacatcctgaggcggaacccgcagcaggactac gaactcgtccagagggtcggcagcggcacctacggggacgtctataaggtatcttttgaa gcagaagctttaaaggactctggctttgagattcttcgtaatttacttttatgttcatca ataccacagatcccctctgaagcttccatctttcctgagaatccccaagagggagagttc aatcttatattggctgttctttggttgaaagaagaggaggaacataaagattcttttaga taa >gi568815584f:50487830_50732327|GENSCAN_predicted_peptide_3|249_aa MEKKIPFSEEKFKQAAEICISNEKLNINSKDNGENAFRACQRSSWQTFPSQARRPRRKKW FRWLGPGSLCCVQPRDLVPCFPAAPAAAERGQHRAGAVASEGASLQPWQLPRGVEPSSTQ KSRIEVWESLPRFQMMYGNAWMSRQKFSAGVVPSRRTSARVVRKGNVGSEPPDRVPTEVL PSGAVRRGPLSSRSQNGGSTDSLHRAPGKAADTQHGFPMKAARGRLYPAKPQGQSCPRPW EPTSCISMT >gi568815584f:50487830_50732327|GENSCAN_predicted_CDS_3|750_bp atggaaaagaaaatcccattttctgaggaaaaattcaagcaggctgcagaaatttgcata agtaacgaaaagctgaatattaattccaaagacaatggggaaaatgccttcagggcgtgt cagaggtcttcatggcagaccttcccatcacaggccaggagacctaggaggaaaaagtgg tttcgttggctaggcccagggtccctgtgctgtgtgcagcctagagacttggtgccctgc tttccagctgctccagctgcagctgaaaggggccaacatagagctggggccgtggcttca gagggtgcaagcctccagccttggcagcttccacgtggtgttgagccttctagtacacag aagtcaagaattgaggtttgggaatctttgcctagatttcagatgatgtatggaaatgcc tggatgtccaggcagaagttttctgcaggggtagtgccctcacggagaacctctgctagg gtggtgcgaaagggaaatgtggggtcagagcccccagacagagttcctactgaggtactg cctagtggagctgtgagaagagggccactgtcctccagatcacagaatggtggatccact gacagcttgcaccgtgcacctggaaaagctgcagacactcaacatggctttcccatgaaa gcagccagagggaggctgtaccctgcaaagccacaggggcagagctgcccaagaccatgg gaaccaacctcttgcatcagcatgacttag >gi568815584f:50487830_50732327|GENSCAN_predicted_peptide_4|210_aa MATEPPTRPTIVYKGTPPIPTRTWYIDGSSKGTQHQWYIIRAATGWATAKAAGISILYAD ILAAVQNYGTYLKVIKAIYDTPTGNTIPNKENLKAFPQRTRTRQEYPLSPLLLNIVLEIL ARAVRQKKEIKGIQIGKEEVKLLLFADDMIMYLENPKDSSKNLDLINEFSKVSGYKINVH KSVALLYTNNDKAENQIKYSIPFTTAAKKN >gi568815584f:50487830_50732327|GENSCAN_predicted_CDS_4|633_bp atggcaacggaaccacctaccaggccaaccattgtatataaggggaccccaccaataccc actaggacctggtacattgatgggtctagcaaaggcacccaacaccaatggtacatcata agagcagccacagggtgggccacagcaaaggcagcaggcatctccatcctctatgcagat attctagcagctgttcagaactatgggacatacctcaaagtaataaaagccatctatgac acaccaacaggcaacacaataccaaacaaggaaaatttgaaagcattcccccagagaact agaacaagacaagaatacccactttcaccacttctactcaacatagtactggaaatccta gccagagcagtcagacaaaagaaagaaataaagggtattcaaattggaaaagaggaagtc aaactgttgctgttcgccgatgatatgatcatgtacctagaaaaccctaaagactcatcc aaaaatctagacctgataaatgagttcagtaaagtttcgggatacaaaatcaatgtacac aaatcagtagcactgctatataccaacaatgacaaagctgagaatcaaatcaagtactca atcccttttacaacagctgcaaaaaaaaattaa >gi568815584f:50487830_50732327|GENSCAN_predicted_peptide_5|109_aa MGFHRVTQAGLELLTSEAESGGPEDPVQAAEVLICYSPQLFLSLRFLAMVVQELVAVRRC HCARCLLRGRRLLAVGGSVPALGAVRSDSEDVAVALVGRSGEEKGGESR >gi568815584f:50487830_50732327|GENSCAN_predicted_CDS_5|330_bp atggggtttcaccgtgttacccaggctggtctcgaactcctgacctcagaagcagaaagt ggagggccagaagaccccgtgcaggctgcagaagttctcatttgctactcaccccaactg tttctgtccctgcggttcttggccatggtggtccaggagctggtggcggtgaggcgctgt cactgcgctcgctgccttctccggggccgcaggttgctggcggtgggaggctccgtgccc gcgctgggcgcagttcgctcagactctgaggatgttgctgtggcgctggtgggaaggagt ggggaggaaaagggaggagaaagcagatga >gi568815584f:50487830_50732327|GENSCAN_predicted_peptide_6|641_aa MTVKNHLTKKTPGSQSFTGGFSEKTYEWSSEEEEPVKKAGPVQVLIVKDDHSFELDETAL NRILLSEAVRDKEVVAVSVAGAFRKGKSFLMDFMLRYMYNQESVDWVGDYNEPLTGFSWR GGSERETTGIQIWSEIFLINKPDGKKVAVLLMDTQGTFDSQSTLRDSATVFALSTMISSI QLFTEYGRLAMEETFLKPFQSLIFLVRDWSFPYEFSYGADGGAKFLEKRLKVSGNQHEEL QNVRKHIHSCFTNISCFLLPHPGLKVATNPNFDGKLKEIDDEFIKNLKILIPWLLSPESL DIKEINGNKITCRGLVEYFKAYIKIYQGEELPHPKSMLQATAEANNLAAVATAKDTYNKK MEEEAKISTRMWKKLIPTLLDDFERGPRLQWRKEVHVWCKQQENLKVEPEDVTQFCILLT DKELLPMDKQRQWFLEMESTPSEDAVNSVEMTTKGLEYFLLICGGDKPFLAPNDLQTKHL QLKEESVKLFRGVKKMGGEEFSRRYLQQLESEIDELYIQYIKHNDSKNIFHAARTPATLF VVIFITYVIAGVTGFIGLDIIASLCNMIMGLTLITLCTWAYIRYSGEYRELGAVIDQVAA ALWDQALYKLYSAAATHRHLYHQAFPTPKSESTEQSEKKKM >gi568815584f:50487830_50732327|GENSCAN_predicted_CDS_6|1926_bp atgacagttaaaaatcatctcacaaagaaaacaccaggttcacaaagttttacaggtgga ttttcggaaaagacatatgaatggagctcagaagaggaggagccagtgaaaaaggcagga ccagtccaagtcctcattgtcaaagatgaccattcctttgagttagatgaaactgcatta aatcggatccttctctcggaggctgtcagagacaaggaggttgttgctgtatctgttgct ggagcatttagaaaaggaaaatcattcctgatggacttcatgttgagatacatgtacaac caggaatcagttgattgggttggagactacaatgaaccattgactggtttttcatggaga ggtggatctgagcgagagaccacaggaattcagatatggagtgaaatcttccttatcaat aaacctgatggtaaaaaggttgcagtgttattgatggatactcagggaacctttgatagt cagtcaactttgagagattcagccacagtatttgcccttagcacaatgatcagctcaata cagcttttcactgagtatggcagactggcaatggaggaaacattcctgaagccatttcag agtctgatatttcttgttcgagactggagtttcccatacgaattttcatatggagccgat ggtggtgccaaattcttggaaaaacgcctcaaggtctcagggaaccagcatgaagaacta cagaacgtcagaaaacacatccattcctgtttcaccaacatttcctgttttctgctacct catcctggcttaaaagtagctaccaatccaaactttgatggaaaattgaaagaaatagat gatgaattcatcaaaaacttgaaaatactgattccttggctacttagtcccgagagccta gatattaaagagatcaatgggaataaaatcacctgccggggtctggtggagtacttcaag gcttatataaagatctatcaaggtgaagaattaccacatcccaaatccatgttacaggcc acagcagaagctaacaatttagcagccgtggcaactgccaaggacacatacaacaaaaaa atggaagaggaggcaaaaatatcaacaagaatgtggaaaaagttgataccaacccttctg gatgactttgagaggggtccaagacttcagtggaggaaggaagtgcatgtgtggtgcaaa cagcaagagaacttaaaagtggagcctgaagatgtgactcaattttgtattctcttaacg gataaggagttgcttcctatggataagcaaagacagtggtttctggagatggaatctact cctagcgaagatgctgtgaacagtgttgaaatgacaacaaagggtttagaatatttttta ttgatttgtggtggtgacaaaccatttctggccccaaatgacttgcagaccaaacacctg caacttaaggaagaatctgtgaagctattccgaggggtgaagaagatgggtggggaagaa tttagccggcgttacctgcagcagttggagagtgaaatagatgaactttacatccaatat atcaagcacaatgatagcaaaaatatcttccatgcagctcgtaccccagccacactgttt gtagtcatctttatcacatatgtgattgctggtgtgactggattcattggtttggacatc atagctagcctatgcaatatgataatgggactgacccttatcaccctgtgcacttgggca tatatccggtactctggagaataccgagagctgggagctgtaatagaccaggtggctgca gctctgtgggaccaggctttgtacaagctttacagtgcagcagcaacccacagacatctg tatcatcaagctttccctacaccaaagtcggaatctactgaacaatcagaaaagaaaaaa atgtaa >gi568815584f:50487830_50732327|GENSCAN_predicted_peptide_7|432_aa MDLQYIWQQTLQWKHYRPGECGMTSKVLKEKNFYPGILYPMKTSFKHEEKKTFTDKQKPR DFINTRTALQEMLKGVLLPENLMPSFIRHGPTIPRRTDICLPDSSPNAFSTSGDVVSRNQ SFLRTPIQRTPHEIMRRESNRLSAPSYLARSLADVPREYGSSQSFVTEVSFAVENGDSGS RYYYSDNFFDGQRKRPLGDRAHEDYRYYEYNHDLFQRMPQNQGRHASGIGRVAATSLGNL TNHGSEDLPLPPGWSVDWTMRGRKYYIDHNTNTTHWSHPLEREGLPPGWERVESSEFGTY YVDHTNKKAQYRHPCAPSVPRYDQPPPVTYQPQQTERNQSLLVPANPYHTAEIPDWLQVY ARAPVKYDHILKWELFQLADLDTYQGMLKLLFMKELEQIVKMYEAYRQALLTELENRKQR QQWYAQQHGKNF >gi568815584f:50487830_50732327|GENSCAN_predicted_CDS_7|1299_bp atggatctccagtacatctggcagcagactcttcagtggaaacattacaggccaggagag tgtggcatgacatctaaagtgctgaaagagaaaaacttttaccctggaatactatatcca atgaaaacatctttcaaacatgaagaaaaaaagactttcacagataaacaaaagccgagg gatttcatcaacaccagaactgccctacaagaaatgctaaagggagtacttctaccagaa aatcttatgccttcattcatccggcatggtccaacaattccaagacgaactgatatctgt cttccagattcaagccctaatgccttttcaacttctggagatgtagtttcaagaaaccag agtttccttagaactccaattcaaagaacacctcatgaaataatgagaagagaaagcaac agattatctgcaccttcttatcttgccagaagtctagcagatgtccctagagagtatggt tcttctcagtcatttgtaacggaagttagttttgctgttgaaaatggagactctggttcc cgatattattattcagacaatttttttgatggtcagagaaagcggccacttggagatcgt gcacatgaagactacagatattatgaatacaaccatgatctcttccaaagaatgccacag aatcaggggaggcatgcttcaggtattgggagagttgctgctacatctttaggaaatttg actaaccatggttctgaagatttaccccttcctcctggctggtctgtggactggacaatg agagggagaaaatattatatagatcataacacaaatacaactcactggagccatcctctt gagcgagaaggacttcctcctggatgggaacgagttgagtcatccgaatttggaacctat tatgtagatcacacaaataagaaggcccaatacaggcatccctgtgctcctagtgtacct cggtatgatcaaccacctcctgtcacataccagccacagcaaactgaaagaaatcagtcc cttctggtacctgcaaatccatatcatactgcagaaattcctgactggcttcaggtttac gcacgagcccctgtgaaatatgaccacattctgaagtgggaactcttccagctggctgac ctggatacataccagggaatgctaaagttgctcttcatgaaagaattggagcagattgtt aaaatgtatgaagcatacagacaagcccttcttacagagttggaaaaccgaaagcagaga cagcagtggtatgcccaacaacatggaaaaaatttttga >gi568815584f:50487830_50732327|GENSCAN_predicted_peptide_8|422_aa MQHLRSTATPSPSPHAWDLQLLQQQACPMVPREQFLQLQRQLLQAERINQHLQEELENRT SETNTPQGNQEQLVTVMEERMIEVEQKLKLVKRLLQEKVNQLKEQLCKNTKADAMVKDLY VENAQLLKALEVTEQRQKTAEKKNYLLEEKIASLSNIVRNLTPAPLTSTPPLSHFRQQLV EDLLGAGHFPSSSQQPACFHETIIRKVRFDIFLEGQAHLFTKGKSSPRSGPSSEGPKDGK ERVVQLQNWIMIFQKQPASPLKAVTESRRPLSSLFPQSQGKKQPPLERGHSTVATQNPQR VTNRSSVSSVTSASAEGQPDHAHVMPHECYLCDYAAVDSRSLKKHLRIHSDEQPYKCQLC PYVSRNSSQLTVHLRSHTGDISFQCWLCSAKFKISSDLKRHMIVHWGRSLSSASSAATSA AP >gi568815584f:50487830_50732327|GENSCAN_predicted_CDS_8|1269_bp atgcagcacctgaggtccactgcgacgcctagcccgtcccctcatgcttgggatttgcag ctgctccagcagcaagcctgtccgatggtgcccagggagcagtttctgcagcttcaacgc cagctgctgcaggcagaaaggataaaccagcacctgcaggaggaacttgaaaacaggacc tccgaaaccaacacaccacagggaaaccaggaacaactggtaactgtcatggaggaacga atgatagaagttgaacagaaactgaaactagtgaaaaggcttcttcaagagaaagtgaat cagctcaaagaacaactctgcaagaacactaaggcagacgcaatggtgaaggacttgtat gttgaaaatgcccagttgttgaaagctctggaagtgactgaacagcgacagaaaacagca gagaagaaaaattacctcctggaggagaagattgccagcctcagtaatatagttaggaat ctgacaccagcgccattgacttctacacctcctttgagccacttccgccagcagttggtg gaagacttactaggtgcagggcactttccaagttcatcacaacaacctgcttgttttcat gagacaataatccgaaaagttcgctttgatatattcctggagggccaagcccatctattt acaaaagggaaatcctcaccaaggagcggaccttcgagtgaaggcccgaaggatgggaag gagcgagttgtgcagcttcagaactggataatgatattccaaaagcaacctgcctctcca ctgaaagcagtgactgaaagcagaaggcccctgtcatcactcttccctcagagtcaaggg aagaaacagccgcccttggagagaggacattcgactgttgctacccagaatccacaacgg gtgacgaaccgctcaagtgtgagttctgtgacaagtgcttcagcggaaggacaacctgac catgcacatgtgatgccgcacgagtgttacctgtgtgactatgctgccgtggacagcagg agcctcaagaagcacctacggatccactcggacgagcagccatacaaatgccagctctgc ccctatgtcagccgcaactccagccagctcaccgtccacctgcgatcccacactggggat atctccttccagtgctggctctgtagcgccaagttcaaaatcagctcagacttgaaaagg cacatgatcgtgcactgggggagaagcctttcgagtgcgagttctgcagcgacgtccgct gcaccatga