GENSCAN 1.0    Date run: 5-Nov-116    Time: 05:36:00

Sequence gi568815586r:3712126_3930018 : 217893 bp : 41.64% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +    246    307   62  2  2   95   67    70 0.617   6.37
 1.02 Intr +  12048  12159  112  0  1   96   62   115 0.901   9.16
 1.03 Term +  13144  13266  123  0  0   77   54   110 0.915   3.90
 1.04 PlyA +  14014  14019    6                               1.05

 2.06 PlyA -  14601  14596    6                               1.05
 2.05 Term -  21599  21372  228  2  0  -45   43   253 0.005   3.15
 2.04 Intr -  29699  29560  140  0  2   20   75   141 0.654   5.26
 2.03 Intr -  30248  30102  147  0  0   34  115    65 0.074   3.09
 2.02 Intr -  33308  33000  309  0  0   36   38   240 0.076   9.16
 2.01 Init -  38878  38779  100  1  1   83   77    76 0.595   6.48
 2.00 Prom -  39884  39845   40                              -9.25

 3.00 Prom +  39896  39935   40                             -10.65
 3.01 Init +  40448  40508   61  1  1   77   97    97 0.995  10.96
 3.02 Term +  40750  41342  593  2  2   28   43   261 0.718   9.30
 3.03 PlyA +  41449  41454    6                               1.05

 4.05 PlyA -  43234  43229    6                               1.05
 4.04 Term -  44181  44039  143  1  2   75   49   105 0.095   2.41
 4.03 Intr -  64660  64634   27  2  0  142   49    16 0.043   0.27
 4.02 Intr -  65271  65205   67  0  1   86   48    47 0.032  -1.94
 4.01 Init -  72551  72486   66  2  0   76   69    88 0.159   6.82
 4.00 Prom -  73531  73492   40                              -7.25

 5.02 PlyA -  73718  73713    6                               1.05
 5.01 Sngl -  74505  74065  441  0  0   66   34   202 0.541   8.60
 5.00 Prom -  86058  86019   40                              -4.05

 6.09 PlyA -  87601  87596    6                               1.05
 6.08 Term - 100314  99998  317  1  2   63   34   224 0.993   8.62
 6.07 Intr - 102063 101912  152  2  2   84   86    79 0.974   6.19
 6.06 Intr - 109878 109748  131  0  2   66   94    67 0.908   3.67
 6.05 Intr - 110032 109960   73  0  1   66  110    40 0.896   2.49
 6.04 Intr - 114108 114033   76  1  1   93   80    14 0.827  -1.25
 6.03 Intr - 116905 116785  121  1  1   64   91    61 0.898   3.15
 6.02 Intr - 117893 117765  129  2  0   92   82    91 0.716   8.87
 6.01 Init - 122098 122075   24  1  0   58  100    36 0.375  -0.37
 6.00 Prom - 125573 125534   40                              -8.45

 7.00 Prom + 125590 125629   40                              -6.65
 7.01 Init + 127177 128341 1165  0  1  103   92  1059 0.886 101.63
 7.02 Intr + 128870 129258  389  0  2   44   27   251 0.234   8.18
 7.03 Term + 129361 130014  654  0  0   71   48   362 0.635  23.50
 7.04 PlyA + 130463 130468    6                               1.05

 8.00 Prom + 138543 138582   40                              -6.55
 8.01 Init + 140046 140191  146  2  2   88   81   161 0.302  15.04
 8.02 Intr + 142891 142988   98  1  2   28   40    92 0.036  -2.87
 8.03 Intr + 143412 143579  168  1  0   -1   86   155 0.016   5.40
 8.04 Term + 155171 155283  113  0  2    8   55   133 0.005  -0.26
 8.05 PlyA + 155457 155462    6                               1.05

 9.05 PlyA - 156050 156045    6                               1.05
 9.04 Term - 161622 161425  198  0  0   60   37   127 0.391   1.22
 9.03 Intr - 180615 180454  162  0  0   54   92    51 0.446   1.35
 9.02 Intr - 181946 181756  191  0  2  -28   89   183 0.293   5.18
 9.01 Init - 198032 197780  253  2  1  106   89    85 0.334   8.20
 9.00 Prom - 201940 201901   40                              -6.45

10.05 PlyA - 202431 202426    6                               1.05
10.04 Term - 206289 206109  181  2  1   92   38    93 0.742   0.80
10.03 Intr - 208707 208604  104  0  2   78   87   158 0.871  12.75
10.02 Intr - 212539 212442   98  2  2  107   66    58 0.738   4.31
10.01 Intr - 216466 216308  159  2  0   18   81    95 0.341   0.84

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------
S.001 Sngl -  21614  21372  243  2  0   45   43   258 0.917  11.83

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:3712126_3930018|GENSCAN_predicted_peptide_1|98_aa
MVRTSAWLLGKPQGALTHGRRPYYTQGELSAPRVLALSEFTVQNNTWEMHAQDPEMRLFW
RPEVLNQGDCGAVSPLKAPAENLLLASSGSGTSRCSLA
>gi568815586r:3712126_3930018|GENSCAN_predicted_CDS_1|297_bp
atggtgcgaacatctgcttggttactggggaagccccagggagctcttactcatggcaga
aggccgtattatactcagggtgagctctctgctcccagagtattggccctttctgagttc
actgttcaaaacaacacctgggaaatgcacgcacaagatccagaaatgagattgttctgg
aggccagaagtcctaaatcaaggtgactgcggcgctgtgtcccctctaaaggctcctgca
gagaatctgctccttgcttcttccggctctggtacctccaggtgttccttggcttga
>gi568815586r:3712126_3930018|GENSCAN_predicted_peptide_2|307_aa
MMAIWIPELEWLVMPLTEMEPQVWGVIGDASRCGRFCEQDSSSATIPRGGHGGIEGGGVL
GSSFLWVVSSESRSEIAIGEIPSGITHCSAEEPLDLSVVPWPAVAFWFSGLLGVWIEQTP
RIQIIHRMEILTHRAFGWRWYSEPCRRCQRAIEWKKMGPPSETHPLLIQLSTRKSQPKPN
EQCPDDVSHVSYPPALSHSECKCRCDVTPNAVPLGTGPQKLEAGRQESQKAQDIYRMCNK
KSGGRVTLVLSDPVVNDVIKGPDSVLLFCLRLPPHGHKVAAEVAVNTSDGTASSRGKEPG
KHFSEHP
>gi568815586r:3712126_3930018|GENSCAN_predicted_CDS_2|924_bp
atgatggcgatttggattccagagttagagtggctggtgatgccactgacagagatggaa
ccccaggtgtggggggtgattggagatgcgtcacgctgtggcaggttttgtgagcaagac
agcagcagcgccaccatccccagaggaggacacggtgggattgagggaggaggagttctt
ggaagttcatttctgtgggtagtgtcatctgaatcacggtcagaaattgccattggggag
ataccatcagggataacacattgttcagcagaagaacctctagatctttcagtggtgcca
tggcctgcggtagctttttggttttcgggtcttctgggagtttggattgaacagacaccc
cggatacagatcatccacaggatggagatcttgactcacagggcctttggatggagatgg
tattcggaaccctgtagaaggtgccagagagctatagagtggaagaagatgggaccaccc
tcagagactcaccctctccttatacagttatccaccaggaaaagccaaccaaaaccgaat
gagcaatgcccagatgatgtaagccatgtctcctatcctcctgccctgtcccattctgaa
tgcaaatgccgatgtgatgtgactcctaatgctgttcctttaggaactggtccacagaag
ctggaagcgggcagacaggaaagtcagaaagctcaggatatttatcgcatgtgtaataag
aagtctggaggaagggtgacattggtgttgtctgatccagtggtcaacgatgtcatcaag
ggcccggattcagtccttctgttctgcctcaggcttcctcctcatggtcacaaagtggct
gcagaagttgcagtcaacacatcagatggcacagcatccagcagaggaaaagagccagga
aaacacttctccgagcacccctag
>gi568815586r:3712126_3930018|GENSCAN_predicted_peptide_3|217_aa
MATVVDAACVARAPSSTDLQGHTASIQALCEFHVRQTRAHTSQPEEQAGSAAPRPPRPPT
LNLPPRYLGRVGFPWTCPRRPLGSQLSSSRLLRPTLSAPSRAGWTSPSAELNGKRRPLDA
LRDVRAPSGGVRACRRDLRPPGRLGGEGGAAADLAGLTGRASGVPLLPRPLAPIQAPKVG
HQSGPRWLTGEVVSGSAPPKTRQQPGVTPAVTSGSRN
>gi568815586r:3712126_3930018|GENSCAN_predicted_CDS_3|654_bp
atggcaacagttgtggatgctgcctgcgtggccagggctcctagtagcactgacctgcaa
gggcacacagcctccatccaggccctctgcgagttccatgtgcggcagacacgcgctcac
acttcgcagccggaggagcaggcaggaagcgccgccccccggccgccccggccgcccacc
ctcaaccttcctccacgttaccttgggcgtgtaggcttcccgtggacctgcccgcgccgc
ccgctcggctctcagctgtcaagcagccgcctgctccgcccgactctttccgctccgagc
cgcgcgggctggaccagcccctcagcggagctgaacgggaagcggcgccccctggacgcc
ctccgggacgtgcgcgcgccaagcggtggggtccgggcctgcaggcgcgacctgaggccg
ccagggaggctgggtggcgagggcggggcggctgccgacttagccggcctgactggtcgg
gcttccggggtcccgctgctccctaggccccttgccccgattcaggctcccaaggtgggg
catcagtccggccctcgctggctcactggagaggtggtgtctgggagtgcacccccgaaa
acccggcagcagcccggcgtcactccggctgttacctctgggagcagaaactaa
>gi568815586r:3712126_3930018|GENSCAN_predicted_peptide_4|100_aa
MDEGIALYVSYVTEEEERYYGKMGLFSFHPDFTVSRNAREEDGTAIPPCAAMRVWITLCA
LFWKLTGSGQAAESRVHWQSGDRGHEIRDMSPGLEDMRTA
>gi568815586r:3712126_3930018|GENSCAN_predicted_CDS_4|303_bp
atggatgaaggtatagcgttgtatgtttcttatgttacagaggaagaggaacgatactat
gggaagatggggctcttctcatttcatcctgatttcacagtctccagaaatgccagggaa
gaagatggcacagctatacccccttgtgctgctatgagagtgtggatcactttatgtgcc
cttttctggaagttgactgggtctgggcaggcagcagaaagtagagtccactggcagtct
ggtgatcggggacatgagatcagggacatgagtcctgggttggaggacatgagaacagct
tag
>gi568815586r:3712126_3930018|GENSCAN_predicted_peptide_5|146_aa
MDPFLTPYTKIYSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKDFMTKTPKAMATKARI
DKSDLTKLKSFCTAKETIIRVNRQPTEWEKIFAIYPFGNDFLNIASKAQAIKAKIGKLDY
IKLKSLCTARKQSEKAAYEMEEHICK
>gi568815586r:3712126_3930018|GENSCAN_predicted_CDS_5|441_bp
atggatcccttccttacaccttatacaaaaatttactcaagatggattaaagacttaaac
gtaagacctaaaaccataaaaaccctagaagaaaacctaggcaataccattcaggatata
ggcatgggcaaagacttcatgactaaaacaccaaaagcaatggcaacaaaagccagaata
gacaaatcggatctgactaaactaaagagcttctgcacagcaaaagaaactatcatcaga
gtgaataggcaacctacagaatgggagaaaatttttgcaatctatccatttggcaatgat
ttcttgaatatagcatcaaaagcacaggcaataaaagcaaaaataggcaagttggactac
atcaaactaaagagcctctgtacagcaaggaaacaaagtgaaaaggcagcctatgaaatg
gaagaacatatttgcaaataa
>gi568815586r:3712126_3930018|GENSCAN_predicted_peptide_6|340_aa
MPLLLGLKEMFHKAEELFSKTTNNEVDDMDTSDTQWGWFYLAECGKWHMFQPDTNSQCSV
SSEDIEKSFKTNPCGSISFTTSKFSYKIDFAEMKQMNLTTGKQRLIKRAPFSISAFSYIC
ENEAIPMPPHWENVNTQVPYQLIPLHNQTHEYNEVANLFGKTMDRNRIKRIQRIQNLDLW
EFFCRKKAQLKKKRGVPQINEQMLFHGTSSEFVEAICIHNFDWRINGIHGAVFGKGTYFA
RDAAYSSRFCKDDIKHGNTFQIHGVSLQQRHLFRTYKSMFLARVLIGDYINGDSKYMRPP
SKDGSYVNLYDSCVDDTWNPKIFVVFDANQIYPEYLIDFH
>gi568815586r:3712126_3930018|GENSCAN_predicted_CDS_6|1023_bp
atgccactgctcctggggctgaaggagatgtttcacaaagcagaagaattattttctaaa
acaacaaacaatgaagtggatgacatggacacgtcagatacccagtggggctggttttac
ttggcagaatgtgggaagtggcacatgtttcagccggataccaacagtcagtgttcagtt
agcagtgaagatatcgaaaaaagcttcaaaacaaacccttgtggctccatttcttttact
acttccaaattcagctacaagatagactttgcagaaatgaagcaaatgaatctcaccact
ggaaagcagcgcttaataaaaagagcccccttttctatcagtgctttcagttacatctgt
gaaaacgaggccatccctatgccaccacactgggagaatgtgaatactcaagtaccatat
cagcttattcctctgcacaatcaaacacatgaatataatgaagttgctaatctctttggg
aagacgatggatcgcaaccgaattaaaagaattcagagaattcaaaacctagatttgtgg
gagttcttttgcaggaaaaaggctcagctcaagaaaaaaagaggtgtgcctcagattaat
gaacaaatgctgtttcatggtaccagcagtgaatttgtggaagcaatctgcattcataac
tttgattggagaataaatggtatacatggtgctgtctttggaaaaggaacctattttgct
agagatgctgcttattccagtcgtttctgcaaagatgacataaagcatgggaacacattc
caaattcatggtgtcagcttgcaacagcggcatctgtttagaacatataaatctatgttt
cttgctcgagtgctaattggagattacataaacggagactccaaatacatgcgacctcct
tccaaagacgggagctatgtgaatttatatgacagctgtgtggatgatacctggaaccca
aagatctttgtggtttttgatgccaaccaaatctatcctgagtacttgatagactttcat
tga
>gi568815586r:3712126_3930018|GENSCAN_predicted_peptide_7|735_aa
MEAAVGVPDGEDQGGAGPREDATPMDAYLRKLGLYRKLVAKDGSCLFRAVAEQVLHSQSR
HVEVRMACIHCLRENREKFEAIIGGSFEGYLKRLENPQEWVGQMEISALSLMYRKDFITN
LEPNVSPSQVTENNFPEKVLLCFSNGNHYDIVYPVKYKESSAMCQSLLYELLYEKVFKTD
VSKIVMELDTLEVADEDNSEISDSEDDSCKSKTAVAAADVNGFKPLSGNEQLKNNGNSTS
LPLARKVLKSLNPAVYRNVEYEIWLESKQAQQKRDYSIAAGLQYEVGDKCQVRLDHNGKF
LNADVQGVHSENGPVLVEELGKYTSKNLKAPPPESWNTVSGKKMKKPSTSGQNFHSDMDY
RGPKNPSKPIKAPLALPPRLQHPSGVRQRVPAPIPGRSVTQTLTPGPDSAVSQTHLTPSP
VPVSIQAVNQPLMPLPQTLSLYQDPLYPGFPCNEKGDRAIVPPSSLCQTGEDLPKDKNIL
RFFFNLGVKAYSCPVWAPHSYLYPLHQAYLAACRMYPKTEASVNGQMLQPVIGPPTFSSP
LVIPPSQVSESHGQLSYQADLESETPGQLLHADYEESLSGKNMFPQPSFGPNPFLGPVPI
APPFFPHVWHGYPFQGFIENPVMRQNIVLPSDEKGELDLSLENLDLSKDCGSVSTVDEFP
AARSEHVHSLPEASVSSKPDEGRTEQSSQTRKADMALASIPPVAEGKAHPPTQILNRERE
KLCLLNLNLKGPFKV
>gi568815586r:3712126_3930018|GENSCAN_predicted_CDS_7|2208_bp
atggaggcggccgtcggcgtccccgacggcgaggaccagggcggcgcggggccccgcgag
gacgcgacgcccatggacgcctatctgcggaaactgggcttgtatcggaaactggtcgcc
aaggacgggtcgtgcctgttccgggccgtggcggagcaggtattgcactctcagtctcgc
catgttgaagtcagaatggcctgtattcactgtcttcgagagaacagagagaaatttgaa
gcgattataggaggatcatttgaaggatatttaaagcgcttggaaaatccacaggaatgg
gtaggacaaatggaaataagtgccctttctcttatgtacaggaaagattttataactaat
ctggaacctaatgtttctccttcacaagtaactgaaaataattttcctgaaaaggtgtta
ctgtgtttttcaaatggaaatcattatgatattgtgtatcccgtaaagtataaagaaagc
tctgctatgtgtcagtctctcctttatgaattgctgtatgagaaggtatttaaaactgat
gttagtaaaattgtgatggaactagacacgttggaagtagctgatgaagacaacagtgaa
atatcagattcagaggatgacagctgcaagagtaaaactgctgttgctgctgctgatgtg
aatggatttaaacctttgtcaggcaacgagcagctgaagaacaatgggaactctactagc
ctgcctttggctagaaaggttcttaagtcactcaatcctgcagtctatagaaatgtggaa
tatgaaatttggctggagtctaaacaagctcagcaaaaacgtgattattccattgctgct
ggcttacaatatgaagttggagacaaatgtcaagttaggttggatcacaatggaaaattt
ttgaatgcagacgttcaaggagttcattctgagaatggaccagttttggttgaagaactg
ggaaagtacacatcaaagaacctcaaggcacctcccccagaaagctggaacacagtgtca
gggaagaagatgaaaaaaccttccacttctggacaaaatttccattctgatatggattac
agagggccaaagaatccaagcaagccaataaaagccccattagcactacctcctcgactg
cagcatccttcaggagtaagacaacgtgtccctgctccaattcctggtcggtcagtgaca
cagactttgacccctggacctgattctgctgtatcccaaactcatttaacaccctctcca
gttcctgtgtcaatacaggcagttaaccagcccttgatgcctttgcctcagacattgagc
ctttatcaagacccactctatcctgggtttccttgtaatgaaaagggagatcgagccatt
gtaccaccttcttcactgtgtcagactggggaggacctacctaaagataagaatattctt
cgattcttcttcaatcttggtgtgaaggcatacagttgtcctgtgtgggccccacattct
tacctgtaccctctgcaccaggcctacctggcagcctgcaggatgtacccaaagactgag
gctagtgttaatggtcaaatgctacagccagtgattggaccgccgacattttcttcacct
ctggttatccctccatctcaggtgtctgaaagtcacggacaattgtcttaccaggctgat
cttgaatctgagacccctgggcagcttctgcatgctgattatgaagagtcactgagtggc
aagaatatgttcccccagccatcttttggacccaatccattcttaggcccagttcctatt
gcacctcctttctttcctcatgtttggcatgggtacccttttcagggattcatagaaaat
ccagtaatgaggcagaatattgtcctgccctctgatgagaaaggagaattggatctgtct
ctggaaaatctggatctgtctaaagattgtggttcagtttccacagtagatgagtttcca
gcagccaggagtgaacatgtacattctctccctgaagcaagtgtgagcagtaagccagac
gaaggccggacagagcaatcttcccagacacgaaaggcagatatggcattggcttccatc
cctcctgtagcagagggaaaggctcaccctcccactcagattctaaacagagagagagag
aaattgtgcctgttgaacttgaacctaaaaggaccattcaaagtctga
>gi568815586r:3712126_3930018|GENSCAN_predicted_peptide_8|174_aa
MGKNQSRKAENSKNQSTSSPPKERSSLQATEQSWMENDFDELTEVGFRRTKHKNHMIISI
DAEKAFNKIQQRFMLKTLNKLENKIPRNPTYKGCEGPLQGELQSTAQRNKRGHKRMEEHS
MLMDRKNQYRENGHTAQEYALSDVAEGNGSVCKIILPTQVCWLSRKKGADKFLK
>gi568815586r:3712126_3930018|GENSCAN_predicted_CDS_8|525_bp
atggggaaaaaccagagcagaaaagctgaaaattctaaaaaccagagcacctcttctcct
ccaaaggaacgcagctccttgcaagcaacggaacaaagctggatggagaatgactttgac
gagttgacagaagttggcttcagaagaaccaaacacaaaaaccacatgattatctcaata
gatgcagaaaaggccttcaacaaaattcaacaacgcttcatgctaaaaactctcaataaa
ctagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaagga
gaactacaaagcactgctcaacgaaataaaagaggacacaaacgaatggaagaacattcc
atgctcatggataggaagaatcaatatcgtgaaaatggccatactgcccaagagtatgcg
ctctcagatgttgcagaaggaaatggcagcgtttgcaagattattcttcccacgcaagtt
tgctggctgtcaaggaaaaaaggtgctgacaagtttctgaaatga
>gi568815586r:3712126_3930018|GENSCAN_predicted_peptide_9|267_aa
MATLLLAVVTPHLHSHNMGTCGYYTHTATLWSSVSSPGALVLTRGHRLLSNSVTSTVLIN
ICISGCMLLHVGTTAHVAVEHLIGGVQDDEDLEMTIGCHGEEMIGDLDKNSFGAGGLCIG
ERVGGPGCCEVLIRMTPTEDVGEERSDMKGIQLSMQERTRCRQFPEGRRHQLGHLLQGGL
GRGEAWKYHQIWEEGHWLLREQGDITTISILRMKILKDYEVSPKCHCQACVTLGRYTVIN
KNLFHRKEIQPAVALRSLGDTVGPETL
>gi568815586r:3712126_3930018|GENSCAN_predicted_CDS_9|804_bp
atggccacacttctgttggcggtggtcactccccatctacattcccataacatgggcaca
tgcgggtactacacccacactgctaccttgtggagttctgtttcttcccctggggcactg
gttcttacaaggggacatagattgctcagcaactctgttactagcactgtcctcattaat
atttgcatctctggatgtatgctgctccacgtaggaaccactgcacacgtggctgtcgag
cacttgataggtggtgtccaggatgatgaggacttagaaatgaccataggatgtcatggt
gaagagatgattggtgaccttgataagaacagttttggcgctggtggtttgtgcatcgga
gagagagtaggagggcctgggtgctgtgaagtcctaattagaatgactccaacagaggat
gtgggagaagagaggtctgacatgaagggaatacaactgagcatgcaagagaggaccaga
tgtagacaatttcctgaaggaaggagacatcagctgggccacctgcttcagggaggcctg
ggaagaggggaggcttggaaatatcaccagatttgggaagaaggtcactggctattgagg
gagcagggagatattactaccatctccattttgaggatgaagatattgaaagattatgaa
gtttctcccaaatgccattgccaggcctgtgtgactcttggtcgttacacggtgatcaat
aaaaatttgtttcataggaaagaaatacagccagccgtggctttaagatcactgggtgat
acagttggaccagaaaccctttaa
>gi568815586r:3712126_3930018|GENSCAN_predicted_peptide_10|180_aa
XRSLVYAILPRTQSRGIALWTGVVRRGFVEEVGLELPREEEQLGQKGKRHPQQGKSMTCE
EWAGLVCQTWGLLAHWVGQSTQETTEAAAVKVFVAVAAMLMTQLGALPPPPAASSKENKS
NLTPHGSWGTFLEAEEELALHSCKGCLGERVEELKLDEGDLGWEREKPRQIRSSVDLWQK
>gi568815586r:3712126_3930018|GENSCAN_predicted_CDS_10|543_bp
nttagaagcctggtctatgccatactgcccagaacccagagcagagggattgccctttgg
actggggtggtcagaagaggctttgtggaagaggtgggacttgagctgcctagggaggag
gaacaacttgggcagaaaggcaagaggcatcctcaacaaggaaaatccatgacatgtgag
gaatgggctggcttggtctgccagacgtgggggctccttgcccactgggtcgggcagtcc
actcaagagaccacagaggcagcggctgtaaaggtctttgttgctgttgctgccatgtta
atgacccagcttggggcgctgccgccgccgccggccgcttcttccaaagaaaacaaatca
aacttgacaccacatgggagttggggcacgtttcttgaggcagaggaagaactggcccta
cactcttgcaagggctgcttgggagaaagagttgaagaattgaagctggatgagggagat
ttgggctgggaaagagagaaacctagacagattagatcaagtgtggatttatggcagaaa
taa