GENSCAN 1.0 Date run: 4-Nov-116 Time: 03:35:33 Sequence gi568815575r:16745038_16969220 : 224183 bp : 43.10% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 9908 10056 149 2 2 58 76 174 0.889 13.08 1.02 Intr + 11626 11684 59 2 2 48 74 16 0.481 -5.20 1.03 Intr + 12125 12272 148 1 1 53 93 140 0.580 10.81 1.04 Term + 15195 15322 128 1 2 86 37 169 0.999 10.04 1.05 PlyA + 15844 15849 6 1.05 2.00 Prom + 34994 35033 40 -3.76 2.01 Init + 40688 40703 16 1 1 76 108 6 0.473 1.79 2.02 Intr + 41911 41997 87 2 0 32 85 106 0.464 4.64 2.03 Intr + 42990 43039 50 1 2 60 67 51 0.378 -1.40 2.04 Term + 45710 45802 93 1 0 36 49 127 0.595 1.43 2.05 PlyA + 47585 47590 6 1.05 3.00 Prom + 49938 49977 40 -4.16 3.01 Init + 55947 56098 152 2 2 36 58 175 0.727 6.82 3.02 Intr + 72911 73055 145 2 1 84 50 86 0.849 4.68 3.03 Intr + 73537 73840 304 0 1 114 58 132 0.897 8.96 3.04 Intr + 75127 75218 92 2 2 74 70 76 0.729 4.11 3.05 Intr + 83057 83227 171 1 0 11 96 127 0.398 5.94 3.06 Intr + 84539 84733 195 1 0 -15 80 289 0.689 17.41 3.07 Intr + 87586 87705 120 0 0 50 78 161 0.996 11.99 3.08 Intr + 92556 92648 93 2 0 68 63 57 0.700 1.36 3.09 Term + 96391 96729 339 0 0 81 44 360 0.999 25.44 3.10 PlyA + 97198 97203 6 1.05 4.12 PlyA - 98611 98606 6 1.05 4.11 Term - 100066 99998 69 1 0 83 40 83 0.950 0.94 4.10 Intr - 100901 100791 111 2 0 61 101 27 0.742 1.88 4.09 Intr - 104264 104207 58 1 1 61 80 87 0.652 4.09 4.08 Intr - 107591 107374 218 2 2 80 81 48 0.255 0.60 4.07 Intr - 107838 107712 127 2 1 97 8 96 0.943 3.18 4.06 Intr - 108805 108645 161 1 2 113 67 185 0.981 17.69 4.05 Intr - 112672 112557 116 2 2 89 47 20 0.895 -1.83 4.04 Intr - 113812 113639 174 2 0 57 91 161 0.904 13.21 4.03 Intr - 118063 117918 146 0 2 98 66 118 0.992 10.53 4.02 Intr - 124217 124039 179 2 2 58 69 189 0.942 12.72 4.01 Init - 124561 124457 105 1 0 60 80 91 0.359 5.72 4.00 Prom - 128670 128631 40 -0.66 5.00 Prom + 130614 130653 40 -0.96 5.01 Sngl + 153029 153343 315 1 0 67 43 434 0.889 32.75 5.02 PlyA + 154697 154702 6 1.05 6.00 Prom + 165888 165927 40 -3.16 6.01 Init + 201825 202097 273 2 0 89 94 553 0.998 50.97 6.02 Term + 211703 211813 111 1 0 112 48 49 0.816 1.86 6.03 PlyA + 214892 214897 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:16745038_16969220|GENSCAN_predicted_peptide_1|161_aa XVKEEVFWRNYFYRVSLIKQSAQLTALAAQQQAAGKEEKSNGREQDLPLAEAVRPKTPPV VIKSQLKTQEDEEEISTSPGVSEFVSDAFDACNLNQEDLRKEMEQLVLDKKQEETAVLEE DSADWEKELQQELQEYEVVTESEKRDENWDKEIEKMLQEEN >gi568815575r:16745038_16969220|GENSCAN_predicted_CDS_1|486_bp nntgtgaaggaagaagtgttctggaggaactacttttaccgcgtctccctgattaagcag tcagcccagctcacggccctggctgcccaacagcaggccgcagggaaggaggagaagagc aatggcagagagcaagatttgccgctggcagaggcagtacggcccaaaacgccacccgtt gtaatcaaatctcagcttaaaactcaagaggatgaggaagaaatttctactagcccaggt gtttctgagtttgtcagtgatgccttcgatgcctgtaacctaaatcaggaagatctaagg aaagaaatggagcaactagtgcttgacaaaaagcaagaggagacagccgtactggaagag gattctgcagattgggaaaaagaactgcagcaggaacttcaagaatatgaagtggtgaca gaatctgaaaaacgagatgaaaactgggataaggaaatagagaaaatgcttcaagaggaa aattag >gi568815575r:16745038_16969220|GENSCAN_predicted_peptide_2|81_aa MNEVPITAVTVLPALSAAVRGLAGVVGSPSWRRAERSSACRTEKEAVDQCPHQSDAVEKA GQLSLPSFADDKLLNDVNSLA >gi568815575r:16745038_16969220|GENSCAN_predicted_CDS_2|246_bp atgaatgaagtgccaatcaccgccgtcaccgtgctcccggcgctgagcgccgccgtccgg gggctcgctggggtcgtgggctcgccctcctggcgtcgggcagaaagatccagtgcttgc aggactgaaaaagaagctgtggaccaatgtcctcaccagtcagatgctgtggagaaggct gggcagctttcactgccctcatttgctgatgacaagctgctcaacgatgttaactcactt gcctaa >gi568815575r:16745038_16969220|GENSCAN_predicted_peptide_3|536_aa MCDAPDPLALLLLLSLCGTPAPVLPSVMNKSSLGLARSDASAMLPVQPTEPVLEMSAFCQ SRGEMVEKELVLLRIKVLDSRVLCPHSVGGNFTWKSPLGFEIGTMEEAGICGLGVKADML CNSQSNDILQHQGSNCGGTSNKHSLEEDEGSDFITENRNLVSPAYCTQESREEIPGGEAR TDPPDGQQDSECNRNKEKTLGKEVLLLMQALNTLSTPEEKLAALCKKYADLLEESRSVQK QMKILQKKQAQIVKEKVHLQSEHSKAILARSKLESLCRELQRHNKTLKEENMQQAREEEE RRKEATAHFQITLNEIQAQLEQHDIHNAKLRQENIELGEKLKKLIEQYALREEHIDKVFK HKELQQQLVDAKLQQTTQLIKEADEKHQREREFLSLYMDKFEEFQTTMAKSNELFTTFRQ EMEKKTVRDKEYKALQIKLERLEKLCRALQTERNELNEKVEVLKEQVSIKAAIKAANRDL ATPVMQPCTALDSHKELNTSSKRALGAHLEAEPKSQRSAVQKPPSTGSAPAIESVD >gi568815575r:16745038_16969220|GENSCAN_predicted_CDS_3|1611_bp atgtgtgatgcccctgaccccttagctctcttgcttctgctctcgctgtgtggcacgcct gctcctgttttgccttccgtcatgaataagagctccctgggcctcgccagaagcgatgcc agcgccatgcttcctgtacagccaacagaaccggtcttggagatgtctgcattttgccag agtcggggagagatggttgagaaagaacttgtgcttctaagaataaaagtcttggacagt cgtgtcctatgcccacattcagttggtgggaacttcacgtggaagtcgccccttggtttt gaaattggcacaatggaagaagctggaatttgtgggctaggggtgaaagcagatatgttg tgtaactctcaatcaaatgatattcttcaacatcaaggctcaaattgtggtggcacaagt aacaagcattcattggaagaggatgaaggcagtgactttataacagagaacaggaatttg gtgagcccagcatactgcacgcaagaatcaagagaggaaatccctgggggagaagctcga acagatccccctgatggtcagcaagattcagagtgcaacaggaacaaagaaaaaacttta ggaaaagaagttttattactgatgcaagccctaaacaccctttcaaccccagaggagaag ctggcagctctctgtaagaaatatgctgatcttctggaggagagcaggagtgttcagaag caaatgaagatcctgcagaagaagcaagcccagattgtgaaagagaaagttcacttgcag agtgaacatagcaaggctatcttggcaagaagcaagctagaatctctttgcagagaactt cagcgtcacaataagacgttaaaggaggaaaatatgcagcaggcacgagaggaagaagaa cgacgtaaagaagcaactgcacatttccagattaccttaaatgaaattcaagcccagctg gagcagcatgacatccacaacgccaaactccgacaggaaaacattgagctgggggagaag ctaaagaagctcatcgaacagtacgcactgagggaagagcacattgataaggtgttcaaa cataaggaactgcaacagcagctcgtggatgccaaactgcagcaaacgacacaactgata aaagaagctgatgaaaaacatcagagagagagagagtttctttctctttatatggataag tttgaagaattccagactaccatggcaaaaagcaatgaactgtttacaaccttcagacag gaaatggaaaagaaaacagtccgtgataaagagtacaaggcccttcaaataaaactggaa cggttagagaagctgtgcagggctcttcagacagaaaggaatgagctcaatgagaaggtg gaagtcctgaaagagcaggtatccatcaaagcggccatcaaagcggcgaacagggattta gcaacacctgtgatgcagccctgtactgccctggattctcacaaggagctgaacacttcc tcgaaaagagccctgggagcgcacctggaggctgagcccaagagtcagagaagcgctgtg caaaagcccccgtccacaggctctgctccggccatcgagtcggttgactaa >gi568815575r:16745038_16969220|GENSCAN_predicted_peptide_4|487_aa MGIGETRPVGFCYTCICLPDWVAQPLYVQVVILEQPLNRVGFLQISVFEDTVEERVINEE YKIWKKNTPFLYDLVMTHALQWPSLTVQWLPEVTKPEGKDYALHWLVLGTHTSDEQNHLV VARVHIPNDDAQFDASHCDSDKGEFGGFGSVTGKIECEIKINHEGEVNRARYMPQNPHII ATKTPSSDVLVFDYTKHPAKPDPSGECNPDLRLRGHQKEGYGLSWNSNLSGHLLSASDDH TVCLWDINAGPKEGKIVDAKAIFTGHSAVVEDVAWHLLHESLFGSVADDQKLMIWDTRSN TTSKPSHLVDAHTAEVNCLSFNPYSEFILATGSADKTVALWDLRNLKLKLHTFESHKDEI FQVCDSFLVSVCQEMKSRDSSVKLYQTRGSTIEKNVLYVVLHGDAHRVSKIGEEQSAEDA EDGPPELLFIHGGHTAKISDFSWNPNEPWVICSVSEDNIMQIWQMAENIYNDEESDVTTS ELEGQGS >gi568815575r:16745038_16969220|GENSCAN_predicted_CDS_4|1464_bp atggggattggagagaccaggcctgtgggcttctgctacacgtgcatttgtcttcccgac tgggtcgcgcagcccctgtacgtacaggtcgtcatcttagaacagccacttaatcgtgtg ggctttctccaaatttcagtgtttgaagatactgtggaggagcgtgtcatcaatgaagaa tataaaatctggaagaagaatacaccgtttctatatgacctggttatgacccatgctctt cagtggcccagtcttaccgttcagtggcttcctgaagtgactaaacctgaaggaaaagat tatgcccttcattggctagtgctggggactcatacgtctgatgagcagaatcatctggtg gttgctcgagtacatattcccaatgatgatgcacagtttgatgcttcccattgtgacagt gacaagggtgaatttggtggctttggttctgtaacaggaaaaattgaatgtgaaattaaa atcaatcacgaaggagaagtaaaccgtgctcgttacatgccgcagaatcctcacatcatt gctacaaaaacaccatcttctgatgtgttggtttttgactatacaaaacaccctgctaaa ccagacccaagtggagaatgtaatcctgatctcagattaagaggtcaccagaaggaaggc tatggtctctcctggaattcaaatttgagtggacatctcctaagtgcatctgatgaccat actgtttgtctgtgggatataaacgcaggaccaaaagaaggcaaaattgtggatgctaaa gccatctttactggccactcagctgttgtagaggatgtggcctggcacctgctgcacgag tcattgtttggatctgttgctgatgatcagaaacttatgatatgggacaccaggtccaat accacctccaagccgagtcacttggtggatgcgcacactgccgaagtcaactgcctctca ttcaatccctacagcgaatttattctagccaccggctctgcggataagaccgtagcttta tgggatctgcgtaacttaaaattaaaactccataccttcgaatctcataaagatgaaatt ttccaggtatgtgacagttttctggtgtctgtatgtcaggaaatgaaatccagggattca tcagttaagctataccagactcgtgggtccactattgaaaaaaatgtcttgtacgtggtg ttacatggagatgctcatagggtaagtaaaattggggaagaacaatcagcagaagatgca gaagatgggcctccagaactcctgtttattcatggaggacacactgctaagatttcagat tttagctggaaccccaatgagccttgggtcatttgctcagtgtctgaggataacatcatg cagatatggcaaatggctgaaaatatttacaatgatgaagagtcagatgtcacgacatcc gaactggagggacaaggatcttaa >gi568815575r:16745038_16969220|GENSCAN_predicted_peptide_5|104_aa MATFTYSKEREKPQKREWLQVEHAVDGILEAKISLLQECPGEGSANATNEQGPQHQGKAL HIGLCGLEGEHEETPGDEEDHKDQCHCSSSSLEIREKIDDEEDS >gi568815575r:16745038_16969220|GENSCAN_predicted_CDS_5|315_bp atggcgactttcacatacagcaaagagagggaaaagccccagaaacgtgagtggctgcag gtggaacatgctgtcgatggaatcctggaagccaagatcagccttctgcaggaatgtccg ggtgagggttcagcgaatgccaccaatgagcaaggtccccagcaccaaggcaaagccctc cacattgggctgtgtggacttgaaggtgaacatgaagagacccccggcgatgaggaggat cacaaggaccagtgtcattgcagctcctccagcttggagatcagagagaagattgatgat gaagaggacagctaa >gi568815575r:16745038_16969220|GENSCAN_predicted_peptide_6|127_aa MEAAAAAAAAAAAAAAAGGGCGSGPPPLLLSEGEQQCYSELFARCAGAAGGGPGSGPPEA ARVAPGTATAAAGPVADLFRASQLPAETLHQLCFTAKEEGQGPLEPVPAACGACFIYYCY LYFRDLG >gi568815575r:16745038_16969220|GENSCAN_predicted_CDS_6|384_bp atggaggcggcagcggcggcggcggcggcggcagcggcagcggcagcggcgggcgggggc tgtggctccgggccgccgccgctgctgctgagcgagggcgagcagcagtgctactccgag ctcttcgcgcgctgtgccggcgccgcgggcgggggccccgggtctgggccccccgaggcc gccagagtcgcccccggcacggccactgcggccgccggccccgtggctgacctgtttcgg gcatcgcagctgcccgccgagacgctgcaccagctgtgcttcactgcaaaggaggaagga caaggccctctggagccagtgccagctgcttgtggggcctgttttatttactactgctat ctctacttcagagacttgggctag