GENSCAN 1.0 Date run: 8-Nov-116 Time: 06:10:36 Sequence gi568815594r:40325907_40538893 : 212987 bp : 43.15% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 251 246 6 1.05 1.04 Term - 3255 3128 128 1 2 46 36 142 0.811 3.24 1.03 Intr - 3919 3844 76 1 1 53 110 66 0.931 4.39 1.02 Intr - 5204 5079 126 2 0 75 89 112 0.988 10.88 1.01 Init - 5401 5351 51 1 0 58 44 65 0.936 -1.72 1.00 Prom - 6816 6777 40 -6.06 2.00 Prom + 8490 8529 40 -6.56 2.01 Init + 9562 9625 64 0 1 63 78 4 0.883 -1.65 2.02 Intr + 9921 10066 146 1 2 27 101 165 0.978 11.70 2.03 Intr + 11304 11458 155 2 2 39 108 146 0.944 10.57 2.04 Term + 14657 14744 88 2 1 55 46 92 0.664 -1.17 2.05 PlyA + 15196 15201 6 1.05 3.00 Prom + 18707 18746 40 -1.66 3.01 Init + 22803 22915 113 2 2 73 74 58 0.897 2.58 3.02 Intr + 22976 23508 533 2 2 89 95 632 0.997 56.57 3.03 Term + 28073 28614 542 0 2 65 38 391 0.235 26.22 3.04 PlyA + 29100 29105 6 1.05 4.00 Prom + 29196 29235 40 -4.86 4.01 Init + 33974 34014 41 1 2 61 105 11 0.264 -0.24 4.02 Intr + 37789 37920 132 1 0 116 65 61 0.353 6.36 4.03 Term + 49988 50018 31 2 1 117 42 43 0.050 -0.07 4.04 PlyA + 52297 52302 6 1.05 5.05 PlyA - 55852 55847 6 1.05 5.04 Term - 70903 70820 84 1 0 95 44 59 0.401 -0.15 5.03 Intr - 73566 73546 21 0 0 108 72 19 0.242 0.04 5.02 Intr - 78535 78504 32 2 2 103 100 32 0.238 3.75 5.01 Init - 81305 81179 127 2 1 81 98 22 0.184 2.83 5.00 Prom - 81711 81672 40 -4.56 6.00 Prom + 92316 92355 40 -2.96 6.01 Init + 93427 93501 75 0 0 68 80 107 0.414 8.99 6.02 Term + 93766 93861 96 0 0 79 34 17 0.267 -6.63 6.03 PlyA + 94109 94114 6 1.05 7.06 PlyA - 94817 94812 6 1.05 7.05 Term - 100237 99998 240 1 0 100 54 314 0.994 25.23 7.04 Intr - 106956 106745 212 1 2 100 80 237 0.962 22.73 7.03 Intr - 110741 110535 207 0 0 27 82 110 0.469 3.35 7.02 Intr - 113018 111865 1154 1 2 135 96 3198 0.892 313.97 7.01 Init - 129171 129023 149 0 2 86 97 50 0.367 5.26 7.00 Prom - 130196 130157 40 -3.96 8.00 Prom + 132700 132739 40 -3.16 8.01 Init + 137412 137647 236 2 2 93 35 87 0.052 1.51 8.02 Term + 165816 165975 160 0 1 66 45 174 0.392 8.31 8.03 PlyA + 167577 167582 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:40325907_40538893|GENSCAN_predicted_peptide_1|126_aa MRRRMAGPGLTPTAAQPGDRDSVVDEELEFPDIGDGGNCGYSQARGWQGEQQGNGSEVQP LLTAICTNHTCLLAANGAPEFPSPVDSLQWILTLFGIKAKILTGNYKAVRDVSPDFPSPT LSPSLM >gi568815594r:40325907_40538893|GENSCAN_predicted_CDS_1|381_bp atgcgcagacgcatggcagggccagggctgactcccactgcagctcagccaggtgaccgt gatagtgtcgtggatgaggaattagagtttccagatataggtgatggtgggaactgtggc tacagccaggccaggggctggcagggtgagcagcaggggaatgggtcagaagttcaacca ttgctgactgccatctgcaccaaccacacttgcctcctagcagccaacggtgcccctgaa ttcccttctccagtggattctctccagtggattctcaccttatttggaataaaagccaaa atccttactggcaactacaaggccgtacgtgatgtctcccccgactttccttctccaaca ctctccccctcgctcatgtag >gi568815594r:40325907_40538893|GENSCAN_predicted_peptide_2|150_aa MNWSHSCISFCWIYFAASRLRAAETADGKYAQKLFNDLFEDYSNALRPVEDTDKVLNVTL QITLSQIKDMDERNQILTAYLWIRQIWHDAYLTWDRDQYDGLDSIRIPSDLVWRPDIVLY NKYNIIASQKQVQNIIGNTMGFLSCGKKEK >gi568815594r:40325907_40538893|GENSCAN_predicted_CDS_2|453_bp atgaactggtcccattcctgcatctccttttgctggatctactttgctgcttccagactg agagctgcagagacggcagatggaaaatatgctcagaagttgtttaatgacctttttgaa gattattctaatgctcttcgtccagtggaagatacagataaagtcctgaatgtgaccctg cagattacgctctctcagattaaggatatggatgaaagaaaccaaattctgactgcttat ttgtggatccgccaaatctggcacgatgcctatctcacgtgggaccgagatcagtacgat ggcctagactccatcaggatccccagtgacctcgtgtggaggccagacatcgtcttatat aacaaatacaatatcattgcttctcagaagcaggtacagaacatcataggcaacacaatg ggcttcctgagttgtggcaagaaggagaaatag >gi568815594r:40325907_40538893|GENSCAN_predicted_peptide_3|395_aa MEEKDKLELIPDFRATVTGKMVVHCQNTEVMGTVSCLGADDESSEPVNTNVVLRYDGLIT WDAPAITKSSCVVDVTYFPFDNQQCNLTFGSWTYNGNQVDIFNALDSGDLSDFIEDVEWE VHGMPAVKNVISYGCCSEPYPDVTFTLLLKRRSSFYIVNLLIPCVLISFLAPLSFYLPAA SGEKVSLGVTILLAMTVFQLMVAEIMPASENVPLIGKYYIATMALITASTALTIMVMNIH FCGAEARPVPHWARVVILKYMSRVLFVYDVGESCLSPHHSRERDHLTKVYSKLPESNLKA ARNKDLSRKKDMNKRLKNDLGCQGKNPQEAESYCAQYKVLTRNIEYIAKCLKDHKATNSK GSEWKKVAKVIDRFFMWIFFIMVFVMTILIIARAD >gi568815594r:40325907_40538893|GENSCAN_predicted_CDS_3|1188_bp atggaggagaaggacaaattagagttgattccagacttccgagccacagtgacaggcaaa atggtggtccattgccaaaacactgaagtgatggggacagtgagctgtctgggggctgat gatgaatcttcagagcctgtgaacaccaatgtggtcctgcggtatgatgggctgatcacc tgggatgcaccggccatcaccaaaagctcctgtgtggtggatgtcacctacttccctttt gacaaccagcagtgcaacctgacttttggttcctggacctacaatggcaatcaggtggac atattcaacgccttggacagcggagatctctctgacttcattgaagatgtggaatgggag gtccatggcatgcccgctgtgaagaatgtgatctcctatggctgctgctctgagccttac ccggatgtcacattcaccctccttctgaagaggaggtcctcgttctatatcgtcaacctc ctcatcccatgcgtcctcatatcttttctggctcctctgagtttttatctcccagcagcc tccggagaaaaggtctccctgggagtgaccatcctgttggccatgactgtatttcagcta atggtggcagaaatcatgccggcctcagaaaatgtgcccctgataggtaaatactacata gccacgatggccctgatcacagcctccactgcgttgaccatcatggtgatgaatatccac ttctgtggggccgaggcccggccggtgccacactgggccagggtggtcatcctgaaatac atgtccagggtcttgtttgtctatgatgtgggtgaaagctgcctcagcccgcaccacagt agagagcgggaccacctcacgaaagtttatagcaaactcccagagtctaacctgaaagca gccaggaacaaagacctttccagaaagaaggacatgaacaaacgcttaaagaacgacctg ggctgccagggtaagaaccctcaggaggccgagagttactgtgcacagtacaaagtgctg acgaggaatattgagtacatcgccaagtgcctcaaagaccacaaggccaccaattccaag gggagtgaatggaagaaggtggcgaaagtcatagaccgattcttcatgtggatttttttc attatggtgtttgtgatgactattttgatcatagcaagagcggattag >gi568815594r:40325907_40538893|GENSCAN_predicted_peptide_4|67_aa MVHQRCPSANVKDCLCVSFSVSLGLRPTLYQCNRDKPGNWTVLDKLETQVLEHLKQLWHS LIPGHPI >gi568815594r:40325907_40538893|GENSCAN_predicted_CDS_4|204_bp atggtccaccaaagatgtccaagtgctaatgtgaaggactgcctctgtgtctccttttcc gtcagcttagggctcagaccaacgctttaccaatgcaaccgggacaaacctggcaactgg actgtcttggacaaactggaaacacaagttctagaacatctgaagcaattgtggcactcg ctcattcctggacatcctatctaa >gi568815594r:40325907_40538893|GENSCAN_predicted_peptide_5|87_aa MGQNKGLQAPCKSEIQQGSQVLKLQNGLLCFHVSHPGHTDAKATRAMDTTNQEIKQFFQP SKTKEQRYDSKVAVHIVKKEYNVTRII >gi568815594r:40325907_40538893|GENSCAN_predicted_CDS_5|264_bp atgggccagaacaagggactacaggccccatgcaagtctgaaatccagcagggcagtcaa gtcttaaagctccaaaatggcctcctttgcttccatgtctcacatccaggtcacactgat gcaaaagcgacaagggccatggatacaactaatcaggaaataaaacagttcttccagcca agtaaaaccaaggagcaaagatatgattctaaagttgcagtccatattgtcaaaaaggaa tataatgtaaccaggatcatctga >gi568815594r:40325907_40538893|GENSCAN_predicted_peptide_6|56_aa MQLLYDPRGKGEGPRGWNLDLHKKQLTQGHEPYLMLNSAKVGFLHTFSSFKAATLN >gi568815594r:40325907_40538893|GENSCAN_predicted_CDS_6|171_bp atgcagctgctgtatgacccaagaggaaagggtgaaggtcccagaggctggaacctggac ctgcacaagaaacagttaacacagggccatgaaccctatttaatgttgaacagtgcaaag gttggatttcttcacactttctcctcctttaaggcagccaccctgaactga >gi568815594r:40325907_40538893|GENSCAN_predicted_peptide_7|653_aa MGMVESNDFPTTLPCVCTGNSFHSISQCPFLTNRGLILLTLAIAKPAVSKFPPAVDAFDI MTAEDSTAAMSSDSAAGSSAKVPEGVAGAPNEAALLALMERTGYSMVQENGQRKYGGPPP GWEGPHPQRGCEVFVGKIPRDVYEDELVPVFEAVGRIYELRLMMDFDGKNRGYAFVMYCH KHEAKRAVRELNNYEIRPGRLLGVCCSVDNCRLFIGGIPKMKKREEILEEIAKVTEGVLD VIVYASAADKMKNRGFAFVEYESHRAAAMARRKLMPGRIQLWGHQIAVDWAEPEIDVDED VMETVKILYVRNLMIETTEDTIKKSFGQFNPGCVERVKKIRDYAFVHFTSREDAVHAMNN LNGTELEGSCLEVTLAKPVDKEQYSRYQKAARGGGAAEAAQQPSYVYSCDPYTLAYYGYP YNALIGPNRDYFVKAGSIRGRGRGAAGNRAPGPRGSYLGGYSAGRGIYSRYHEGKGKQQE KGYELVPNLEIPTVNPVAIKPGTVAIPAIGAQYSMFPAAPAPKMIEDGKIHTVEHMISPI AVQPDPASAAAAAAAAAAAAAAVIPTVSTPPPFQGRPITPVYTVAPNVQRIPTAGIYGAS YVPFAAPATATIATLQKNAAAAAAMYGGYAGYIPQAFPAAAIQVPIPDVYQTY >gi568815594r:40325907_40538893|GENSCAN_predicted_CDS_7|1962_bp atggggatggtagaaagtaatgatttcccaacaactttaccatgtgtctgtactggaaat tccttccactctatcagccaatgtccatttctcaccaatcgtggtcttatcctgttaaca ctggctattgctaaacctgctgtaagtaagtttccgccagctgtggatgcctttgacatt atgaccgcagaggattccaccgcagccatgagcagtgactcggccgccgggtcctccgcc aaggtgcccgagggcgtggcgggcgcgcccaacgaggcagcactgctggcgctgatggag cgcacgggctacagcatggtgcaagagaacgggcagcgcaagtacggcggcccaccgccc ggctgggagggcccgcacccgcagcgtggctgcgaggtcttcgtgggcaagatcccgcgc gacgtgtacgaggacgagctggtgcccgtgttcgaggccgtgggccgcatctacgagctg cgcctcatgatggactttgacggcaagaaccgcggctacgccttcgtcatgtactgccac aagcacgaggccaagcgcgcagtgcgtgagctcaacaactacgagatccgcccgggccgc ctgctcggcgtgtgctgcagcgtggacaactgccgcctcttcatcggcgggatccccaag atgaagaagcgcgaggaaatcctggaggagattgccaaggtcaccgagggcgtgctggac gtgatcgtctacgccagcgcggccgacaagatgaagaaccgcggcttcgccttcgtggag tacgagagccaccgcgcggctgccatggctcgccgcaagctcatgcctggccgcatccag ctgtggggccaccagatcgccgtggactgggccgaacctgagatcgacgtggacgaggac gtgatggagaccgtgaagatcctctacgtgcgcaacctcatgatcgagaccaccgaggac accatcaagaagagcttcggccagttcaaccccggctgcgtggagcgcgtcaagaagatc cgcgactacgccttcgtgcacttcaccagccgcgaggatgccgtgcatgccatgaacaac ctcaacggcactgagctggagggctcgtgcctggaggtcacgctggccaagcccgtggac aaggagcagtactcgcgctaccagaaggcagccaggggcggcggcgcggctgaggcagcg cagcagcccagctacgtgtactcctgcgacccctacacactggcctactacggctacccc tacaacgcgctcattgggcccaacagggactactttgtgaaagcaggcagcataagaggc cgagggcgaggtgcagctggcaacagagccccagggcctaggggttcctacctcggggga tattctgctggtcgtggtatatatagccgatatcatgaagggaaaggaaagcagcaagaa aaaggatatgaactggtgccgaatttggaaatccctaccgtcaacccagttgccattaaa cctggtacagtagccatccctgccattggggctcagtattccatgtttccagcagctcca gcccctaaaatgattgaagatggcaaaatccacacagtggagcacatgatcagccccatt gctgtgcagccagacccagccagtgctgctgccgccgcagccgcggccgcagccgccgca gccgctgtcattcccactgtgtcgacgccaccacctttccagggccgcccaataactcca gtatacacggtggctccaaacgttcagagaattcctactgccgggatctacggggccagt tacgtgccatttgctgctccagctacagccacgatcgccacactacagaagaacgcggca gccgcggccgccatgtatggaggatacgcaggctacatacctcaggccttccctgctgct gccattcaggtccccatccccgacgtctaccagacatactga >gi568815594r:40325907_40538893|GENSCAN_predicted_peptide_8|131_aa MIYRFTLTVMAIIKKTKPQVLVRMWRNWNPCALLVGMYNGAASVGKSMVVLQKYNHNKQK YHMSQQLLLMHSARNEHKCCRSQMTKRTSSSGKHRNKMHTMCHRCGSEAYHLQKSTCGKC GYIAKCKRKCN >gi568815594r:40325907_40538893|GENSCAN_predicted_CDS_8|396_bp atgatataccgcttcacactcactgtgatggcaattatcaaaaaaacaaaaccacaagtg ttggtgagaatgtggagaaactggaacccttgtgcactgctggtgggaatgtataatggt gcagcttctgtgggaaaaagtatggtggtgcttcaaaaatataatcataataaacagaag taccacatgagccagcagcttctcctgatgcacagtgcaagaaatgaacataagtgctgc agaagccaaatgacaaagagaacgtcatcgtctggaaagcatcgcaataagatgcacacg atgtgccaccgctgtggctctgaagcctaccaccttcagaagtcaacctgtggcaaatgt ggctacattgccaagtgcaagagaaagtgtaactga