GENSCAN 1.0 Date run: 4-Nov-116 Time: 14:42:15 Sequence gi568815592f:34657841_34873567 : 215727 bp : 43.55% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 Intr - 25452 25389 64 2 1 103 75 43 0.186 3.22 1.01 Init - 38763 38606 158 0 2 110 93 351 0.924 37.18 1.00 Prom - 55933 55894 40 -5.86 2.00 Prom + 60578 60617 40 -2.86 2.01 Init + 60954 60956 3 2 0 108 81 0 0.485 1.30 2.02 Intr + 86495 86666 172 1 1 1 84 133 0.079 3.72 2.03 Intr + 99540 99711 172 1 1 72 89 96 0.357 7.10 2.04 Intr + 100072 100114 43 1 1 120 84 53 0.954 6.34 2.05 Intr + 104755 104863 109 0 1 94 55 39 0.610 1.06 2.06 Intr + 110128 110157 30 2 0 125 88 1 0.320 1.90 2.07 Intr + 112432 112555 124 2 1 51 119 91 0.419 8.14 2.08 Term + 115606 115804 199 1 1 130 43 80 0.986 4.57 2.09 PlyA + 115926 115931 6 1.05 3.00 Prom + 126890 126929 40 -5.06 3.01 Init + 134285 134328 44 1 2 110 78 107 0.517 9.85 3.02 Intr + 157179 157310 132 0 0 94 32 98 0.835 4.56 3.03 Intr + 163813 163975 163 1 1 93 80 184 0.876 18.08 3.04 Intr + 165417 165508 92 2 2 79 88 72 0.995 5.09 3.05 Intr + 176377 176566 190 1 1 77 46 272 0.999 21.49 3.06 Intr + 176881 177021 141 0 0 109 89 35 0.982 6.25 3.07 Intr + 177454 177624 171 0 0 91 66 192 0.982 17.44 3.08 Intr + 178316 178508 193 1 1 76 80 243 0.983 21.37 3.09 Intr + 182037 182228 192 1 0 77 49 100 0.760 4.46 3.10 Intr + 193290 193363 74 1 2 89 75 19 0.544 -0.17 3.11 Intr + 194345 194375 31 1 1 68 110 29 0.395 0.80 3.12 Intr + 197777 197863 87 0 0 115 66 88 0.943 9.24 3.13 Intr + 198399 198569 171 1 0 104 94 76 0.989 9.71 3.14 Intr + 198950 199089 140 0 2 73 94 72 0.977 6.48 3.15 Intr + 199570 199630 61 0 1 57 115 63 0.500 4.21 3.16 Intr + 199884 200077 194 1 2 101 100 159 0.998 17.61 3.17 Intr + 200572 201731 1160 0 2 99 108 697 0.637 60.49 3.18 Intr + 206162 206358 197 2 2 71 84 192 0.900 16.16 3.19 Intr + 209385 209539 155 1 2 103 65 56 0.966 4.59 3.20 Intr + 209626 209776 151 0 1 40 110 186 0.963 15.74 3.21 Intr + 213014 213276 263 0 2 69 70 173 0.965 10.81 3.22 Intr + 213744 213852 109 2 1 110 101 93 0.948 12.66 3.23 Intr + 213979 214085 107 2 2 69 94 76 0.997 6.23 3.24 Term + 214473 214598 126 2 0 110 47 125 0.999 8.88 3.25 PlyA + 215353 215358 6 -0.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:34657841_34873567|GENSCAN_predicted_peptide_1|74_aa MEGMDVDLDPELMQKFSCLGTTDKDVLISEFQRLLGFQLNPAGCAFFLDMTNWVVDNKAL IPPLFEWELNLVVK >gi568815592f:34657841_34873567|GENSCAN_predicted_CDS_1|222_bp atggagggcatggacgtagacctggacccggagctgatgcagaagttcagctgcctgggc accaccgacaaggacgtgctcatctccgagttccagaggctgctcggcttccagctcaat cctgccggttgcgccttcttcctggacatgaccaactgggtagtagataacaaggccctt atacctccattgtttgagtgggaactgaatttagttgtcaag >gi568815592f:34657841_34873567|GENSCAN_predicted_peptide_2|283_aa MSRGYVKEQFAWRHFYWYLTNEGIQYLHDYLHLPPEIVPATLRRRHPETGRPRPKGLEDE RKKRKSPAPRSAAGGEGFGSLHASLVGFRGVVAGCARHFRASRNGVANGLQSNMPKFYCD YCDTYLTHDSPSVRKTHCSGRKHKENVKDYYQKWMEEQAQSLIDKTRAMIPPPPSLRSIL YFSGSSSPWYDASTPYGGPSHDANDGPSSSWDDASGTCSWNEAAHGRPYANDAWAPNDET SCPSHDGAHSARNDSTRQIRIEGRPYCIGFILPVLLHQEIMLL >gi568815592f:34657841_34873567|GENSCAN_predicted_CDS_2|852_bp atgtcccgaggctacgtgaaggaacagtttgcctggagacatttctactggtaccttacc aatgagggtatccagtatctccatgattaccttcatctgcccccggagattgtgcctgcc accctacgccgtaggcatccagagactggcaggcctcggcctaaaggtctggaggatgaa aggaagaaaaggaaaagccccgcccctcgctcggctgctggaggcgagggcttcggaagt cttcatgctagtctcgtggggttccgcggtgtcgtcgctggctgtgcgcgtcatttccgg gcgtcacgtaacggagtggccaacggcctgcagagcaacatgcccaagttttattgtgac tactgcgatacatacctcacccatgactctccatctgtgagaaagacacactgcagtgga aggaaacacaaagagaatgtgaaagactattatcagaaatggatggaagagcaggctcag agcctgattgacaaaacaagggcgatgataccacctccccccagccttcgctccattctt tatttcagcgggtcctcctcgccctggtatgatgccagcaccccatatggggggccctcc catgatgccaatgatgggccctcctcctcctgggatgatgccagtgggacctgctcctgg aatgaggccgcccatgggaggccatatgccaatgatgcctgggcccccaatgatgagacc tcctgcccgtcccatgatggtgcccactcggcccggaatgactcgaccagacagataagg atagaggggaggccttattgtatcggttttatattacctgttctgcttcaccaggagatc atgctgctgtga >gi568815592f:34657841_34873567|GENSCAN_predicted_peptide_3|1447_aa MAAAAAVSGAHAAARANCFISEPHALPLSNGDNNAYLMWDIAKINSDVMKTLRSVPGAKF TKNLSPDKINLSTLKGEGQLTNLELDEEVLQNVLELPTWLAITRVYCNRASIRCLDKVEV EMKTCEDPRPPNGQSPIALASGQSEYGFAEKVVEGMFIIVNSITIKIHSKAFHASFELWQ LQGYSVNPNWQQSDLRLTRITDPCRGEVLTFKEITWQTLRIEADATDNGDQDPVTTPLRL ITNQGRIQIALKRRTKDCNVISSKLMFLLDDLLWVLTDSQLKAMMKYAESLSEAMEKSAH QRKSLAPEPVQITPPAPSAQQSWAQAFGGSQGNSNSSSSRLSQYFEKFDVKESSYHLLIS RLDLHICDDSQSREPGVNSFTLSGRQRLYKSCAMPCCAVPSKLSPKSIASAGQLHLLQTI VVQGQGMTFLCYGYMAVITGLVTGALFSSFGEVMFSWMVLVLVEQEESLLVVTTGVSANR LMGGAMQLTFRKMAFDYYPFHWAGDSCKHWVRHCEAMETRGQWAQKLVMEFQSKMEKWHE ETGLKPPWHLGVDSLFRRKADSLSSPRKNPLERSPSQGRQPAFQPPAWNRLRSSCMVVRV DDLDIHQVSAIHIEFTEYYFPDNQELPVPCPNLYIQLNGLTFTMDPVSLLWGNLFCLDLY RSLEQFKAIYKLEDSSQKDEHLDIRLDAFWLKRPKASWDLWSVHFTQISLDFEGTENFKG HTLNFVAPFPLSIWACLPLRWQQAQARKLLLASEGRLKPSASFGSPVQSEALAPDSMSHP RSKTEHDLKSLSGLTEVMEILKEGSSGMDNKGPLTELEDVADVHMLVHSPAHVRVRLDHY QYLALLRLKEVLQRLQEQLTKDTESMTGSPLQNQTACIGVLFPSAEVALLMHPAPGAVDA DSAGSDSTSLVDSELSPSEDRELKSDASSDQGPASPEKVLEESSIENQDVSQERPHSNGE LQDSGPLAQQLAGKGHEAVESLQAKKLSRTQASSSPAALKPPAGRETAVNGQGELIPLKN IEGELSSAIHMTKDATKEALHATMDLTKEAVSLTKDAFSLGRDRMTSTMHKMLSLPPAKE PMAKTDEGVAAPVSGGAARLRFFSMKRTVSQQSFDGVSLDSSGPEDRISVDSDGSDSFVM LLESESGPESVPPGSLSNVSDNAGVQGSPLVNNYGQGSPAANSSVSPSGEDLIFHPVSVL VLKVNEVSFGIEVRGEDLTVALQAEELTLQQLGTVGLWQFLHGQCPGTCFQESSTLKTGH IRPAVGLRFEVGPGAAVHSPLASQNGFLHLLLHGCDLELLTSVLSGLGPFLEDEEIPVVV PMQIELLNSSITLKDDIPPIYPTSPGPIPITLAMEHVVLKRSDDGVFHIGAAAQDKPSAE VLKSEKRQPPKEQVFLVPTGEVFEQQVKELPILQKELIETKQALANANQDKEKLLQEIRK YNPFFEL >gi568815592f:34657841_34873567|GENSCAN_predicted_CDS_3|4344_bp atggcggcggcggcggctgtgtccggtgctcacgccgcggcgagggcaaactgcttcata tccgaacctcatgctcttcctctgagcaatggggataataatgcatacctcatgtgggat attgcaaagattaattcagatgttatgaagacccttcgttcagtgcctggtgcaaagttc actaagaatctttccccagacaaaatcaacctgagcaccctgaaaggggagggtcagctg accaacctggagctggatgaagaggttctacagaatgtactggagctgcccacctggtta gccatcactcgggtctactgcaacagggcctccatccggtgtctggataaggtagaggtg gagatgaagacatgtgaggatcctcggccccccaatggacagtctcccattgcccttgct tcaggacagagtgaatatggctttgccgaaaaggtggtggaagggatgttcatcattgtc aattctatcaccatcaagattcactccaaggccttccacgcttcttttgaattgtggcag ctccagggctatagtgtcaaccccaactggcagcagagtgaccttcgccttacccgcatc actgacccctgccgaggagaggttttaacatttaaggaaataacttggcaaacactccga attgaggcagatgctacagacaatggtgatcaggacccagtcaccactccattgaggctt attacgaaccaaggcaggatccaaatagccctcaaaagaagaaccaaagattgcaatgtg atatcctccaagctgatgttcctgttggatgacctgctctgggtgctgactgactcacag ctcaaggctatgatgaagtatgcagagtcactgagtgaagccatggagaagtcagcccat caaagaaagagcctggcccctgaacctgtgcagatcactccaccagcccccagtgcccag cagtcctgggcccaggcatttggtggcagccagggcaacagcaacagcagcagcagccgc ctcagccagtactttgagaaatttgatgtgaaagagtcctcctaccatctgctcatctcc cgcctggacctgcacatttgtgatgatagccagtcccgagagccaggggtcaatagcttc actctgtctgggcgccagcgcctgtacaagagctgtgccatgccgtgctgtgccgtgccg tccaagctgagccccaagagcatagcatcagcagggcagttacaccttttacagacaata gtggtacagggccaagggatgaccttcctatgttatggctacatggctgtgataacaggt ttggtcactggggccttatttagttcatttggtgaggtcatgttttcctggatggtcttg gtgctggtggagcaggaggagtctctccttgtggtcaccacaggtgtctctgccaacaga ctcatgggtggtgccatgcagcttaccttccgcaagatggcgtttgactattaccctttc cattgggcaggtgatagctgcaaacattgggtacgccactgtgaggccatggagacccga ggccagtgggcccagaagctggtgatggaatttcagagcaaaatggagaagtggcatgaa gagacgggtctgaaaccaccctggcaccttggagtagactctctctttcggagaaaagca gattctctttccagtcctcgaaagaaccctcttgagagaagcccctctcagggcagacag cctgcctttcagcctccagcatggaaccgcttacgctctagctgcatggtggtacgggtg gatgacctggacatccaccaggtctctgccattcatattgagttcacagagtattacttc ccagataatcaggagcttccagttccttgtcctaatctctacattcagttaaatggtctg acatttactatggatcctgtcagtttgctctggggaaacctcttttgcctggatttatac cgcagcttggagcagttcaaagctatctacaagctggaagattcaagtcagaaagatgaa cacttggacatccgactagatgcattctggttgaagagacctaaggcttcctgggatctc tggtctgtccactttacccagatctccttggactttgagggaacagaaaacttcaaaggc cataccttgaattttgtagcccccttccccctgtccatttgggcctgcctacccctccgc tggcagcaagcccaggcacggaagcttcttttggcctcagaggggaggctgaaaccatca gccagttttggaagtcctgtccagtctgaggctcttgcccctgactctatgtcccatccg cggtcaaagactgaacatgacttgaaaagcttatcaggacttacagaagtcatggaaatt ctgaaagaaggcagtagtggtatggacaacaaagggcctctgacagagctggaggatgta gcagatgttcatatgcttgtacattccccggcccatgtccgcgtgaggcttgaccactac cagtacttggctctgcttcgcctgaaggaggtgctgcagaggcttcaggagcagctgact aaggatacagagtcaatgactgggtctcccctgcagaatcagacagcttgcattggagtt ctctttcccagtgctgaagtggctctgcttatgcatcctgcacccggtgctgtcgatgct gactctgcaggctcagatagcactagcctcgtagattcagagctatctccttcagaggat cgggaactgaagtctgatgcctcatcagaccagggcccagcaagccctgagaaggtcttg gaggaaagtagcattgaaaatcaggatgtatcccaggagaggccacatagcaatggagaa ctgcaggactcaggtccacttgcccagcagctggcagggaagggccatgaggcagtagag tccctacaggccaagaaactgagcagaacccaagcctccagctcaccagctgcattgaag cccccagctggcagggagactgctgtgaatggacagggtgagctcatccccttgaagaac attgagggagaattgtcaagtgctattcacatgaccaaggatgccaccaaggaggctcta catgccaccatggacctcaccaaggaagctgtgtccctgactaaggatgccttcagtttg ggcagagatcgaatgacctccaccatgcacaagatgttgtccctgcccccagccaaggag cccatggccaagacagatgagggggtggcagccccagtgagtggaggtgctgcacgactc cgatttttctccatgaagaggacggtatctcaacagtcatttgatggtgtctcattggat agcagtggccctgaagaccggatttcagtggacagtgatggcagtgatagctttgtgatg ctcttggagtctgagtctggtccagaatctgttccaccaggatctctttcaaatgtctca gataatgctggtgttcaagggagccctcttgtgaataattatggccaggggtcaccagca gccaacagttcagtttcacccagtggagaagacctcatctttcacccggtctcagttctg gtcctgaaggtgaatgaggtgtcttttgggattgaggtacgtggtgaggacctgactgtg gccctgcaagcagaggaactgaccctccagcagctgggcaccgtgggactctggcagttc ctgcatggacagtgcccaggtacatgctttcaggaatcctcaactttgaagactggccac atcaggccagctgtgggccttcgctttgaggtggggcctggagcagctgttcattccccc ctggcctcacaaaatggcttcctacatttattgcttcatggctgtgacctcgagctgctc acttcagtgctcagtggcctggggcccttcttggaggatgaggagatcccggtggtagtc cccatgcagattgagcttctgaactccagcatcaccctaaaggatgatatcccccccatc tatccaacatctccaggccccatccccatcactctggccatggaacatgttgtgctgaag aggagtgatgatggtgtgttccacataggcgctgctgctcaggacaaaccatcagctgaa gtacttaaaagtgagaagagacagcccccaaaagaacaggtgtttttggtgcccacagga gaggtttttgaacagcaggtgaaagaactgcctatcctacaaaaagaacttatagaaact aaacaagccttggccaatgccaaccaggataaagaaaaacttcttcaggagattaggaaa tataaccccttctttgagctctga