GENSCAN 1.0 Date run: 4-Nov-116 Time: 17:25:32 Sequence gi568815590f:38687583_38948020 : 260438 bp : 44.15% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 2401 2396 6 1.05 1.05 Term - 8541 8351 191 1 2 44 48 133 0.175 2.51 1.04 Intr - 13432 13372 61 1 1 56 91 60 0.150 1.41 1.03 Intr - 22896 22855 42 0 0 94 105 20 0.300 2.74 1.02 Intr - 23491 23371 121 0 1 50 95 59 0.146 3.30 1.01 Init - 32484 32399 86 0 2 88 48 74 0.045 2.18 1.00 Prom - 35958 35919 40 -2.06 2.05 PlyA - 36084 36079 6 -3.44 2.04 Term - 36483 36362 122 1 2 119 47 137 0.954 11.34 2.03 Intr - 47218 47100 119 0 2 87 50 47 0.043 0.91 2.02 Intr - 51278 51148 131 2 2 60 33 86 0.122 -0.21 2.01 Init - 55334 55332 3 2 0 113 81 0 0.248 1.80 2.00 Prom - 94923 94884 40 -2.46 3.00 Prom + 95542 95581 40 -5.06 3.01 Init + 97845 97920 76 2 1 48 83 53 0.751 2.05 3.02 Intr + 99975 100228 254 1 2 83 18 269 0.491 16.55 3.03 Intr + 100505 100668 164 1 2 14 61 162 0.952 4.87 3.04 Intr + 101122 101237 116 1 2 85 98 54 0.920 6.19 3.05 Intr + 131940 133053 1114 1 1 81 92 681 0.976 56.59 3.06 Intr + 137726 137786 61 2 1 115 100 40 0.988 6.64 3.07 Intr + 139586 139793 208 1 1 92 80 146 0.848 12.85 3.08 Intr + 143543 143595 53 0 2 53 89 60 0.701 1.23 3.09 Intr + 148580 148705 126 1 0 95 83 173 0.997 18.38 3.10 Intr + 150888 150964 77 2 2 63 50 123 0.968 4.31 3.11 Intr + 152642 152685 44 2 2 101 87 28 0.888 1.88 3.12 Intr + 154705 154865 161 2 2 43 75 90 0.897 2.91 3.13 Intr + 155707 155813 107 0 2 50 71 137 0.983 7.21 3.14 Intr + 159117 159237 121 0 1 40 87 139 0.997 9.50 3.15 Term + 160373 160441 69 1 0 86 53 86 0.988 2.84 3.16 PlyA + 162808 162813 6 1.05 4.11 PlyA - 163751 163746 6 1.05 4.10 Term - 178817 178712 106 1 1 66 45 102 0.064 1.58 4.09 Intr - 187006 186879 128 1 2 10 94 126 0.198 4.88 4.08 Intr - 215968 215745 224 2 2 105 86 72 0.643 6.55 4.07 Intr - 227016 226897 120 1 0 8 99 74 0.001 0.97 4.06 Intr - 233842 233678 165 2 0 41 109 43 0.039 1.63 4.05 Intr - 234351 234201 151 0 1 52 99 63 0.131 3.54 4.04 Intr - 235452 235366 87 0 0 94 78 126 0.951 12.37 4.03 Intr - 236693 236582 112 1 1 82 89 -11 0.192 -1.22 4.02 Intr - 243239 243141 99 1 0 24 97 145 0.936 8.23 4.01 Init - 244101 243989 113 0 2 74 20 107 0.802 2.18 4.00 Prom - 244734 244695 40 -5.76 5.00 Prom + 249972 250011 40 -1.46 5.01 Init + 251821 251851 31 0 1 61 103 7 0.035 -0.42 5.02 Intr + 258542 258639 98 0 2 55 103 87 0.099 6.63 5.03 Term + 259094 259165 72 1 0 78 44 58 0.167 -1.69 5.04 PlyA + 259875 259880 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 230348 230488 141 1 0 59 94 250 0.982 22.83 S.002 Term + 231405 231515 111 2 0 95 41 27 0.820 -2.74 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590f:38687583_38948020|GENSCAN_predicted_peptide_1|166_aa MAGCRSRALPRGKAAKARREIEPSSCWPRCPGFSVLSAAARFSITSPSRPSRNPTIASLS LFNNVDTYMLSAQVLVMLLLDALIWNEDLVSQSSMIPATAIGSEQAQPAMLNMNSKCQPS MHACKRTAPQTALALTHQGLFQCVMDPTVQGIQVAALELGYLKQHS >gi568815590f:38687583_38948020|GENSCAN_predicted_CDS_1|501_bp atggcgggctgcaggtcccgagccctgccccgcgggaaggcagctaaggcccggcgagaa attgagcccagcagctgctggcccaggtgcccaggcttctctgttttatcagctgctgca agattctccatcacctccccaagcagacccagcaggaatccaaccattgcaagcctatca cttttcaacaacgtggacacttacatgctttcagcacaggtgcttgtgatgttgttactg gatgctctgatatggaacgaagacctcgtcagtcaatcttccatgatcccggccaccgca attggctcagagcaggcccagcctgccatgctcaacatgaattctaagtgccaaccaagc atgcacgcctgcaagcggacagcaccacagactgctctggcattaactcatcaggggctg tttcagtgtgtcatggaccccactgttcagggcatccaagttgctgccctggagcttggc tatctgaaacagcactcctga >gi568815590f:38687583_38948020|GENSCAN_predicted_peptide_2|124_aa MLRGPRTNGTPVSTSTSSTKILVFNTMHQRKEPGILEEMADSRTWKLKGASQLTMASAAL RLASNATARPGRPSKASAASSHADGLYSRDVNKCPHINLYTNAYSSIIHNSCFRKQPKCP STDE >gi568815590f:38687583_38948020|GENSCAN_predicted_CDS_2|375_bp atgctgagagggcctagaaccaatggcaccccagtatcaacaagcacatctagcaccaag attttggttttcaacaccatgcaccaacgaaaggaaccagggatacttgaagaaatggct gattctaggacttggaaactcaagggggcatcgcagctaactatggctagtgcagccctc aggctggccagcaacgcgactgccaggcccggcaggccaagcaaagcctcggctgccagc agccatgcagatggtctatactcgagagatgtgaacaaatgtccacacataaacctgtac acgaatgcttatagcagcattattcataacagctgcttccgaaaacaacccaaatgtcca tcaactgatgaatga >gi568815590f:38687583_38948020|GENSCAN_predicted_peptide_3|916_aa MGTERLEHESLISQAGVTSGKKTFRGAGAGGAALMAFSPWQILSPVQWAKWTWSAVRGGA AGEDEAGGPEGDPEEEDSQAETKSLSFRQVHGVPAEMQTRLPPSICDSLCGPPVGKKSLP GPSRGPTVYPKRSRGEEGRVMTGPWQLLGNGGLCRAVIRVIAEGSSDSEGNFETPEAETP IRSPFKESCDPSLGLAGPGAKSQESQEADEQLVAEVVEKCSSKTCSKPSENEVPQQAIDS HSVKNFREEPEHDFSKISIVRPFSIETKDSTDISAVLGTKAAHGCVTAVSGKALPSSPPD ALQDEAMTEGSMGVTLEASAEADLKAGNSCPELVPSRRSKLRKPKPVPLRKKAIGGEFSD TNAAVEGTPLPKASYHFSPEELDENTSPLLGDARFQKSPPDLKETPGTLSSDTNDSGVEL GEESRSSPLKLEFDFTEDTGNIEARKALPRKLGRKLGSTLTPKIQKDGISKSAGLEQPTD PVARDGPLSQTSSKPDPSQWESPSFNPFGSHSVLQNSPPLSSEGSYHFDPDNFDESMDPF KPTTTLTSSDFCSPTGNHVNEILESPKKAKSRLITSGCKVKKHETQSLALDACSRDEGAV ISQISDISNRDGHATDEEKLASTSCGQKSAGAEVKGEPEEDLEYFECSNVPVSTINHAFS SSEAGIEKETCQKMEEDGSTVLGLLESSAEKAPVSVSCGGESPLDGICLSESDKTAVLTL IREEIITKEIEANEWKKKYEETRQEVLEMRKIVAEYEKTIAQMIEDEQRTSMTSQKSFQQ LTMEKEQALADLNSVERSLSDLFRRYENLKGVLEGFKKNEEALKKCAQDYLARVKQEEQR YQALKIHAEEKLDKANEEIAQVRTKAKAESAALHAGLRKEQMKVESLERALQQKNQEIEE LTKICDELIAKLGKTD >gi568815590f:38687583_38948020|GENSCAN_predicted_CDS_3|2751_bp atgggaaccgaaagattagagcacgagtcactgatcagccaggcaggggtgacctcaggc aagaaaaccttcagaggagctggagccggaggagccgcgctcatggcgttcagcccgtgg cagatcctgtcccccgtgcagtgggcgaaatggacgtggtctgcggtacgcggcggggcc gccggcgaggacgaggctggcgggcccgagggcgaccccgaggaggaggattcgcaagcc gagaccaaatccttgagtttcaggcaagtacacggcgtccccgctgagatgcagacgcgc ttgcctccatccatctgcgactccctctgtggcccccccgtggggaagaagtcgctcccc ggcccttcccggggccctaccgtgtacccaaaacgcagccgcggtgaagaaggtcgcgtg atgacaggcccctggcagctgctgggaaatggtggtctctgccgggctgtcatccgcgtg attgcggaaggcagctcggattctgaaggtaattttgagactcctgaagctgaaaccccg atccgatcacctttcaaggagtcctgtgatccatcactcggattggcaggacctggggcc aaaagccaagaatcacaagaagctgatgaacagcttgtagcagaagtggttgaaaaatgt tcatctaagacttgttctaaaccttcagaaaatgaagtgccacagcaggccattgactct cactcagtcaagaatttcagagaagaacctgaacatgattttagcaaaatttccatcgtg aggccattttcaatagaaacgaaggattccacggatatctcggcagtcctcggaacaaaa gcagctcatggctgtgtaactgcagtctcaggcaaggctctgccttccagcccgccagac gccctccaggacgaggcgatgacagaaggcagcatgggggtcaccctcgaggcctccgca gaagctgatctaaaagctggcaactcctgtccagagcttgtgcccagcagaagaagcaag ctgagaaagcccaagcctgtccccctgaggaagaaagcaattggaggagagttctcagac accaacgctgctgtggagggcacacctctccccaaggcatcctatcacttcagtcctgaa gagttggatgagaacacaagtcctttgctaggagatgccaggttccagaagtctccccct gaccttaaagaaactcccggcactctcagtagtgacaccaacgactcaggggttgagctg ggggaggagtcgaggagctcacctctcaagcttgagtttgatttcacagaagatacagga aacatagaggccaggaaagcccttccaaggaagcttggcaggaaactgggtagcacactg actcccaagatacaaaaagatggcatcagtaagtcagcaggtttagaacagcctacagac ccagtggcacgagacgggcctctctcccaaacatcttccaagccagatcctagtcagtgg gaaagccccagcttcaacccctttgggagccactctgttctgcagaactccccacccctc tcttctgagggctcctaccactttgacccagataactttgacgaatccatggatcccttt aaaccaactacgaccttaacaagcagtgacttttgttctcccactggtaatcacgttaat gaaatcttagaatcacccaagaaggcaaagtcgcgtttaataacgagtggctgtaaggtg aagaagcatgaaactcagtctctcgccctggatgcatgttctcgggatgaaggggcagtg atctcccagatttcagacatttctaatagggatggccatgctactgatgaggagaaactg gcatccacgtcatgtggtcagaaatcagctggtgccgaggtgaaaggtgagccagaggaa gacctggagtactttgaatgttccaatgttcctgtgtctaccataaatcatgcgttttca tcctcagaagcaggcatagagaaggagacgtgccagaagatggaagaagacgggtccact gtgcttgggctgctggagtcctctgcagagaaggcccctgtgtcggtgtcctgtggaggt gagagccccctggatgggatctgcctcagcgaatcagacaagacagccgtgctcacctta ataagagaagagataattactaaagagattgaagcaaatgaatggaagaagaaatacgaa gagacccggcaagaagttttggagatgaggaaaattgtagctgaatatgaaaagactatt gctcaaatgattgaagatgaacaaaggacaagtatgacctctcagaagagcttccagcaa ctgaccatggagaaggaacaggccctggctgaccttaactctgtggaaaggtccctttct gatctcttcaggagatatgagaacctgaaaggtgttctggaagggttcaagaagaatgaa gaagccttgaagaaatgtgctcaggattacttagccagagttaaacaagaggagcagcga taccaggccctgaaaatccacgcagaagagaaactggacaaagccaatgaagagattgct caggttcgaacaaaagcaaaggctgagagtgcagctctccatgctggactccgcaaagag cagatgaaggtggagtccctggaaagggccctgcagcagaagaaccaagaaattgaagaa ctgacaaaaatctgtgatgagctgattgcaaagctgggaaagactgactga >gi568815590f:38687583_38948020|GENSCAN_predicted_peptide_4|434_aa MVKFPTSPYKPTEFIKYWKTNAALPTTGKQRFLVTSPRDTAAGERKGYASQQPTESPWLL HQQPTERKEERRAHLCPTMTAVSPTQNALLLLFISAWGRVGITIKGHLLTLKLPGGKGLT EFIFHIIYKTLHGLSTENLTETSTLKSGVAVWLFTQQRTNPTTEEVKCPELFLYHSKTLF FQEYAKPGCEERSNSPGRQLATARQDSPSKSVRVKHSPSGEAWREAGKSRQKSIRMTCTN TAGSNRVEFLVAQAHEGPLSTATQSCEMAGSGHPPQRDAGSAEQKPAPTKTCMSSRFILT TQKYRAEYGSAHYHDMMGSAGPISLTQKREESTNDKEKERVAKTLHQRAGSFSINTKTNG AKFPCSPDTADSEALPSGCHCLRLRHKQHNLEEDSWLQLRSASASDSHRSTNLTVNCVCE ESRLHAAYENLTNA >gi568815590f:38687583_38948020|GENSCAN_predicted_CDS_4|1305_bp atggtgaagttcccaaccagtccatacaagcccactgagttcataaagtactggaagacc aatgcagctctgcccaccaccggcaaacaacgttttctcgtgacatctcccagagacaca gcagctggtgagagaaaaggctacgccagccagcagcccacagagtcaccctggctgctg caccagcagccaacagagaggaaggaggaaaggcgagctcacctctgccccacaatgact gctgtttccccaactcagaacgcccttctcctcctcttcatctctgcttggggcagagta ggtataacaatcaaaggacatcttctaaccctgaagcttcctggtggcaaaggcctcacc gaattcattttccacatcatctacaagaccttgcacgggctgagcacagagaacttaact gaaacatccacacttaaatctggagtggccgtttggctgttcacacaacaaagaactaac ccaaccactgaagaagtaaaatgtcctgaattatttctgtatcacagcaagactctcttt tttcaagaatatgccaagccagggtgtgaagaaagatccaactctccagggaggcagctg gcaacagccagacaggattccccaagcaagtcagttcgggtaaagcacagccctagtggg gaagcatggcgtgaagcaggtaaatcaagacaaaaatccattcgaatgacttgcactaac acagcaggctccaacagagtggagttcctggtggcccaagcccatgaaggccctctgtcc acagccacacaaagttgtgagatggctggctcaggccatcccccacaacgggacgctggg tcagcagaacaaaaacctgcaccaaccaaaacctgcatgtcttcaagatttattttaacc actcaaaaataccgagctgaatatggttcagctcactaccatgacatgatgggatctgca ggccctatttccttaacacagaagagagaagaaagcaccaatgacaaggaaaaagaaagg gttgcaaaaactcttcaccaaagggcaggcagtttctcaatcaataccaagaccaatggt gccaagttcccttgctctccggatacggctgattcagaagctttgccaagtggatgccat tgcctgcgcctgcgtcacaagcagcacaacttagaagaggacagctggctgcagctcaga tcagcgtcagcatcggattctcatagaagcacaaaccttaccgtgaattgcgtgtgcgag gaatctaggttgcacgctgcttatgagaatctaaccaatgcctga >gi568815590f:38687583_38948020|GENSCAN_predicted_peptide_5|66_aa MSWFAESPFHVINALSQRYFLQANDQKDMKDWVEALNQASKITKSLLFKSITELVHGLSY ECALFP >gi568815590f:38687583_38948020|GENSCAN_predicted_CDS_5|201_bp atgagttggtttgccgaatcgcccttccatgttatcaatgccctgtctcagagatatttc cttcaagccaatgatcagaaagatatgaaggactgggttgaagccctgaaccaagccagc aagatcaccaaaagccttctgtttaaaagcatcacagagctcgttcatggactgtcctac gagtgtgctctttttccttga