GENSCAN 1.0 Date run: 6-Nov-116 Time: 17:52:22 Sequence gi568815595f:42402736_42636278 : 233543 bp : 44.69% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 Intr - 1389 1311 79 1 1 100 115 48 0.957 7.31 1.03 Intr - 4263 4111 153 1 0 87 131 208 0.998 25.04 1.02 Intr - 4529 4378 152 1 2 44 74 146 0.658 8.61 1.01 Init - 11561 11416 146 2 2 39 42 133 0.314 3.39 1.00 Prom - 13576 13537 40 -8.06 2.00 Prom + 15600 15639 40 -4.66 2.01 Init + 18877 19153 277 0 1 52 59 200 0.202 10.74 2.02 Term + 31644 31729 86 1 2 93 37 70 0.151 0.22 2.03 PlyA + 32072 32077 6 1.05 3.03 PlyA - 34257 34252 6 1.05 3.02 Term - 40723 40550 174 1 0 46 34 158 0.454 3.96 3.01 Init - 42697 42692 6 1 0 72 77 0 0.176 -1.83 3.00 Prom - 47831 47792 40 -0.76 4.03 PlyA - 48372 48367 6 1.05 4.02 Term - 56212 55984 229 0 1 36 39 189 0.387 4.90 4.01 Init - 91141 91032 110 1 2 57 92 68 0.359 3.92 4.00 Prom - 95021 94982 40 -4.56 5.00 Prom + 95126 95165 40 -6.76 5.01 Init + 100001 100078 78 1 0 73 101 139 0.608 12.76 5.02 Intr + 111014 111119 106 1 1 108 66 141 0.993 13.69 5.03 Intr + 116488 116595 108 2 0 101 116 46 0.665 8.96 5.04 Intr + 123152 123258 107 0 2 97 90 128 0.865 13.83 5.05 Intr + 124634 124761 128 1 2 21 94 183 0.520 11.68 5.06 Intr + 125256 125388 133 0 1 70 85 252 0.997 23.75 5.07 Intr + 128044 128197 154 0 1 137 39 178 0.854 17.45 5.08 Intr + 128736 128796 61 1 1 83 80 38 0.708 0.29 5.09 Intr + 129068 129134 67 2 1 76 91 48 0.791 2.81 5.10 Intr + 129507 129598 92 2 2 68 80 109 0.697 6.89 5.11 Intr + 132240 132369 130 0 1 93 80 162 0.992 16.60 5.12 Intr + 132608 132649 42 1 0 127 99 67 0.995 10.14 5.13 Term + 133355 133546 192 1 0 110 47 203 0.594 15.82 5.14 PlyA + 134816 134821 6 1.05 6.06 PlyA - 135963 135958 6 -0.45 6.05 Term - 139848 139782 67 2 1 88 47 64 0.040 -0.29 6.04 Intr - 148016 147893 124 0 1 16 64 110 0.068 1.04 6.03 Intr - 158561 158382 180 0 0 126 119 83 0.985 15.04 6.02 Intr - 160951 160788 164 0 2 70 10 103 0.650 0.32 6.01 Init - 166311 166130 182 0 2 72 98 78 0.741 4.05 6.00 Prom - 172644 172605 40 -2.16 7.00 Prom + 174653 174692 40 -6.86 7.01 Init + 188163 188231 69 2 0 59 88 55 0.319 3.65 7.02 Intr + 188790 188866 77 2 2 126 92 158 0.571 18.31 7.03 Term + 191687 191774 88 2 1 125 33 9 0.395 -3.77 7.04 PlyA + 192236 192241 6 1.05 8.00 Prom + 195115 195154 40 -4.36 8.01 Init + 198061 198145 85 0 1 59 41 119 0.950 3.28 8.02 Intr + 198249 198329 81 1 0 132 88 151 0.960 19.11 8.03 Intr + 216285 216392 108 1 0 120 75 31 0.913 5.26 8.04 Intr + 217281 217308 28 1 1 111 116 0 0.927 2.27 8.05 Intr + 228436 228581 146 1 2 49 95 114 0.861 8.13 8.06 Intr + 229941 230088 148 1 1 102 83 50 0.985 5.19 8.07 Intr + 230845 231000 156 1 0 28 103 138 0.969 8.43 8.08 Intr + 231878 231902 25 2 1 105 87 15 0.666 1.13 8.09 Term + 232486 232635 150 0 0 57 48 88 0.862 -0.39 8.10 PlyA + 233017 233022 6 -0.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 88401 88293 109 2 1 149 42 53 0.960 4.78 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:42402736_42636278|GENSCAN_predicted_peptide_1|177_aa MENSIRGFNRKFNIPEERIPELPGRPEYMIQDAVQGYRHEKYETLRHGGSLEKMKASVVL SLLGYLVVPSGAYILGRCTVAKKLHDGGLDYFEGYSLENWVCLAYFESKFNPMAIYENTR EGYTGFGLFQMRGSDWCGDHGRNRCHMSCSALLNPNLEKTIKCAKTIVKGKEGMGAX >gi568815595f:42402736_42636278|GENSCAN_predicted_CDS_1|531_bp atggaaaactcaattcgtggatttaaccgcaaatttaatattcctgaagagagaattcct gagctgccaggtagacctgaatacatgatccaggatgcagtccagggatacagacatgag aaatatgagacattgagacatggagggagcctggagaagatgaaggcatccgtggttctc tccctccttggctacctggtggttccaagtggtgcttacatcttggggcgttgcacagtg gctaagaaactccacgatggaggcctggattattttgagggctatagccttgagaactgg gtgtgcctggcctacttcgagagcaagttcaaccccatggccatctacgagaacacacgt gagggctacactggctttggcctctttcagatgcgtggcagtgactggtgtggcgaccat ggcaggaaccgctgccatatgtcatgttccgctttactgaatcctaatttagagaagaca attaaatgtgccaagaccattgtaaaaggaaaagaagggatgggagcatgn >gi568815595f:42402736_42636278|GENSCAN_predicted_peptide_2|120_aa MAPPEVILLNTHSAAFMLQRQRYGWVLMLLLNSPCDLGQETSLSKPQYPYVQNGAESGAN LFRSYGKFLVHHKLSMNAKDKEEERPGVQCGADLAAFSGSNSSGKDSLCSQTSMLIQTFM >gi568815595f:42402736_42636278|GENSCAN_predicted_CDS_2|363_bp atggctcctccagaggtcatccttttaaacacgcactcagctgcttttatgctccagaga cagagatatggctgggtcctgatgctgcttcttaatagcccgtgtgacctaggacaagaa acatcactctccaaacctcagtatccctacgtgcaaaatggggctgaaagtggcgctaac ctcttcaggtcgtatggaaagttcctggtccaccacaagctctccatgaatgccaaggac aaggaagaagagcgcccaggggtgcagtgcggagcggatcttgctgccttctcaggatcc aacagtagtgggaaagattctctttgctctcagacctccatgctgattcagaccttcatg taa >gi568815595f:42402736_42636278|GENSCAN_predicted_peptide_3|59_aa MELQPMAEYVFSDEEGRQQNQKKYKERCFFITPPQSLVNFPDFVTHGPKILLKESSTRR >gi568815595f:42402736_42636278|GENSCAN_predicted_CDS_3|180_bp atggagctacagcccatggcagagtacgtgttttctgatgaagaaggaagacagcaaaat cagaaaaaatacaaagagagatgcttcttcatcacaccaccccaaagccttgtcaacttt ccagacttcgtgacccatggacctaagattctgctgaaagagtcttccacaagacgttaa >gi568815595f:42402736_42636278|GENSCAN_predicted_peptide_4|112_aa MGPESGLPLAEKVEHSPGLCLSDEDLRTLKKRPEKPSKSCEIYQAQNNKFLLEKSVSICA RFHRIYDRANQGNHKRDCGYGKKVEDEEFRDMAVGEIQELTDTTLEELTEDD >gi568815595f:42402736_42636278|GENSCAN_predicted_CDS_4|339_bp atgggcccagagtctgggctcccattggctgagaaggtagagcacagcccaggtctgtgc ctctctgacgaggacctcaggaccctgaagaaacgcccagagaaacccagtaaaagctgt gaaatctatcaagcccaaaacaataaattcctgctagagaaatctgtgtccatatgtgca agatttcacagaatttatgacagagccaatcaaggaaatcataaaagagattgtggatat ggcaagaaggtagaggatgaagagtttagagatatggctgttggagaaattcaagagctg acagacactacactagaggaattaacagaagacgactaa >gi568815595f:42402736_42636278|GENSCAN_predicted_peptide_5|465_aa MRPPSPLPARWLCVLAGALAWALGPAGGQAARLQEECDYVQMIEVQHKQCLEEAQLENET IGCSKMWDNLTCWPATPRGQVVVLACPLIFKLFSSIQGRNVSRSCTDEGWTHLEPGPYPI ACGLDDKAASLDESLSLSLQQQQTMFYGSVKTGYTIGYGLSLATLLVATAILSLFRKLHC TRNYIHMHLFISFILRAAAVFIKDLALFDSGESDQCSEGSVGCKAAMVFFQYCVMANFFW LLVEGLYLYTLLAVSFFSERKYFWGYILIGWGVPSTFTMVWTIARIHFEDYGCWDTINSS LWWIIKGPILTSILVNFILFICIIRILLQKLRPPDIRKSDSSPYSRLARSTLLLIPLFGV HYIMFAFFPDNFKPEVKMVFELVVGSFQGFVVAILYCFLNGEVQAELRRKWRRWHLQGVL GWNPKYRHPSGGSNGATCSTQVSMLTRVSPGARRSSSFQAEVSLV >gi568815595f:42402736_42636278|GENSCAN_predicted_CDS_5|1398_bp atgcgcccgccaagtccgctgcccgcccgctggctatgcgtgctggcaggcgccctcgcc tgggcccttgggccggcgggcggccaggcggccaggctgcaggaggagtgtgactatgtg cagatgatcgaggtgcagcacaagcagtgcctggaggaggcccagctggagaatgagaca ataggctgcagcaagatgtgggacaacctcacctgctggccagccacccctcggggccag gtagttgtcttggcctgtcccctcatcttcaagctcttctcctccattcaaggccgcaat gtaagccgcagctgcaccgacgaaggctggacgcacctggagcctggcccgtaccccatt gcctgtggtttggatgacaaggcagcgagtttggatgagagcctctccctgtccctccaa cagcagcagaccatgttctacggttctgtgaagaccggctacaccattggctacggcctg tccctcgccacccttctggtcgccacagctatcctgagcctgttcaggaagctccactgc acgcggaactacatccacatgcacctcttcatatccttcatcctgagggctgccgctgtc ttcatcaaagacttggccctcttcgacagcggggagtcggaccagtgctccgagggctcg gtgggctgtaaggcagccatggtctttttccaatattgtgtcatggctaacttcttctgg ctgctggtggagggcctctacctgtacaccctgcttgccgtctccttcttctctgagcgg aagtacttctgggggtacatactcatcggctggggggtacccagcacattcaccatggtg tggaccatcgccaggatccattttgaggattatgggtgctgggacaccatcaactcctca ctgtggtggatcataaagggccccatcctcacctccatcttggtaaacttcatcctgttt atttgcatcatccgaatcctgcttcagaaactgcggcccccagatatcaggaagagtgac agcagtccatactcaaggctagccaggtccacactcctgctgatccccctgtttggagta cactacatcatgttcgccttctttccggacaattttaagcctgaagtgaagatggtcttt gagctcgtcgtggggtctttccagggttttgtggtggctatcctctactgcttcctcaat ggtgaggtgcaggcggagctgaggcggaagtggcggcgctggcacctgcagggcgtcctg ggctggaaccccaaataccggcacccgtcgggaggcagcaacggcgccacgtgcagcacg caggtttccatgctgacccgcgtcagcccaggtgcccgccgctcctccagcttccaagcc gaagtctccctggtctga >gi568815595f:42402736_42636278|GENSCAN_predicted_peptide_6|238_aa MSVIFFACVVRVRDGLPLSASTDFYHTQDFLEWRRRLKSLALRLAQYPGRGSAEGCDFSI HFSSFGDVACMAICSCQCPAAMAFCFLETLWWEFTASYDTTCIGLASRPYAFLEFDSIIQ KVKWHFNYVSSSQMECSLEKIQEELKLQPPAVLTLEDTDVANGVMNGHTPMHLEPGSKET VLEDESFLVMLPDHRGTTRDVIHTRSNVWISVNIYMGFNGCATEILAVMFGKVAAAMV >gi568815595f:42402736_42636278|GENSCAN_predicted_CDS_6|717_bp atgtccgtgatcttttttgcctgcgtggtacgggtaagggatggactgcccctctcagcc tctactgatttttaccacacccaagattttttggaatggaggagacggctcaagagttta gccttgcgactggcccagtatccaggtcgaggttctgcagaaggttgtgactttagtata catttttcttctttcggggacgtggcctgcatggctatctgctcctgccagtgtccagca gccatggccttctgcttcctggagaccctgtggtgggaattcacagcttcctatgacact acctgcattggcctagcctccaggccatacgcttttcttgagtttgacagcatcattcag aaagtgaagtggcattttaactatgtaagttcctctcagatggagtgcagcttggaaaaa attcaggaggagctcaagttgcagcctccagcggttctcactctggaggacacagatgtg gcaaatggggtgatgaatggtcacacaccgatgcacttggagcctggcagtaaagaaacg gtccttgaagatgagtccttcctggtaatgcttcctgaccaccgaggcactaccagagat gttatccacaccaggtcgaatgtgtggatatcagttaacatctacatgggctttaatgga tgtgcaaccgagatattggcagtaatgttcggcaaggtggcagctgcaatggtgtga >gi568815595f:42402736_42636278|GENSCAN_predicted_peptide_7|77_aa MSVAFVPDWLRGKAEVNQETIQRLLEENDQLIRCIVEYQNKGRGNECVQYQHVLHRNLIY LATIADASPTSTSKAME >gi568815595f:42402736_42636278|GENSCAN_predicted_CDS_7|234_bp atgtcggtggccttcgtaccggactggctgaggggcaaggcggaagtcaatcaagagact atccagcggctccttgaggagaatgaccagctgatccgctgtattgtggagtatcagaac aagggccgcgggaacgagtgcgtgcagtaccagcatgtgttacatagaaatctcatttat ttggctaccattgcagatgccagtccaaccagcacttcaaaagcaatggaataa >gi568815595f:42402736_42636278|GENSCAN_predicted_peptide_8|308_aa MRARCVGAARPGAGGGLVLASCAVATGRPLAATSVAMGAQDRPQCHFDIEINREPGEKGL GKTTGKKLCYKGSTFHRVVKNFMIQGGDFSEENVVFCKMKRVHVVFGLVISGFEVIEQIE NLKTDAASRPYADVRVIDCGVLATKSIKDESSSESELEHERSRRRKHKRRPKVKRSKKRR KEASSSEEPRNKHAMNPKGHSERSDTNEKRSVDSSAKREKPVVRPEEIPPVPENRFLLRR DMPVVTAEPEPKIPDVAPIRYHTPPRSRSCSESDDDDSSETPPHWKEEMQRLRAYRPPSG EKWSKGDK >gi568815595f:42402736_42636278|GENSCAN_predicted_CDS_8|927_bp atgcgtgcgcgctgcgtgggagccgcgaggcccggggcgggagggggcctcgtcttggcc tcctgcgctgtcgcgacgggccggcctcttgccgccacctcggtcgcgatgggggcgcag gaccggccgcagtgccacttcgacatcgagatcaaccgggagccgggagagaaaggcctt gggaaaacaactgggaagaagttatgttataaaggttctacgttccatcgtgtggttaaa aactttatgattcagggtggggacttcagtgaagagaatgtggtcttttgcaaaatgaaa agggtgcatgtagtctttggactggttatttctggttttgaagtaatcgaacaaattgaa aatctgaagaccgatgctgcaagcagaccatatgcagatgtgcgagttattgactgtgga gtacttgccacaaaatcaataaaagatgaatcatcttcagaaagtgaacttgaacatgag agaagcagaaggaggaaacataagaggaggccaaaagttaaacgttctaaaaagaggcga aaggaagcaagcagttcagaagagccaaggaataaacatgcaatgaacccaaaaggtcac tctgagaggagtgataccaatgaaaaaaggtcagttgattccagtgctaaaagggaaaaa cctgtggtccgcccagaagagattcctccagtgcctgagaaccgatttttactgagaaga gatatgcctgttgttactgcagaacctgaaccgaagattcctgatgttgcacccattcgc tatcacacacctccaagatcaagatcctgttctgagtcagatgatgatgacagcagtgaa actcctcctcactggaaagaggaaatgcagagattaagagcatatagaccacctagtgga gaaaaatggagtaaaggagataagtaa