GENSCAN 1.0 Date run: 3-Nov-116 Time: 22:18:04 Sequence gi568815597f:26429921_26674198 : 244278 bp : 47.57% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2642 2803 162 1 0 95 27 118 0.689 6.13 1.02 Intr + 8248 8364 117 0 0 117 90 184 0.995 22.16 1.03 Intr + 12811 12953 143 0 2 112 69 175 0.997 17.15 1.04 Intr + 16396 16512 117 1 0 84 108 103 0.925 11.48 1.05 Intr + 17639 17740 102 2 0 148 68 53 0.742 8.59 1.06 Intr + 27871 27985 115 1 1 78 96 84 0.210 8.65 1.07 Intr + 30117 30224 108 2 0 95 110 63 0.927 9.68 1.08 Term + 38972 39211 240 1 0 69 42 261 0.640 15.63 1.09 PlyA + 39686 39691 6 -0.45 2.00 Prom + 40715 40754 40 -4.66 2.01 Init + 42693 42707 15 2 0 114 102 14 0.770 5.45 2.02 Intr + 43563 43607 45 2 0 92 100 51 0.960 5.31 2.03 Intr + 44165 44215 51 1 0 40 115 55 0.778 2.50 2.04 Intr + 44652 44747 96 2 0 77 75 124 0.955 10.21 2.05 Term + 45193 45228 36 0 0 73 45 58 0.723 -2.66 2.06 PlyA + 46024 46029 6 1.05 3.00 Prom + 67247 67286 40 -2.16 3.01 Sngl + 70281 71015 735 2 0 101 50 141 0.701 7.88 3.02 PlyA + 73852 73857 6 1.05 4.03 PlyA - 73989 73984 6 1.05 4.02 Term - 89516 89434 83 0 2 97 43 68 0.889 1.06 4.01 Init - 90369 90150 220 0 1 39 34 186 0.668 7.09 4.00 Prom - 96775 96736 40 -7.46 5.00 Prom + 97343 97382 40 -4.36 5.01 Init + 100001 100063 63 1 0 96 105 76 0.609 11.30 5.02 Intr + 107005 107049 45 0 0 125 100 24 0.215 5.91 5.03 Intr + 109274 109487 214 1 1 51 38 122 0.165 1.79 5.04 Intr + 109743 109808 66 1 0 100 76 19 0.116 0.78 5.05 Intr + 112653 112793 141 1 0 97 43 36 0.093 0.32 5.06 Intr + 114155 114236 82 0 1 118 38 40 0.098 0.70 5.07 Intr + 116043 116115 73 0 1 83 109 40 0.124 5.01 5.08 Intr + 116947 117063 117 0 0 75 87 160 0.711 15.26 5.09 Intr + 117269 117350 82 1 1 117 113 120 0.999 16.61 5.10 Intr + 121477 121557 81 2 0 74 58 107 0.980 5.91 5.11 Intr + 121724 121803 80 0 2 108 83 67 0.999 7.47 5.12 Intr + 123471 123577 107 2 2 118 81 99 0.999 11.21 5.13 Intr + 124294 124331 38 1 2 92 105 62 0.999 6.21 5.14 Intr + 124676 124818 143 0 2 80 101 245 0.954 25.07 5.15 Intr + 125231 125301 71 1 2 69 74 85 0.882 3.18 5.16 Intr + 125617 125705 89 1 2 75 111 91 0.998 9.71 5.17 Intr + 126734 126798 65 0 2 78 82 69 0.980 3.74 5.18 Intr + 127078 127180 103 0 1 14 69 158 0.713 6.25 5.19 Intr + 128887 129017 131 2 2 107 94 204 0.981 23.31 5.20 Intr + 130806 130931 126 2 0 73 86 250 0.999 24.18 5.21 Intr + 131125 131214 90 0 0 118 76 155 0.978 17.49 5.22 Intr + 131585 131743 159 1 0 -24 84 271 0.995 15.68 5.23 Intr + 141529 141690 162 0 0 71 105 271 0.994 27.27 5.24 Intr + 141929 142005 77 1 2 114 96 176 0.999 19.31 5.25 Intr + 142256 142373 118 2 1 60 113 91 0.996 9.27 5.26 Intr + 143304 143441 138 2 0 121 86 117 0.987 15.46 5.27 Intr + 144159 144251 93 2 0 133 44 158 0.402 16.06 5.28 Intr + 167031 167145 115 2 1 59 81 48 0.049 1.22 5.29 Term + 177805 177842 38 2 2 106 50 42 0.224 -0.30 5.30 PlyA + 179374 179379 6 1.05 6.03 PlyA - 179444 179439 6 1.05 6.02 Term - 190532 190418 115 1 1 80 54 84 0.441 2.24 6.01 Init - 199807 199737 71 1 2 94 97 29 0.598 4.92 6.00 Prom - 210030 209991 40 -1.66 7.02 PlyA - 210402 210397 6 1.05 7.01 Term - 215411 215255 157 1 1 97 54 105 0.228 5.41 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 25267 24743 525 1 0 70 49 310 0.906 21.36 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:26429921_26674198|GENSCAN_predicted_peptide_1|367_aa MEKEETGNDQDVGTMMWQETDLDVQKKSVQIRRKWEWEEVCKARRTEERLLPPEAGPMPK HIAFIMDGNRRYAKKCQVERQEGHSQGFNKLAETLRWCLNLGILEVTVYAFSIENFKRSK SEVDGLMDLARQKFSRLMEEKEKLQKHGVCIRVLGDLHLLPLDLQELIAQAVQATKNYNK CFLNVCFAYTSRHEISNAVREMAWGVEQGLLDPSDISESLLDKCLYTNRSPHPDILIRTS GEVRLSDFLLWQTSHSCLVFQPVLWPEYTFWNLFEAILQFQMNHSVLQQKARDMYAEERK RQQLERDQATVTEQLLREGLQASGDAQLRRTRLHKLSARREERVQGFLQALELKRADWLA RLGTASA >gi568815597f:26429921_26674198|GENSCAN_predicted_CDS_1|1104_bp atggagaaggaagaaacggggaatgatcaagatgtgggcaccatgatgtggcaggaaacc gacttagatgttcagaagaagtcagtgcagatacggaggaaatgggaatgggaagaagtc tgcaaagccaggagaaccgaggaaaggcttttgcctccagaggcaggcccaatgccgaaa cacattgcattcataatggacgggaaccgtcgctatgccaagaagtgccaggtggagcgg caggaaggccactcacagggcttcaacaagctagctgagactctgcggtggtgtttgaac ctgggcatcctagaggtgacagtctacgcattcagcattgagaacttcaaacgctccaag agtgaggtagacgggcttatggatctggcccggcagaagttcagccgcttgatggaagaa aaggagaaactgcagaagcatggggtgtgtatccgggtcctgggcgatctgcacttgttg cccttggatctccaggagctgattgcacaagctgtacaggccacgaagaactacaacaag tgtttcctgaatgtctgttttgcatacacatcccgtcatgagatcagcaatgctgtgaga gagatggcctggggggtggagcaaggcctgttggatcccagtgatatctctgagtctctg cttgataagtgcctctataccaaccgctctcctcatcctgacatcttgatacggacttct ggagaagtgcggctgagtgacttcttgctatggcagacctctcactcctgcctggtgttc caacccgttctgtggccagagtatacattttggaacctcttcgaggccatcctgcagttc cagatgaaccatagcgtgcttcagcagaaggcccgagacatgtatgcagaggagcggaag aggcagcagctggagagggaccaggctacagtgacagagcagctgctgcgagaggggctc caagccagtggggacgcccagctccgaaggacacgcttgcacaaactctcggccagacgg gaagagcgagtccaaggcttcctgcaggccttggaactcaagcgagctgactggctggcc cgtctgggcactgcatcagcctga >gi568815597f:26429921_26674198|GENSCAN_predicted_peptide_2|80_aa MPKRKAEGDAKGDKAKVKDEKPAPPKPEPKPKKAPAKKGEKVPKGKKGKADAGKEGNNPA ENGDAKTDQAQKAEGAGDAK >gi568815597f:26429921_26674198|GENSCAN_predicted_CDS_2|243_bp atgcccaagagaaaggctgaaggggatgctaagggagataaagcaaaggtgaaggacgaa aaacctgctcctccaaagccagagcccaagcctaaaaaggcccctgcaaagaagggagag aaggtacccaaagggaaaaagggaaaagctgatgctggcaaggaggggaataaccctgca gaaaatggagatgccaaaacagaccaggcacagaaagctgaaggtgctggagatgccaag tga >gi568815597f:26429921_26674198|GENSCAN_predicted_peptide_3|244_aa MATSTPLLDPSSEGGRVSKADRQRLLGASCESPPSAPRSLLAPSRRCPLRPHGGGAEPSS APHLGGGARGMGLLGPAPQPRQGYGPTSRPGASLGLNRRSPRRPSTPPAGSRIPHYPADP NPCSRDSPATLEELSWGRPALPQVTRCAPGGHLASRQLEPAPAQARLNPGPRLLRAQEER PVQLPSAPARGEGGGSGLQRCGPRSSDPGPSPPRGKKEASASASFPERCHPHSGPLTAGP GHCA >gi568815597f:26429921_26674198|GENSCAN_predicted_CDS_3|735_bp atggcaacctcaaccccactcctggaccctagctcggaagggggcagggtatcgaaagcc gatcgccagaggctcctcggtgcctcgtgcgagtccccgccgtcagccccgaggagcctc ctggcaccaagcaggcgctgcccccttcggccacacggtggcggcgcagagccgagctcc gcgccccacctgggaggcggcgcccgcgggatggggctcctcggcccagctccccagcct cgacaaggatacggcccgacatccaggccaggagcgtcgctggggcttaaccgccgttcc ccaaggcgcccctccactcctccagcgggctcccgcatcccccattatccggcggacccc aacccctgctcacgtgactcgcccgccaccctcgaagaactctcgtggggccgccccgcc ctgccgcaggtcacgcgctgcgcacctggagggcatctggccagccgccagctagagcct gcccctgctcaggctcggctcaacccgggcccgcgcctgctccgagcccaggaagagcgt cctgtccagctcccaagtgcgccggcccgtggggaaggaggcgggagtgggctccagcgg tgcggccctcgctcctccgacccggggccctctccacctcgggggaagaaagaggcctct gcctccgcctccttccctgagcgctgccaccctcactccggccctctgaccgcgggtccc ggacattgcgcttag >gi568815597f:26429921_26674198|GENSCAN_predicted_peptide_4|100_aa MYHRMDKTGVLLKMSDSNLDSSKKNFFEGEVADEESVILTLLPVKDDPNMEQTEPSVSST SDVKLEKPMKYNQGTEDNMLCPNCAKKNKKMMKRLMTIEK >gi568815597f:26429921_26674198|GENSCAN_predicted_CDS_4|303_bp atgtatcaccgaatggacaagacaggggtgttgctgaaaatgtcagactcaaatttggat agcagcaagaagaatttctttgagggggaagtagctgatgaggaaagtgtgattttgaca ttgctgccagttaaagatgacccaaatatggaacaaacagaaccaagtgtttcttcaact tctgatgtcaaactggagaaacctatgaaatacaatcaaggcacagaagataatatgtta tgccccaactgtgctaagaagaataagaagatgatgaaaagattaatgacaatagagaag tag >gi568815597f:26429921_26674198|GENSCAN_predicted_peptide_5|968_aa MPLAQLKEPWPLMELVPLDPENGQTSGEEAGLQPSKHDEMSTSPGTQLMASEFRAECGPE TWSLPCMAEEEELGWALGHRPCSLRNWHLAGTQKAGFGQLEKELGLPGRKRDLGFGIWLL AAIVWIAIQDLCEVLGQAFCPPYPSWGLGLAAFFGAAEQGRTGGGPLVPAFWAAIVGTAA RTHSSALNPFLGIPALLPARVSARKQRPRISQTSLPVPGPGSGPQRDSDEGVLKEISITH HVKAGSEKADPSHFELLKVLGQGSFGKVFLVRKVTRPDSGHLYAMKVLKKATLKVRDRVR TKMERDILADVNHPFVVKLHYAFQTEGKLYLILDFLRGGDLFTRLSKEVMFTEEDVKFYL AELALGLDHLHSLGIIYRDLKPENILLDEEGHIKLTDFGLSKEAIDHEKKAYSFCGTVEY MAPEVVNRQGHSHSADWWSYGVLMFEMLTGSLPFQGKDRKETMTLILKAKLGMPQFLSTE AQSLLRALFKRNPANRLGSGPDGAEEIKRHVFYSTIDWNKLYRREIKPPFKPAVAQPDDT FYFDTEFTSRTPKDSPGIPPSAGAHQLFRGFSFVATGLMEDDGKPRAPQAPLHSVVQQLH GKNLVFSDGYVVKETIGVGSYSECKRCVHKATNMEYAVKVIDKSKRDPSEEIEILLRYGQ HPNIITLKDVYDDGKHVYLVTELMRGGELLDKILRQKFFSEREASFVLHTIGKTVEYLHS QGVVHRDLKPSNILYVDESGNPECLRICDFGFAKQLRAENGLLMTPCYTANFVAPEVLKR QGYDEGCDIWSLGILLYTMLAGYTPFANGPSDTPEEILTRIGSGKFTLSGGNWNTVSETA KDLVSKMLHVDPHQRLTAKQVLQHPWVTQKDKLPQSQLSHQDLQLVKGAMAATYSALNSS KPTPQLKPIESSILAQRREVRSPSAQPPPRLGGVPNSSLRTGHDDNGGFVEQKGGKGSCS ITEAEVQK >gi568815597f:26429921_26674198|GENSCAN_predicted_CDS_5|2907_bp atgccgctcgcccagctcaaggagccctggccgctcatggagctagtgcctctggacccg gagaatggacagacctcaggggaagaagctggacttcagccgtccaagcatgacgagatg agcacgtccccgggaacccagctgatggcatctgagttcagggctgagtgtggaccagag acatggtcattgccgtgcatggccgaggaagaagagttggggtgggctctagggcacagg ccatgttccctgcggaactggcatttggctgggactcaaaaggcaggatttgggcagctg gagaaagaattgggccttccaggcagaaagcgggatttgggatttgggatttggctttta gcagcaatagtgtggatagccatccaggacctctgtgaggtccttggtcaggctttctgc ccgccttacccatcctggggcttgggactggcagccttcttcggggcagctgagcagggc aggaccggaggggggccactggtgcctgctttctgggctgccattgtgggtacagcagca agaacccacagctctgcccttaaccccttcctggggatccctgccctgctgcctgcccgt gtgtctgccaggaagcagcggcccaggatcagccagacctctctgcctgtccctggccct ggctctggcccccagcgggactcggatgagggcgtcctcaaggagatctccatcacgcac cacgtcaaggctggctctgagaaggctgatccatcccatttcgagctcctcaaggttctg ggccagggatcctttggcaaagtcttcctggtgcggaaagtcacccggcctgacagtggg cacctgtatgctatgaaggtgctgaagaaggcaacgctgaaagtacgtgaccgcgtccgg accaagatggagagagacatcctggctgatgtaaatcacccattcgtggtgaagctgcac tatgccttccagaccgagggcaagctctatctcattctggacttcctgcgtggtggggac ctcttcacccggctctcaaaagaggtgatgttcacggaggaggatgtgaagttttacctg gctgagctggctctgggcctggatcacctgcacagcctgggtatcatttacagagacctc aagcctgagaacatccttctggatgaggagggccacatcaaactcactgactttggcctg agcaaagaggccattgaccacgagaagaaggcctattctttctgcgggacagtggagtac atggcccctgaggtcgtcaaccgccagggccactcccatagtgcggactggtggtcctat ggggtgttgatgtttgagatgctgacgggctccctgcccttccaggggaaggaccggaag gagaccatgacactgattctgaaggcgaagctaggcatgccccagtttctgagcactgaa gcccagagcctcttgcgggccctgttcaagcggaatcctgccaaccggctcggctccggc cctgatggggcagaggaaatcaagcggcatgtcttctactccaccattgactggaataag ctataccgtcgtgagatcaagccacccttcaagccagcagtggctcagcctgatgacacc ttctactttgacaccgagttcacgtcccgcacacccaaggattccccaggcatccccccc agcgctggggcccatcagctgttccggggcttcagcttcgtggccaccggcctgatggaa gacgacggcaagcctcgtgccccgcaggcacccctgcactcggtggtacagcaactccat gggaagaacctggtttttagtgacggctacgtggtaaaggagacaattggtgtgggctcc tactctgagtgcaagcgctgtgtccacaaggccaccaacatggagtatgctgtcaaggtc attgataagagcaagcgggatccttcagaagagattgagattcttctgcggtatggccag caccccaacatcatcactctgaaagatgtgtatgatgatggcaaacacgtgtacctggtg acagagctgatgcggggtggggagctgctggacaagatcctgcggcagaagttcttctca gagcgggaggccagctttgtcctgcacaccattggcaaaactgtggagtatctgcactca cagggggttgtgcacagggacctgaagcccagcaacatcctgtatgtggacgagtccggg aatcccgagtgcctgcgcatctgtgactttggttttgccaaacagctgcgggctgagaat gggctcctcatgacaccttgctacacagccaactttgtggcgcctgaggtgctgaagcgc cagggctacgatgaaggctgcgacatctggagcctgggcattctgctgtacaccatgctg gcaggatatactccatttgccaacggtcccagtgacacaccagaggaaatcctaacccgg atcggcagtgggaagtttaccctcagtgggggaaattggaacacagtttcagagacagcc aaggacctggtgtccaagatgctacacgtggatccccaccagcgcctcacagctaagcag gttctgcagcatccatgggtcacccagaaagacaagcttccccaaagccagctgtcccac caggacctacagcttgtgaagggagccatggctgccacgtactccgcactcaacagctcc aagcccaccccccagctgaagcccatcgagtcatccatcctggcccagcggcgagaagtg aggagcccctctgcccagccaccacctcgtctgggaggtgtacccaacagctcattgaga acgggccatgatgacaatggcggttttgtggaacagaaaggggggaaagggtcttgctcc atcaccgaggctgaagtgcagaagtga >gi568815597f:26429921_26674198|GENSCAN_predicted_peptide_6|61_aa MEIWRNLKPSKNLIAIVSIEEQPRTTGSRCCPLPPTVLCAPAGLREGRRGRSCRRGRGNQ S >gi568815597f:26429921_26674198|GENSCAN_predicted_CDS_6|186_bp atggaaatttggaggaatctgaagcctagcaagaatctcatagcgattgtcagcatagaa gaacagccaagaacaacgggttcccgctgctgccccctgccgcccacagtgctctgcgcc cctgcggggctccgagaggggcgtcgggggcgcagctgccgccggggccgtgggaaccag agctga >gi568815597f:26429921_26674198|GENSCAN_predicted_peptide_7|52_aa XSWWHFCNSWNLETYGSECSSKAGTPEGSPRAKAGRSTLQGTSDNISNISSI >gi568815597f:26429921_26674198|GENSCAN_predicted_CDS_7|159_bp nnatcctggtggcatttctgcaactcctggaacctggagacatatggatcagagtgctca tccaaggcaggaaccccagaaggttctccccgggcaaaggcagggcgctccaccctccag ggtacctctgacaacatcagtaatatcagcagcatctga