GENSCAN 1.0	Date run: 4-Nov-116	Time: 02:16:50

Sequence gi568815597f:26373481_26574668 : 201188 bp : 48.38% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  37412  37442   31  1  1   97   52    63 0.603   3.50
 1.02 Intr +  37906  38102  197  2  2  133   94   281 0.952  32.33
 1.03 Intr +  51823  52007  185  0  2   78   80   190 0.974  15.79
 1.04 Term +  52762  52978  217  1  1  100   52   206 0.999  14.82
 1.05 PlyA +  56215  56220    6                               1.05

 2.00 Prom +  57683  57722   40                              -6.86
 2.01 Init +  59082  59243  162  2  0   95   27   118 0.675   6.13
 2.02 Intr +  64688  64804  117  1  0  117   90   184 0.995  22.16
 2.03 Intr +  69251  69393  143  1  2  112   69   175 0.997  17.15
 2.04 Intr +  72836  72952  117  2  0   84  108   103 0.925  11.48
 2.05 Intr +  74079  74180  102  0  0  148   68    53 0.742   8.59
 2.06 Intr +  84311  84425  115  2  1   78   96    84 0.210   8.65
 2.07 Intr +  86557  86664  108  0  0   95  110    63 0.927   9.68
 2.08 Term +  95412  95651  240  2  0   69   42   261 0.640  15.63
 2.09 PlyA +  96126  96131    6                              -0.45

 3.00 Prom +  97155  97194   40                              -4.66
 3.01 Init +  99133  99147   15  0  0  114  102    14 0.770   5.45
 3.02 Intr + 100003 100047   45  0  0   92  100    51 0.960   5.31
 3.03 Intr + 100605 100655   51  2  0   40  115    55 0.778   2.50
 3.04 Intr + 101092 101187   96  0  0   77   75   124 0.955  10.21
 3.05 Term + 101633 101668   36  1  0   73   45    58 0.723  -2.66
 3.06 PlyA + 102464 102469    6                               1.05

 4.00 Prom + 123687 123726   40                              -2.16
 4.01 Sngl + 126721 127455  735  0  0  101   50   141 0.701   7.88
 4.02 PlyA + 130292 130297    6                               1.05

 5.03 PlyA - 130429 130424    6                               1.05
 5.02 Term - 145956 145874   83  1  2   97   43    68 0.889   1.06
 5.01 Init - 146809 146590  220  1  1   39   34   186 0.668   7.09
 5.00 Prom - 153215 153176   40                              -7.46

 6.00 Prom + 153783 153822   40                              -4.36
 6.01 Init + 156441 156503   63  2  0   96  105    76 0.609  11.30
 6.02 Intr + 163445 163489   45  1  0  125  100    24 0.215   5.91
 6.03 Intr + 165714 165927  214  2  1   51   38   122 0.165   1.79
 6.04 Intr + 166183 166248   66  2  0  100   76    19 0.116   0.78
 6.05 Intr + 169093 169233  141  2  0   97   43    36 0.093   0.32
 6.06 Intr + 170595 170676   82  1  1  118   38    40 0.098   0.70
 6.07 Intr + 172483 172555   73  1  1   83  109    40 0.124   5.01
 6.08 Intr + 173387 173503  117  1  0   75   87   160 0.711  15.26
 6.09 Intr + 173709 173790   82  2  1  117  113   120 0.999  16.61
 6.10 Intr + 177917 177997   81  0  0   74   58   107 0.980   5.91
 6.11 Intr + 178164 178243   80  1  2  108   83    67 0.999   7.47
 6.12 Intr + 179911 180017  107  0  2  118   81    99 0.999  11.21
 6.13 Intr + 180734 180771   38  2  2   92  105    62 0.999   6.21
 6.14 Intr + 181116 181258  143  1  2   80  101   245 0.954  25.07
 6.15 Intr + 181671 181741   71  2  2   69   74    85 0.882   3.18
 6.16 Intr + 182057 182145   89  2  2   75  111    91 0.998   9.71
 6.17 Intr + 183174 183238   65  1  2   78   82    69 0.980   3.74
 6.18 Intr + 183518 183620  103  1  1   14   69   158 0.713   6.25
 6.19 Intr + 185327 185457  131  0  2  107   94   204 0.981  23.31
 6.20 Intr + 187246 187371  126  0  0   73   86   250 0.999  24.18
 6.21 Intr + 187565 187654   90  1  0  118   76   155 0.978  17.49
 6.22 Intr + 188025 188183  159  2  0  -24   84   271 0.995  15.68
 6.23 Intr + 197969 198130  162  1  0   71  105   271 0.994  27.27
 6.24 Intr + 198369 198445   77  2  2  114   96   176 0.999  19.31
 6.25 Intr + 198696 198813  118  0  1   60  113    91 0.996   9.27
 6.26 Intr + 199744 199881  138  0  0  121   86   117 0.987  15.46
 6.27 Term + 200599 200721  123  0  0  133   54   163 0.653  15.78
 6.28 PlyA + 200963 200968    6                              -0.45

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  81707  81183  525  2  0   70   49   310 0.906  21.36

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:26373481_26574668|GENSCAN_predicted_peptide_1|209_aa
MGSVSNQQFAGGCAKAAEEAPEEAPEDAARAADEPQLLHGAGICKWFNVRMGFGFLSMTA
RAGVALDPPVDVFVHQSKLHMEGFRSLKEGEAVEFTFKKSAKGLESIRVTGPGGVFCIGS
ERRPKGKSMQKRRSKGDRCYNCGGLDHHAKECKLPPQPKKCHFCQSISHMVASCPLKAQQ
GPSAQGKPTYFREEEEEIHSPTLLPEAQN
>gi568815597f:26373481_26574668|GENSCAN_predicted_CDS_1|630_bp
atgggctccgtgtccaaccagcagtttgcaggtggctgcgccaaggcggcagaagaggcg
cccgaggaggcgccggaggacgcggcccgggcggcggacgagcctcagctgctgcacggt
gcgggcatctgtaagtggttcaacgtgcgcatggggttcggcttcctgtccatgaccgcc
cgcgccggggtcgcgctcgaccccccagtggatgtctttgtgcaccagagtaagctgcac
atggaagggttccggagcttgaaggagggtgaggcagtggagttcacctttaagaagtca
gccaagggtctggaatccatccgtgtcaccggacctggtggagtattctgtattgggagt
gagaggcggccaaaaggaaagagcatgcagaagcgcagatcaaaaggagacaggtgctac
aactgtggaggtctagatcatcatgccaaggaatgcaagctgccaccccagcccaagaag
tgccacttctgccagagcatcagccatatggtagcctcatgtccgctgaaggcccagcag
ggccctagtgcacagggaaagccaacctactttcgagaggaagaagaagaaatccacagc
cctaccctgctcccggaggcacagaattga
>gi568815597f:26373481_26574668|GENSCAN_predicted_peptide_2|367_aa
MEKEETGNDQDVGTMMWQETDLDVQKKSVQIRRKWEWEEVCKARRTEERLLPPEAGPMPK
HIAFIMDGNRRYAKKCQVERQEGHSQGFNKLAETLRWCLNLGILEVTVYAFSIENFKRSK
SEVDGLMDLARQKFSRLMEEKEKLQKHGVCIRVLGDLHLLPLDLQELIAQAVQATKNYNK
CFLNVCFAYTSRHEISNAVREMAWGVEQGLLDPSDISESLLDKCLYTNRSPHPDILIRTS
GEVRLSDFLLWQTSHSCLVFQPVLWPEYTFWNLFEAILQFQMNHSVLQQKARDMYAEERK
RQQLERDQATVTEQLLREGLQASGDAQLRRTRLHKLSARREERVQGFLQALELKRADWLA
RLGTASA
>gi568815597f:26373481_26574668|GENSCAN_predicted_CDS_2|1104_bp
atggagaaggaagaaacggggaatgatcaagatgtgggcaccatgatgtggcaggaaacc
gacttagatgttcagaagaagtcagtgcagatacggaggaaatgggaatgggaagaagtc
tgcaaagccaggagaaccgaggaaaggcttttgcctccagaggcaggcccaatgccgaaa
cacattgcattcataatggacgggaaccgtcgctatgccaagaagtgccaggtggagcgg
caggaaggccactcacagggcttcaacaagctagctgagactctgcggtggtgtttgaac
ctgggcatcctagaggtgacagtctacgcattcagcattgagaacttcaaacgctccaag
agtgaggtagacgggcttatggatctggcccggcagaagttcagccgcttgatggaagaa
aaggagaaactgcagaagcatggggtgtgtatccgggtcctgggcgatctgcacttgttg
cccttggatctccaggagctgattgcacaagctgtacaggccacgaagaactacaacaag
tgtttcctgaatgtctgttttgcatacacatcccgtcatgagatcagcaatgctgtgaga
gagatggcctggggggtggagcaaggcctgttggatcccagtgatatctctgagtctctg
cttgataagtgcctctataccaaccgctctcctcatcctgacatcttgatacggacttct
ggagaagtgcggctgagtgacttcttgctatggcagacctctcactcctgcctggtgttc
caacccgttctgtggccagagtatacattttggaacctcttcgaggccatcctgcagttc
cagatgaaccatagcgtgcttcagcagaaggcccgagacatgtatgcagaggagcggaag
aggcagcagctggagagggaccaggctacagtgacagagcagctgctgcgagaggggctc
caagccagtggggacgcccagctccgaaggacacgcttgcacaaactctcggccagacgg
gaagagcgagtccaaggcttcctgcaggccttggaactcaagcgagctgactggctggcc
cgtctgggcactgcatcagcctga
>gi568815597f:26373481_26574668|GENSCAN_predicted_peptide_3|80_aa
MPKRKAEGDAKGDKAKVKDEKPAPPKPEPKPKKAPAKKGEKVPKGKKGKADAGKEGNNPA
ENGDAKTDQAQKAEGAGDAK
>gi568815597f:26373481_26574668|GENSCAN_predicted_CDS_3|243_bp
atgcccaagagaaaggctgaaggggatgctaagggagataaagcaaaggtgaaggacgaa
aaacctgctcctccaaagccagagcccaagcctaaaaaggcccctgcaaagaagggagag
aaggtacccaaagggaaaaagggaaaagctgatgctggcaaggaggggaataaccctgca
gaaaatggagatgccaaaacagaccaggcacagaaagctgaaggtgctggagatgccaag
tga
>gi568815597f:26373481_26574668|GENSCAN_predicted_peptide_4|244_aa
MATSTPLLDPSSEGGRVSKADRQRLLGASCESPPSAPRSLLAPSRRCPLRPHGGGAEPSS
APHLGGGARGMGLLGPAPQPRQGYGPTSRPGASLGLNRRSPRRPSTPPAGSRIPHYPADP
NPCSRDSPATLEELSWGRPALPQVTRCAPGGHLASRQLEPAPAQARLNPGPRLLRAQEER
PVQLPSAPARGEGGGSGLQRCGPRSSDPGPSPPRGKKEASASASFPERCHPHSGPLTAGP
GHCA
>gi568815597f:26373481_26574668|GENSCAN_predicted_CDS_4|735_bp
atggcaacctcaaccccactcctggaccctagctcggaagggggcagggtatcgaaagcc
gatcgccagaggctcctcggtgcctcgtgcgagtccccgccgtcagccccgaggagcctc
ctggcaccaagcaggcgctgcccccttcggccacacggtggcggcgcagagccgagctcc
gcgccccacctgggaggcggcgcccgcgggatggggctcctcggcccagctccccagcct
cgacaaggatacggcccgacatccaggccaggagcgtcgctggggcttaaccgccgttcc
ccaaggcgcccctccactcctccagcgggctcccgcatcccccattatccggcggacccc
aacccctgctcacgtgactcgcccgccaccctcgaagaactctcgtggggccgccccgcc
ctgccgcaggtcacgcgctgcgcacctggagggcatctggccagccgccagctagagcct
gcccctgctcaggctcggctcaacccgggcccgcgcctgctccgagcccaggaagagcgt
cctgtccagctcccaagtgcgccggcccgtggggaaggaggcgggagtgggctccagcgg
tgcggccctcgctcctccgacccggggccctctccacctcgggggaagaaagaggcctct
gcctccgcctccttccctgagcgctgccaccctcactccggccctctgaccgcgggtccc
ggacattgcgcttag
>gi568815597f:26373481_26574668|GENSCAN_predicted_peptide_5|100_aa
MYHRMDKTGVLLKMSDSNLDSSKKNFFEGEVADEESVILTLLPVKDDPNMEQTEPSVSST
SDVKLEKPMKYNQGTEDNMLCPNCAKKNKKMMKRLMTIEK
>gi568815597f:26373481_26574668|GENSCAN_predicted_CDS_5|303_bp
atgtatcaccgaatggacaagacaggggtgttgctgaaaatgtcagactcaaatttggat
agcagcaagaagaatttctttgagggggaagtagctgatgaggaaagtgtgattttgaca
ttgctgccagttaaagatgacccaaatatggaacaaacagaaccaagtgtttcttcaact
tctgatgtcaaactggagaaacctatgaaatacaatcaaggcacagaagataatatgtta
tgccccaactgtgctaagaagaataagaagatgatgaaaagattaatgacaatagagaag
tag
>gi568815597f:26373481_26574668|GENSCAN_predicted_peptide_6|927_aa
MPLAQLKEPWPLMELVPLDPENGQTSGEEAGLQPSKHDEMSTSPGTQLMASEFRAECGPE
TWSLPCMAEEEELGWALGHRPCSLRNWHLAGTQKAGFGQLEKELGLPGRKRDLGFGIWLL
AAIVWIAIQDLCEVLGQAFCPPYPSWGLGLAAFFGAAEQGRTGGGPLVPAFWAAIVGTAA
RTHSSALNPFLGIPALLPARVSARKQRPRISQTSLPVPGPGSGPQRDSDEGVLKEISITH
HVKAGSEKADPSHFELLKVLGQGSFGKVFLVRKVTRPDSGHLYAMKVLKKATLKVRDRVR
TKMERDILADVNHPFVVKLHYAFQTEGKLYLILDFLRGGDLFTRLSKEVMFTEEDVKFYL
AELALGLDHLHSLGIIYRDLKPENILLDEEGHIKLTDFGLSKEAIDHEKKAYSFCGTVEY
MAPEVVNRQGHSHSADWWSYGVLMFEMLTGSLPFQGKDRKETMTLILKAKLGMPQFLSTE
AQSLLRALFKRNPANRLGSGPDGAEEIKRHVFYSTIDWNKLYRREIKPPFKPAVAQPDDT
FYFDTEFTSRTPKDSPGIPPSAGAHQLFRGFSFVATGLMEDDGKPRAPQAPLHSVVQQLH
GKNLVFSDGYVVKETIGVGSYSECKRCVHKATNMEYAVKVIDKSKRDPSEEIEILLRYGQ
HPNIITLKDVYDDGKHVYLVTELMRGGELLDKILRQKFFSEREASFVLHTIGKTVEYLHS
QGVVHRDLKPSNILYVDESGNPECLRICDFGFAKQLRAENGLLMTPCYTANFVAPEVLKR
QGYDEGCDIWSLGILLYTMLAGYTPFANGPSDTPEEILTRIGSGKFTLSGGNWNTVSETA
KDLVSKMLHVDPHQRLTAKQVLQHPWVTQKDKLPQSQLSHQDLQLVKGAMAATYSALNSS
KPTPQLKPIESSILAQRRVRKLPSTTL
>gi568815597f:26373481_26574668|GENSCAN_predicted_CDS_6|2784_bp
atgccgctcgcccagctcaaggagccctggccgctcatggagctagtgcctctggacccg
gagaatggacagacctcaggggaagaagctggacttcagccgtccaagcatgacgagatg
agcacgtccccgggaacccagctgatggcatctgagttcagggctgagtgtggaccagag
acatggtcattgccgtgcatggccgaggaagaagagttggggtgggctctagggcacagg
ccatgttccctgcggaactggcatttggctgggactcaaaaggcaggatttgggcagctg
gagaaagaattgggccttccaggcagaaagcgggatttgggatttgggatttggctttta
gcagcaatagtgtggatagccatccaggacctctgtgaggtccttggtcaggctttctgc
ccgccttacccatcctggggcttgggactggcagccttcttcggggcagctgagcagggc
aggaccggaggggggccactggtgcctgctttctgggctgccattgtgggtacagcagca
agaacccacagctctgcccttaaccccttcctggggatccctgccctgctgcctgcccgt
gtgtctgccaggaagcagcggcccaggatcagccagacctctctgcctgtccctggccct
ggctctggcccccagcgggactcggatgagggcgtcctcaaggagatctccatcacgcac
cacgtcaaggctggctctgagaaggctgatccatcccatttcgagctcctcaaggttctg
ggccagggatcctttggcaaagtcttcctggtgcggaaagtcacccggcctgacagtggg
cacctgtatgctatgaaggtgctgaagaaggcaacgctgaaagtacgtgaccgcgtccgg
accaagatggagagagacatcctggctgatgtaaatcacccattcgtggtgaagctgcac
tatgccttccagaccgagggcaagctctatctcattctggacttcctgcgtggtggggac
ctcttcacccggctctcaaaagaggtgatgttcacggaggaggatgtgaagttttacctg
gctgagctggctctgggcctggatcacctgcacagcctgggtatcatttacagagacctc
aagcctgagaacatccttctggatgaggagggccacatcaaactcactgactttggcctg
agcaaagaggccattgaccacgagaagaaggcctattctttctgcgggacagtggagtac
atggcccctgaggtcgtcaaccgccagggccactcccatagtgcggactggtggtcctat
ggggtgttgatgtttgagatgctgacgggctccctgcccttccaggggaaggaccggaag
gagaccatgacactgattctgaaggcgaagctaggcatgccccagtttctgagcactgaa
gcccagagcctcttgcgggccctgttcaagcggaatcctgccaaccggctcggctccggc
cctgatggggcagaggaaatcaagcggcatgtcttctactccaccattgactggaataag
ctataccgtcgtgagatcaagccacccttcaagccagcagtggctcagcctgatgacacc
ttctactttgacaccgagttcacgtcccgcacacccaaggattccccaggcatccccccc
agcgctggggcccatcagctgttccggggcttcagcttcgtggccaccggcctgatggaa
gacgacggcaagcctcgtgccccgcaggcacccctgcactcggtggtacagcaactccat
gggaagaacctggtttttagtgacggctacgtggtaaaggagacaattggtgtgggctcc
tactctgagtgcaagcgctgtgtccacaaggccaccaacatggagtatgctgtcaaggtc
attgataagagcaagcgggatccttcagaagagattgagattcttctgcggtatggccag
caccccaacatcatcactctgaaagatgtgtatgatgatggcaaacacgtgtacctggtg
acagagctgatgcggggtggggagctgctggacaagatcctgcggcagaagttcttctca
gagcgggaggccagctttgtcctgcacaccattggcaaaactgtggagtatctgcactca
cagggggttgtgcacagggacctgaagcccagcaacatcctgtatgtggacgagtccggg
aatcccgagtgcctgcgcatctgtgactttggttttgccaaacagctgcgggctgagaat
gggctcctcatgacaccttgctacacagccaactttgtggcgcctgaggtgctgaagcgc
cagggctacgatgaaggctgcgacatctggagcctgggcattctgctgtacaccatgctg
gcaggatatactccatttgccaacggtcccagtgacacaccagaggaaatcctaacccgg
atcggcagtgggaagtttaccctcagtgggggaaattggaacacagtttcagagacagcc
aaggacctggtgtccaagatgctacacgtggatccccaccagcgcctcacagctaagcag
gttctgcagcatccatgggtcacccagaaagacaagcttccccaaagccagctgtcccac
caggacctacagcttgtgaagggagccatggctgccacgtactccgcactcaacagctcc
aagcccaccccccagctgaagcccatcgagtcatccatcctggcccagcggcgagtgagg
aagttgccatccaccaccctgtga