GENSCAN 1.0    Date run: 17-Aug-118    Time: 17:46:22

Sequence gi568815591f:39897492_40163100 : 265609 bp : 40.26% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 PlyA -    930    925    6                               1.05
 1.01 Sngl -  12682  12455  228  1  0  109   55   150 0.868   8.67
 1.00 Prom -  26347  26308   40                              -7.75

 2.00 Prom +  35559  35598   40                              -4.35
 2.01 Init +  40146  40221   76  2  1   96   49    83 0.430   6.50
 2.02 Intr +  52261  52623  363  2  0   -8   66   243 0.903   6.73
 2.03 Intr +  52644  52855  212  1  2   67   76   198 0.847  14.21
 2.04 Term +  53148  54365 1218  2  0   18   44  1397 0.797 118.93
 2.05 PlyA +  55639  55644    6                               1.05

 3.05 PlyA -  56009  56004    6                               1.05
 3.04 Term -  66310  65501  810  1  0   22   41   331 0.147  13.99
 3.03 Intr -  66851  66707  145  1  1   24   41   120 0.112   0.26
 3.02 Intr -  68892  68676  217  1  1  -38  102   154 0.006   0.84
 3.01 Init -  69241  68938  304  1  1   88  -16   292 0.024  16.29
 3.00 Prom -  76009  75970   40                              -4.25

 4.00 Prom +  79029  79068   40                              -6.15
 4.01 Init +  79339  79376   38  0  2   86   20    68 0.191  -0.67
 4.02 Intr +  90108  90767  660  0  0   86   97   587 0.997  49.39
 4.03 Intr + 100003 100173  171  1  0   90   44   180 0.999  11.94
 4.04 Intr + 101870 102009  140  2  2    6   99   263 0.786  18.39
 4.05 Intr + 104370 104540  171  1  0   77   95   227 0.932  21.29
 4.06 Term + 130215 130309   95  1  2   68   42   108 0.182   1.21
 4.07 PlyA + 131272 131277    6                               1.05

 5.00 Prom + 132773 132812   40                              -3.95
 5.01 Init + 137846 137853    8  1  2   83   84     4 0.192  -0.24
 5.02 Intr + 150330 150386   57  0  0   58  121   100 0.352   7.38
 5.03 Intr + 165335 165436  102  2  0   87   86   116 0.500   9.67
 5.04 Intr + 181229 181360  132  2  0   35   89   186 0.760  12.24
 5.05 Intr + 190635 190840  206  0  2   84   65   163 0.653  11.52
 5.06 Intr + 195294 195746  453  1  0   54  114   388 0.605  30.30
 5.07 Term + 196639 197489  851  2  2   89   48   687 0.990  56.82
 5.08 PlyA + 198018 198023    6                               1.05

 6.05 PlyA - 198741 198736    6                               1.05
 6.04 Term - 214315 214303   13  0  1   79   42    31 0.002  -5.70
 6.03 Intr - 236949 236738  212  0  2   24   90   271 0.673  17.69
 6.02 Intr - 237193 237120   74  2  2   52   60   101 0.914   1.91
 6.01 Intr - 237695 237483  213  0  0   97    9   151 0.708   5.86

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  69241  68909  333  1  0   88   37   316 0.960  22.27
S.002 Init + 180581 180630   50  1  2   74   41   105 0.924   4.87

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591f:39897492_40163100|GENSCAN_predicted_peptide_1|75_aa
MELPKVPVLCVKLPGQVEEKSQMGGWVRQVCALALHVWVQAAATVEIGVQFWGHWVNVPG
SSATTSAAQKNPHGD

>gi568815591f:39897492_40163100|GENSCAN_predicted_CDS_1|228_bp
atggagcttccaaaagtccctgtcctttgtgttaagctaccagggcaggtggaggagaaa
agccagatggggggttgggtcagacaagtctgtgctctagctctccatgtgtgggtgcaa
gcagcagccacagtggaaatcggagtacagttctggggccactgggttaatgttccaggg
agtagcgcaactacctctgctgcacagaagaatccacatggggactga

>gi568815591f:39897492_40163100|GENSCAN_predicted_peptide_2|622_aa
MEKDAIHQKHKEGAFIPALMMIVTGGAEREGEERAGAGRAHQEPPRGGGERRKGVAPPPL
LAAAAAAKEEESQGPYLPPQTQKRPCPHLPGNSPPPSPAQSLPQPRAAARSPPPRPAPLP
SPAGRRVREALGRELLSGPGHFRGACGGRRRRRGGGGGGVLLSRPLPSPELAPNVRARWS
EAATKVVLPRCAARAESAAKAAPPPPGALGGLGTPPQAMPSSSDTALGGGGGLSWAEKKL
EERRKRRRFLSPQQPPLLLPLLQPQLLQPPPPPPPLLFLAAPGTAAAAAAAAAASSSCFS
PGPPLEVKRLARGKRRAGGRQKRRRGPRAGQEAEKRRVFSLPQPQQDGGGGASSGGGVTP
LVEYEDVSSQSEQGLLLGGASAATAATAAGGTGGSGGSPASSSGTQRRGEGSERRPRRDR
RSSSGRSKERHREHRRRDGQRGGSEASKSRSRHSHSGEERAEVAKSGSSSSSGGRRKSAS
ATSSSSSSRKDRDSKAHRSRTKSSKEPPSAYKEPPKAYREDKTEPKAYRRRRSLSPLGGR
DDSPVSHRASQSLRSRKSPSPAGGGSSPYSRRLPRSPSPYSRRRSPSYSRHSSYERGGDV
SPSPYSSSSWRRSRSPYSPVLR

>gi568815591f:39897492_40163100|GENSCAN_predicted_CDS_2|1869_bp
atggagaaggatgctattcaccagaaacacaaagaaggagcatttattccagccttaatg
atgatagtgacggggggagcggagagggaaggggaggagcgagccggggcaggccgcgca
caccaggagcctcctcgtggagggggggagcggaggaaaggggtagctccgccacctctg
ctggcggcggcggcggcggcgaaagaagaagaaagtcagggcccgtacctaccgccacag
actcagaaacgcccctgcccccatctccccggaaatagccccccgcccagccccgcacag
tcgctgccccaaccgagagccgcagcccggtcgcccccgcctcgccccgccccgctgccc
agccccgcgggccgcagagtgcgcgaggccctaggccgggagttgttgtcaggcccaggc
cacttccgaggcgcctgcggaggacgacgtcgacgccgaggaggaggcggtggaggcgtg
ttgttgtctcgcccactcccctcccctgaacttgcacccaacgtcagggcgcgatggagt
gaagcggcgacgaaggtggtacttccgcgttgcgctgcccgagccgagagcgcggccaag
gccgctcccccacccccgggggcacttggaggactcgggactcccccgcaggcgatgccg
agcagctcggacacggcgctggggggaggcgggggcctgagctgggcggagaagaagttg
gaggaacgccgcaagcggaggcgattcctgtcccctcagcagccgccgctgctgttgccg
ctcctgcagccgcagctcctgcaaccgccgccgcccccgccgcctctgctcttcctggct
gctcccggcacggccgccgccgcagccgccgccgccgcggcctcctcctcttgcttcagc
ccgggcccccctctggaggtcaagcggctggcgagaggcaagaggcgcgcaggagggcgg
cagaagcggcgtcgcgggccccgcgccgggcaggaggcggagaagcgtcgggtcttctcg
ctgccccagccgcagcaggacggcggtggcggtgctagtagcggcgggggtgtgaccccg
ctggtggaatacgaggatgtgagctcccagtccgagcaggggctgctgctggggggggcc
agcgcggcaacggcggcgacggctgccgggggaacggggggcagcggcgggagtccggcc
tcctcctccggcacccagcggcgcggggaggggtcggagcgcaggccccgccgggaccgc
cgcagcagcagtggccgcagcaaggagcgccaccgcgagcaccggcggcgggatgggcag
cgcggtggcagcgaggcctccaagtcccgcagccgccacagccacagcggcgaggaacgg
gccgaggtcgccaagagcggcagcagcagcagcagcggcggccgccggaaaagcgcttcg
gccacatccagcagcagtagcagccgcaaggaccgggactcgaaggcccaccgcagccgg
actaagtcgtccaaggagccgccttcggcctacaaggaaccgcccaaggcctaccgggag
gacaagaccgagcctaaggcctacaggcggcggcggtccctcagcccactgggaggccgg
gacgacagcccggtgtcccacagggcctctcagagcctgaggagccgcaagtcccccagc
ccggcaggaggtggcagcagcccctattctcggcggctgccgcgctccccgagcccctac
agtcgccgccgctcccccagctacagccgccacagctcctacgagcggggcggcgacgtg
tcccctagtccctacagcagcagcagctggcgccgctctcgcagtccctacagccctgtg
ctcaggtga

>gi568815591f:39897492_40163100|GENSCAN_predicted_peptide_3|491_aa
MGKNRAEKLKNLKIRAPLLLQRNAAPHQQRNKAGQRMTSTSSEKKASFRQSNYSELKEEI
RTHGKEVKNLEKKLDEWLTRITNAQKSLKDLMELKIKAQELQRVSVMEDEMKQEEKFRKK
RIKRNEQSLQEIWNHVKTPNLHLIGVPESDRENGTKLENTLQDIIQENFPNLARLNQEEA
EYLNRPITGSEIEAIINSLPTKKSPGPHGFTAEFYQRYKEELVLMDVSQNNKSYDKPTAN
IIPNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEKVKLSLF
ADDMIVYLENPNISAQNLLKLIGNFSKVSGYKINVQKLQAFLYTNNRQTESQIMSELPFT
IALKRVKYPGIQLTRDVKDLFKENYKLLLNEIKEDTNKWKNIPSSWVGRINIMKMAILPK
VIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRAHTAKSILSQKDKAGGITLPDFKLYYK
ATVTKTAWYWY

>gi568815591f:39897492_40163100|GENSCAN_predicted_CDS_3|1476_bp
atggggaaaaacagagcggaaaaactgaaaaatctaaaaatcagagcgcctctcctcctc
caaaggaacgcagctcctcaccagcaacggaacaaagctggacagagaatgacttcgacg
agttcagagaagaaggcttcattcagacaatcaaactactccgagctaaaggaggaaatt
cgaacccatggcaaagaagttaaaaaccttgaaaaaaaattagatgaatggctaactaga
ataaccaacgcacagaagtccttaaaggacctgatggagctgaaaatcaaggcacaagaa
ctacaaagggtatcagtgatggaagatgaaatgaagcaagaagagaagtttagaaaaaaa
agaataaaaagaaacgaacaaagcctccaagaaatatggaaccatgtgaaaacaccaaat
ctacatctgattggtgtacctgaaagtgacagggagaatggaaccaagttggaaaacact
ctgcaggatattatccaggagaacttccctaatctagcaagactaaaccaggaagaagct
gaatatctgaatagaccaataacaggctctgaaattgaggcaataattaatagcttacca
accaaaaaaagtccaggaccacatggattcacagctgaattctaccagaggtacaaggag
gagctggtattgatggacgtatctcaaaataataagagctatgacaaacccacagccaat
atcataccgaatgggcaaaaactggaagcattccctttgaaaactggcacaagacaggga
tgccctctctcaccactactattcaacatagtgttggaagttctggccagggcaatcagg
caggagaaggaaataaagggtattcaattaggaaaagagaaagtcaaattgtccctgttt
gcagatgacatgattgtatatctagaaaaccccaacatctcagcccaaaatctccttaag
ctgataggcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaattacaagca
ttcttatacaccaataacagacaaacagagagccaaatcatgagtgaactcccattcaca
attgctttaaagagagtaaaatacccaggaatccaacttacaagggatgtgaaggacctc
ttcaaggagaactacaaactactgctcaatgaaataaaagaggacacaaacaaatggaag
aacattccaagctcatgggtaggaagaatcaatatcatgaaaatggccatactgcccaag
gtaatttatagattcaatgccatccccatcaagctaccaatgactttcttcacagaattg
gaaaaaactactttaaagttcatatggaaccaaaaaagagcccacactgccaagtcaatc
ctaagccaaaaggacaaagctggaggcatcacgctacctgacttcaaactatactacaag
gctacagtaacgaaaacagcatggtactggtactaa

>gi568815591f:39897492_40163100|GENSCAN_predicted_peptide_4|424_aa
MESEIEDEGTAVRRSGKSRSRSPYSSRHSRSRSRHRLSRSRSRHSSISPSTLTLKSSLAA
ELNKNKKARAAEAARAAEAAKAAEATKAAEAAAKAAKASNTSTPTKGNTETSASASQTNH
VKDVKKIKIEHAPSPSSGGTLKNDKAKTKPPLQVTKVENNLIVDKATKKAVIVGKESKSA
ATKEESVSLKEKTKPLTPSIGAKEKEQHVALVTSTLPPLPLPPMLPEDKEADSLRGNISV
KAVKKEVEKKLRCLLADLPLPPELPGGDDLSKSPEEKKTATQLHSKRRPKICGPRYGETK
EKDIDWGKRCVDKFDIIGIIGEGTYGQVYKARDKDTGEMVALKKVRLDNEKEGFPITAIR
EIKILRQLTHQSIINMKEIVTDKEDALDFKKDKVCVAVKPALFLRVHDSDWAVGALQFSY
EVET

>gi568815591f:39897492_40163100|GENSCAN_predicted_CDS_4|1275_bp
atggaaagtgaaattgaggacgaggggactgctgtacgacggtctggaaaatcccgaagc
agaagcccgtattcatctaggcattcaagatctcgtagcaggcacagattgtctagatcc
agaagtcgtcattctagtatttctcctagcacactaactctgaagagtagcctggcagct
gaattgaacaagaataaaaaagcacgagcagcagaggcagcaagagccgcagaagcagcg
aaagctgcagaagcaactaaggctgctgaggctgctgccaaggctgcaaaagcttcaaac
acttctacacctaccaaggggaacacggaaactagtgccagtgcatcacaaacaaaccat
gtgaaggatgtgaagaaaattaaaattgaacatgcaccttctccctcaagtggtggaact
ttaaaaaatgacaaagcaaaaacaaagccacctcttcaggtaacgaaggtggaaaataat
ttgattgtagataaagccaccaagaaagcagtcatagttggaaaggagagtaaatctgct
gctacaaaggaggaatcagtatctcttaaagagaaaaccaaaccacttacaccaagcata
ggagccaaggagaaggagcaacatgtagctttagtcacctctacattaccaccgttacct
ttgcctcccatgctgcctgaagataaagaagctgatagcttacgaggaaatatttcagta
aaagcagttaaaaaagaagtagaaaagaaactccgatgtcttcttgctgatttaccgctg
ccccctgagctaccaggaggagatgatctttcaaagagtccagaggaaaagaaaacagca
acacagttacatagtaaaaggaggcctaaaatatgtgggcctcgctatggtgaaaccaaa
gaaaaagatattgactggggaaaacgctgcgtggataaatttgatatcatcggaattatt
ggagaaggtacttacggacaagtttacaaagccagggataaagacactggagaaatggta
gccttaaaaaaagtacgtctggataatgaaaaggaaggctttccaattacagcaattcga
gaaattaaaattctccggcagcttacccatcagagtattatcaatatgaaggaaatagtg
actgataaagaagatgctttggatttcaagaaggacaaagtctgtgtagctgtgaagcct
gctctgtttctacgggtacatgactcagattgggcagttggagccttacagttctcttac
gaggttgagacttaa

>gi568815591f:39897492_40163100|GENSCAN_predicted_peptide_5|602_aa
MDRGQIKLADFGLARLYSSEESRPYTNKVITLWYRPPELLLGEERYTPAIDVWSCGIPAA
ALDLFDYMLALDPSKRCTAEQALQCEFLRDVEPSKMPPPDLPLWQDCHELWSKKRRRQKQ
MGMTDDVSTIKAPRKDLSLGLDDSRTNTPQGVLPSSQLKSQGSSNVAPVKTGPGQHLNHS
ELAILLNLLQSKTSVNMADFVQVLNIKVNSETQQQLNKINLPAGILATGEKQTDPSTPQQ
ESSKPLGGIQPSSQTIQPKVETDAAQAAVQSAFAVLLTQLIKAQQSKQKDVLLEERENGS
GHEASLQLRPPPEPSTPVSGQDDLIQHQDMRILELTPEPDRPRILPPDQRPPEPPEPPPV
TEEDLDYRTENQHVPTTSSSLTDPHAGVKAALLQLLAQHQPQDDPKREGGIDYQAGDTYV
STSDYKDNFGSSSFSSAPYVSNDGLGSSSAPPLERRSFIGNSDIQSLDNYSTASSHSGGP
PQPSAFSESFPSSVAGYGDIYLNAGPMLFSGDKDHRFEYSHGPIAVLANSSDPSTGPEST
HPLPAKMHNYNYGGNLQENPSGPSLMHGQTWTSPAQGPGYSQGYRGHISTSTGRGRGRGL
PY

>gi568815591f:39897492_40163100|GENSCAN_predicted_CDS_5|1809_bp
atggacagagggcagataaaacttgcagactttggacttgctcgattgtatagctcagaa
gaaagtcggccgtatactaacaaggtaattactttatggtaccgtccacctgaactgcta
ctgggagaagaacgatacacaccagccattgatgtatggagctgtggtattcctgcagct
gcgctagacttatttgattacatgcttgccttggatcctagtaagcgctgcactgctgaa
caggctcttcagtgcgagttcctccgagatgtggaaccctcaaaaatgcctccaccagat
ctccctttatggcaagattgtcatgagttatggagtaaaaagcgaagaagacagaagcag
atgggcatgactgatgatgtttccacaattaaagcccccaggaaggacttgtctctgggc
ttggatgacagcagaaccaacacaccccagggtgtgctgccatcttcacagctgaaatct
cagggcagctcaaatgtggcacctgtaaaaacaggccctggacagcacttaaaccacagt
gaattggcaattctactaaacctactacaatctaaaacaagtgttaatatggctgatttt
gtccaagtgttgaacattaaggtaaactctgagactcaacagcagctaaataaaataaac
cttcctgctggaattttggcaacaggtgaaaaacagacagatccatcaacaccacaacag
gagtcttcgaaaccgttgggaggaattcagccttcttctcagaccatccagcctaaagtg
gagactgatgctgcccaggcggctgtgcagagtgcatttgcagttctgttgactcagtta
ataaaggctcagcagtcaaagcagaaagatgtgctactagaagagagggaaaatggatcg
ggacatgaagcgtcattacaactcaggccacctccagaacctagcactccggtgtcggga
caagatgacctcatccagcatcaagatatgaggatcttggagctaacgccagaaccagac
cggcctcgaattctgcctcctgaccaacgacctcccgagcctcctgaaccaccaccagtc
actgaggaagatctagattatcggacagaaaaccagcatgtacccaccaccagttcttca
ttaactgaccctcatgccggagtgaaggcagccctgttacagctgcttgctcagcatcag
ccccaggatgaccccaaaagagaaggtgggattgattatcaagcaggagacacttacgtg
tccacttcagactacaaggacaactttggatcctcttctttctcttctgctccttatgtt
agcaatgatggtctaggaagcagttctgctccaccactagaacgacgtagtttcattgga
aattcagatattcagtctttggataactacagtactgcttcatctcattctggtggtcca
cctcagccttctgccttttctgagtcatttcccagttcagtagctggatatggagacatt
tacctcaatgctggtcccatgttgtttagtggagacaaggaccatagatttgaatatagc
catggtcctattgcagtcctggcaaacagcagtgacccttccacggggccagagagtact
catcctttgccagcaaagatgcacaactataactatggtggtaacttacaggaaaatccg
agtggccccagcctcatgcatggacagacctggacttctcctgcccaaggacctggatat
tcacaaggatacaggggacatattagcacatcaactggcagaggcagaggcagagggtta
ccatactga

>gi568815591f:39897492_40163100|GENSCAN_predicted_peptide_6|170_aa
XRPRRQAGAGIPLATQTESRRVPDCGRPVHSPLPPPRPEKRQVLRRAATLASVASIACVS
DGIGPGASGELRWLRSEDCFGKGLAVLCGGPVDTVPTGTGVRTTRRRTGPGLGRTGAVTL
RDTAAASRGAGSGLRPLAATLAPTPGPPRGPSSNSATPQGSSRPTPSALR

>gi568815591f:39897492_40163100|GENSCAN_predicted_CDS_6|513_bp
nggcgcccgaggaggcaagctggggcagggatccctttagccacccagaccgaatctcgg
agggtacctgactgcgggcggccagtccacagccccctcccgccgccccggccggagaag
aggcaggttctgcgcagagctgccaccctcgccagcgtcgccagcatcgcgtgcgtctca
gatggcatcggtcccggtgcaagcggcgaactccggtggctgcggtcggaggactgtttt
ggtaaaggactagcagttctctgcggagggccggttgatacagttccgacgggtacggga
gtccgcaccacacgccgccgtacgggccccggtctaggccgtacgggagcagtcactctc
cgcgacacggcggcagcttcccggggggccggttcgggtctccgtcccctggcggctacc
ctggctcctactccaggtcccccgcggggtcccagcagcaattcggctactccccagggc
agcagcagacccacccccagtgcccttcgataa