GENSCAN 1.0 Date run: 6-Nov-116 Time: 04:03:02 Sequence gi568815592r:106968707_107198755 : 230049 bp : 45.65% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 1151 1146 6 1.05 1.03 Term - 4030 3870 161 2 2 77 49 44 0.500 -2.50 1.02 Intr - 5120 4964 157 2 1 43 102 107 0.943 7.18 1.01 Init - 5940 5872 69 0 0 96 94 197 0.930 20.15 1.00 Prom - 6062 6023 40 -4.06 2.00 Prom + 11824 11863 40 -3.86 2.01 Init + 43390 43578 189 0 0 60 62 263 0.930 19.91 2.02 Intr + 45381 45503 123 2 0 102 77 51 0.753 6.18 2.03 Intr + 59387 59565 179 1 2 43 109 16 0.041 -2.08 2.04 Intr + 71303 71524 222 2 0 31 87 174 0.161 8.84 2.05 Intr + 75554 75626 73 2 1 75 93 28 0.936 1.41 2.06 Term + 82351 82530 180 0 0 120 36 129 0.960 8.51 2.07 PlyA + 82610 82615 6 1.05 3.06 PlyA - 82891 82886 6 1.05 3.05 Term - 102244 99998 2247 1 0 84 49 4695 0.002 450.58 3.04 Intr - 102571 102502 70 0 1 125 39 47 0.001 2.68 3.03 Intr - 108873 108719 155 0 2 89 33 53 0.003 -1.23 3.02 Intr - 130047 129845 203 1 2 69 78 266 0.623 22.70 3.01 Init - 130579 130543 37 1 1 57 93 62 0.956 3.77 3.00 Prom - 139076 139037 40 -5.66 4.02 PlyA - 139358 139353 6 1.05 4.01 Sngl - 145982 145548 435 2 0 110 53 198 0.953 12.77 4.00 Prom - 152367 152328 40 -3.16 5.04 PlyA - 154925 154920 6 1.05 5.03 Term - 186071 185913 159 2 0 90 44 132 0.913 6.94 5.02 Intr - 190564 190497 68 2 2 117 60 15 0.527 0.22 5.01 Init - 195768 195693 76 0 1 83 82 21 0.539 2.25 5.00 Prom - 195945 195906 40 -2.46 6.02 PlyA - 196229 196224 6 -0.45 6.01 Sngl - 197393 196278 1116 2 0 58 47 370 0.739 26.68 6.00 Prom - 199169 199130 40 -2.16 7.03 PlyA - 199647 199642 6 1.05 7.02 Term - 200597 199875 723 2 0 -33 43 323 0.072 9.18 7.01 Intr - 225148 225116 33 1 0 113 102 32 0.932 5.52 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 102229 99998 2232 1 0 85 49 4683 0.968 457.75 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:106968707_107198755|GENSCAN_predicted_peptide_1|128_aa MGRAMVARLGLGLLLLALLLPTQRGLPRSSIRGTSPRVRVPGAAQRCLEGVWMHRAGERD RGLGLGDPSGGGGEEVRAENSDLPLFHDKGEIITCPTGLCISWLEKSVVNDATQTLGHWE NLGSWPQQ >gi568815592r:106968707_107198755|GENSCAN_predicted_CDS_1|387_bp atgggcagagcaatggtggccaggctcgggctggggctgctgctgctggcactgctccta cccacgcagcgcggtctccctagatcctccatccggggaacctcgccccgggtgcgggta cccggggccgcgcagcgctgcctcgagggtgtatggatgcaccgcgccggcgagagagac cgggggctgggcctgggagaccctagcgggggcgggggcgaagaagtgagggctgaaaac tccgatctgcctttgttccatgacaaaggagagattatcacgtgccctactggactgtgc atttcgtggcttgaaaaatcagtggtgaatgatgccactcagacgctgggacactgggag aaccttggttcttggccacagcaatga >gi568815592r:106968707_107198755|GENSCAN_predicted_peptide_2|321_aa MKKFCKGDESLEDEEHSGRPLEVDNDQLRAIIEGDPLTTTGEVAEELNVNHSTVVQHLKC IGKATEQYVYVAQGLETPVLYRRVDVFSHCQPTSPLTECSKGSVSSKTRPPAGVAASLPP WPRAPPDPLSLRRRRRKGRGADRGSLEACSPRGRGEGREGLTYRLKSNIRSTKSTKKSLQ KVDEEDSDEESHHDEMSEQEEELEDDPTVVKNYKDLEKAVQSFRYDVVLKTGLDIGRNKV EDAFYKGELRLNEEKLWKKSRTVKVGDTLDLLIGEDKEAGTETVMRILLKKVFEEKTESE KYRVVLRRWKSLKLPKKRMSK >gi568815592r:106968707_107198755|GENSCAN_predicted_CDS_2|966_bp atgaagaagttttgcaaaggagatgagagccttgaagatgaggagcatagtggccggcca ttggaagttgacaacgaccaattgagagcaatcatcgaaggtgatcctcttacaactaca ggagaagttgctgaagaactcaacgtcaaccattctacagttgttcagcatttaaagtgc attggaaaggccacagagcagtacgtgtatgtggcccaggggttggagactcctgtgtta tatagacgtgttgatgtgttttctcattgccaacccacttcacccctcacagaatgcagc aagggcagcgtgtcctccaagacgcggcctccagcaggggtcgctgcttcgctgccgccc tggcctcgcgccccgcccgacccgctctcactgcgccggcgccggcggaaggggcggggc gcagataggggtagcctggaggcctgcagtccgcgcggccgcggggagggacgagagggc ctgacgtacagactcaaaagtaatataaggtctacaaaatctactaaaaagtctctgcaa aaagtagatgaagaggactctgatgaagaaagccatcatgatgagatgagtgagcaggaa gaggagcttgaggatgatcctactgtagtcaaaaactataaagacctggaaaaagcagtt cagtcttttcggtatgatgttgtcctgaagacggggctagatattgggagaaacaaagtg gaagatgctttctacaaaggtgaactcaggctgaatgaggaaaaattatggaagaaaagc agaacggtgaaagtgggagatacattggatcttctcattggagaggataaagaagcagga acagagacagttatgcggattctcttgaaaaaagtgtttgaagagaagactgaaagtgaa aaatacagagtggtgttacggcggtggaaaagtttaaagttgcctaagaagagaatgtct aaataa >gi568815592r:106968707_107198755|GENSCAN_predicted_peptide_3|903_aa MNSTEFTEDVEEVLKSITVKVETEAEDAALDCSVNSRTSEKHSVDSVLTALQDSSKRKQL VSDGLLDSVPGVKRRRLIPENLVSSAILAHPSPLPYSVGSRRMTHRVRAVRAETVSGLIL GGHRIDAELSEGVLAQYCQHQCQPSDGKEVHCWCPALLAGMRNRENSSPCQGNGEQAGRG RSLGNVWPGEEEPCNDATTPSYKKPLYGISHKIMEKKNPPSGDLLNVYELFEKANASNSP SSLRLLNEPQKRDCGSTGAGTDNDPNIYFLIQKMFYMLNTLTSNMSQLHSKVDLLSLEVS RIKKQVSPTEMVAKFQPPPEYQLTAAELKQIVDQSLSGGDLACRLLVQLFPELFSDVDFS RGCSACGFAAKRKLESLHLQLIRNYVEVYYPSVKDTAVWQAECLPQLNDFFSRFWAQREM EDSQPSGQVASFFEAEQVDPGHFLDNKDQEEALSLDRSSTIASDHVVDTQDLTEFLDEAS SPGEFAVFLLHRLFPELFDHRKLGEQYSCYGDGGKQELDPQRLQIIRNYTEIYFPDMQEE EAWLQQCAQRINDELEGLGLDAGSEGDPPRDDCYDSSSLPDDISVVKVEDSFEGERPGRR SKKIWLVPIDFDKLEIPQPDFEVPGADCLLSKEQLRSIYESSLSIGNFASRLLVHLFPEL FTHENLRKQYNCSGSLGKKQLDPSRIKLIRHYVQLLYPRAKNDRVWTLEFVGKLDERCRR RDTEQRRSYQQQRKVHVPGPECRDLTSYAINPERFREEFEGPPLPPERSSKDFCKIPLDE LVVPSPDFPVPSPYLLSDKEVREIVQQSLSVGNFAARLLVRLFPELFTAENLRLQYNHSG ACNKKQLDPTRLRLIRHYVEAVYPVEKMEEVWHYECIPSIDERCRRPNRKKCDILKKAKK VEK >gi568815592r:106968707_107198755|GENSCAN_predicted_CDS_3|2712_bp atgaactcaactgaattcaccgaagatgtagaagaagttctaaaaagtatcactgtgaaa gtggagacagaggctgaagatgctgctctggactgctccgtgaattccaggacttctgag aagcactctgtggacagcgtcctcactgccctgcaggactccagcaaacgaaagcagctg gtcagcgatggcctgctagactctgtccccggcgtgaagaggaggcggctgatccccgag aatcttgtttcttcggccatcttagcacatccttcacccttaccatatagtgttggcagc agaaggatgacccacagagtaagagccgtgagagcagagactgtgtctggcttgatcctg ggaggtcaccgtattgatgctgaacttagtgagggagttcttgcccagtattgccagcat cagtgtcagccctctgacggaaaggaagtccactgctggtgccctgctctcctagcaggc atgcggaaccgtgagaacagctcgccctgccaaggcaatggtgagcaggccggcaggggc aggagcctgggcaatgtgtggcctggagaggaggagccctgcaacgatgccaccacccct tcctacaagaagcctctgtatggcatctcgcacaagatcatggagaagaagaatcctccc tcgggggacctgctaaacgtgtacgagctctttgagaaggcaaacgccagcaacagcccc tcgtcactgcggctcctgaatgagccacagaagcgggactgtggcagcaccggggcaggc actgacaacgaccccaacatctacttcctgatccagaagatgttctacatgctgaacacc ctcacgtccaacatgtcccagctgcacagcaaggtggacctgctctcccttgaggtgagc cgcatcaagaagcaggtgagccccactgagatggtggccaaattccagccgccccctgag taccagctcacagccgcagagctcaagcagatcgtggaccagagcctgtcagggggggac ctggcctgccgcttgctggtgcagctcttccccgagctcttcagcgacgtggacttctcc cggggctgcagtgcctgtggctttgcggccaagcgcaagctggagtcgctgcacctgcag ctcatccgcaactatgtggaggtctactacccctcggtgaaggacacggctgtgtggcag gccgagtgcctgccccagctgaacgacttcttcagccgcttctgggcccagcgggaaatg gaggacagccagcccagcggccaggtcgccagcttctttgaggcagagcaggtggacccc ggccacttcctggacaacaaagaccaggaggaggccctgtctcttgaccggagcagcacc atcgcctcagaccacgtggtggacacgcaggacctcactgagttcctggacgaagcctcc tcaccaggcgagtttgccgtcttcctcctccaccggctcttccccgagctcttcgaccac cgcaagctgggtgaacagtacagctgctacggggacggtggaaagcaggagctggacccg cagcggctgcagatcatccgcaactacacggagatctacttccctgacatgcaggaggag gaggcctggctgcagcagtgtgcccagcgcatcaacgacgagctcgagggcctggggctg gacgcgggcagtgaaggcgaccccccgcgtgatgactgctacgactcctccagtctgccc gacgacatctcagtggtcaaggtggaggacagcttcgagggcgagcggccgggtcgccgc tccaagaagatctggctggtgcccatcgacttcgacaagttagagatcccccagcctgac ttcgaggtgcccggtgccgactgcctgctcagcaaggagcagctacgcagcatctacgag agcagcctgtccatcggcaacttcgcctcgcgcctgctggtgcacctgttccccgagctc ttcacgcacgagaacctgcgcaagcagtacaactgcagcggctccctgggcaagaagcag ctggacccctcccgcatcaagctcatccgccactacgtgcagctgctctacccacgcgcc aaaaacgaccgcgtctggaccctggagttcgtgggcaaactggatgagcgctgccggcgc cgggacacggagcaaaggcgctcctaccagcagcagcgcaaggtccacgtgccgggccct gagtgcagagacttgaccagctatgcaatcaaccccgagaggttccgggaggagtttgag gggcccccactgccccccgagaggagcagcaaggacttttgcaagatccccttggacgag ctggtggtcccctcgcctgacttcccggtgccttctccctacctgctgtctgacaaggag gtgcgtgagatcgtgcagcagagcctctccgtgggcaactttgccgcccggctcctcgtc cgcctgtttcccgaactcttcaccgccgagaacctccggctgcagtacaaccattccggg gcttgcaacaagaagcaactggaccccacgcggctgcggctcatccgccactacgtggaa gccgtctacccggtggagaagatggaggaggtgtggcactacgaatgtatccccagcatc gatgagaggtgccgccgccccaacaggaaaaaatgcgacatcctcaagaaagcaaagaaa gtggagaagtga >gi568815592r:106968707_107198755|GENSCAN_predicted_peptide_4|144_aa MERPGRPEGSAVLAAAWRSRSAGLWRGTARRVEGGVGAGAGGGDWREMPGGEREAGGAAA RRNEASGGCKWSEAGPRRRRPGAERDGPRRQGAQRSLPGRSTAVRGSRCPGAGGGPGALG VHPGWREAWPAGRKAEAWGARRSS >gi568815592r:106968707_107198755|GENSCAN_predicted_CDS_4|435_bp atggagcggccggggcggccggaaggctctgccgtactcgcggcggcttggcgctcgcgc tcggcggggctttggcggggcactgctcggcgtgtggaagggggtgtcggagcgggagca gggggcggggactggagggagatgccgggcggcgagcgggaggccgggggagcagccgct cggaggaacgaagcctccggaggctgcaaatggagcgaggccggcccgcggcggcggcgg ccgggagcggagagggatgggccccgccgccagggggcgcagcgctcccttcccggccgt tccacggcggttcgcggttcccgctgcccaggagctggaggcgggcccggcgccttgggt gtgcacccgggctggcgcgaggcctggccggccggccgcaaagctgaagcctggggcgct cgccgctcctcgtga >gi568815592r:106968707_107198755|GENSCAN_predicted_peptide_5|100_aa MDETGNRHSQQTVTRMKNQTLHVLTDCSDPSPVSFEVIVYHVKIDLILLRERIKAGKGVT SAIDLCRYHGNKALEALESFPPSEARSALENIVFAVTRFS >gi568815592r:106968707_107198755|GENSCAN_predicted_CDS_5|303_bp atggatgaaaccggaaaccgtcattctcagcaaactgtcacaaggatgaaaaaccaaaca ctgcatgttctcactgactgctctgatccaagtccagttagctttgaggttattgtatat cacgtcaaaatagacttgatcttgttgcgagaaagaatcaaagctggcaaaggtgtgact tcagctattgacctgtgtcgttaccatggaaacaaggcactggaggccctggagagcttt cctccctcggaggccagatctgctttagaaaacattgtgtttgctgtgaccagattttca tga >gi568815592r:106968707_107198755|GENSCAN_predicted_peptide_6|371_aa MIVYLENPIVSAQNLLKLISNFSKVSGYKIKVQKSQAFLYTNNRQTESQIMSELPFTIAS KRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGRINIVKMAILPKVIY RFNAIPIKLPMTFFTELEKTTLKFIWNQKRACIAKTILSQKNKAGGIMLPDFKLYYKAKV TKTAWYWYQNRDTDQWNRTEPSEIIPHIYNHLIFDKPDKTKKWVKDSLFNKWCWENWLAI CRKLKLDPFLTLYTKINSRWIKDLNVRPKTIKTLEENLGNAIQDIGTGKDFMSKTPKAMA TKAKIDKWDLIKLKSFCTAKETTIRVNGQPTEWEKIFPIYSSDKGLISRIYKELKQIYKK KTTPSKSGRRI >gi568815592r:106968707_107198755|GENSCAN_predicted_CDS_6|1116_bp atgattgtatatttagaaaaccccatcgtctcagcccaaaatctccttaagctgataagc aacttcagcaaagtctcaggatacaaaatcaaagtgcaaaaatcacaagcattcttatac accaataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttca aagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaaggaa aactacaaaccactgctcaacgaaataaaagaagacacaaacaaatggaagaacattcca tgctcatggataggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatttat agattcaatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaaact actttaaagttcatatggaaccaaaaaagagcctgcattgccaagacaatcctaagccaa aagaacaaagctggaggcatcatgctacctgatttcaaactatactacaaggctaaagta accaaaacagcatggtactggtaccaaaacagagatacagaccaatggaacagaacagag ccctcagaaataataccacacatctacaaccatctgatctttgacaaacctgacaaaacc aagaaatgggtaaaggattccctatttaataaatggtgctgggaaaactggctagccata tgtagaaagctgaaactggatcccttcctcacactttatacaaaaattaattcaagatgg attaaagatttaaatgttagacctaaaaccataaaaaccctagaagaaaacctaggcaat gccattcaggacataggcacgggcaaggacttcatgtctaaaacaccaaaagcaatggca acaaaagccaaaattgacaaatgggatctaattaaactaaagagcttctgcacagcaaaa gaaactaccatcagagtgaacgggcaaccgacagaatgggagaaaatttttccaatctac tcatctgacaaagggctaatatccagaatctacaaagaactcaaacaaatttacaagaaa aaaacaaccccatcaaaaagtgggcgaaggatatga >gi568815592r:106968707_107198755|GENSCAN_predicted_peptide_7|251_aa AQEKGRLDYAKNYVTNAQASVADSIKWKKGYQVIEDQMNEMKREEKFREKRGKINKQSLR EIWDYVKRPNLRLIGVPESDRENGTKLENTLQDIIQENFPNLARQASIQIQEIQRTPQRY SSRRATPRRIIVRFTKVEMKEKMLRAAREKGRVTHKGKPIRLTADLSAETLQARREWGAI FNILKEKNFQPRISYPAKLSFISEGEIKSFTDKQMLRDFVTTRPALKELLKEALNMERNN RYQPLQKHVKL >gi568815592r:106968707_107198755|GENSCAN_predicted_CDS_7|756_bp gctcaagaaaaaggaagattggactatgctaagaactacgtgacgaatgcacaagcttca gtagccgactcgatcaagtggaagaaagggtatcaagtgattgaagatcaaatgaatgaa atgaagcgagaagagaagtttagagaaaaaagaggaaaaataaacaaacaaagcctccga gaaatatgggactatgtgaaaagaccaaatctacgtctgattggtgtacctgaaagtgac agggaaaatggaaccaagttggaaaacactctgcaggatattatccaggagaacttcccc aacctagcgaggcaggccagcattcaaattcaggaaatacagagaacgccacaaagatac tcctcgagaagagcaactccaagacgcataattgtcagattcaccaaagttgaaatgaag gaaaaaatgttaagggcagccagagagaaaggtcgggttacccacaaagggaagcccatc agactaacagcggatctctcggcagaaaccctacaagccaggagagagtggggggcaata ttcaacattcttaaagaaaagaattttcaacccagaatttcatatccagccaaactaagc ttcataagtgaaggagaaataaaatcctttacagacaagcaaatgctgagagattttgtc accaccaggcctgccctaaaagagctcctgaaggaagcactaaacatggaaaggaacaac cggtaccagccactgcaaaaacatgtcaaattgtaa