GENSCAN 1.0 Date run: 5-Nov-116 Time: 02:52:59 Sequence gi568815595r:16488663_16698599 : 209937 bp : 40.42% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 Intr - 5303 5063 241 2 1 92 105 128 0.405 11.10 1.01 Init - 10847 10611 237 2 0 95 53 109 0.548 6.16 1.00 Prom - 11196 11157 40 -2.95 2.00 Prom + 12645 12684 40 -2.95 2.01 Init + 17785 17809 25 0 1 76 98 34 0.627 2.94 2.02 Intr + 23834 24391 558 0 0 34 20 376 0.847 17.27 2.03 Intr + 24449 24897 449 0 2 89 80 230 0.606 14.44 2.04 Term + 29565 29684 120 2 0 29 43 124 0.598 -0.51 2.05 PlyA + 30579 30584 6 1.05 3.05 PlyA - 31134 31129 6 1.05 3.04 Term - 34314 34212 103 2 1 76 52 132 0.534 5.17 3.03 Intr - 35560 35369 192 0 0 87 25 116 0.388 2.89 3.02 Intr - 55963 55790 174 0 0 73 37 181 0.274 9.63 3.01 Init - 59496 59111 386 0 2 80 86 97 0.820 5.06 3.00 Prom - 71843 71804 40 -3.75 4.16 PlyA - 73249 73244 6 1.05 4.15 Term - 78716 78396 321 2 0 40 55 183 0.004 4.04 4.14 Intr - 103486 103388 99 1 0 51 86 61 0.332 1.59 4.13 Intr - 108171 108088 84 0 0 83 110 64 0.724 7.20 4.12 Intr - 108879 108828 52 2 1 70 100 16 0.762 -1.01 4.11 Intr - 109516 109425 92 1 2 88 96 77 0.767 6.37 4.10 Intr - 109936 109790 147 1 0 68 34 149 0.973 7.01 4.09 Intr - 115857 115778 80 1 2 57 73 79 0.695 1.65 4.08 Intr - 116391 116211 181 0 1 38 33 148 0.658 2.82 4.07 Intr - 116741 116541 201 2 0 73 101 118 0.885 10.16 4.06 Intr - 123078 122975 104 1 2 61 77 64 0.189 1.57 4.05 Intr - 138551 138291 261 0 0 32 76 140 0.043 3.74 4.04 Intr - 149401 149223 179 0 2 23 87 110 0.118 3.04 4.03 Intr - 155954 155822 133 0 1 40 105 107 0.115 6.38 4.02 Intr - 157784 157634 151 2 1 84 92 8 0.114 -0.39 4.01 Init - 168174 168091 84 0 0 63 72 96 0.727 6.37 4.00 Prom - 172203 172164 40 -5.75 5.00 Prom + 179996 180035 40 -4.55 5.01 Init + 186261 186355 95 2 2 66 50 50 0.220 -1.10 5.02 Term + 188964 189282 319 0 1 53 54 240 0.499 10.47 5.03 PlyA + 189402 189407 6 1.05 6.00 Prom + 190382 190421 40 -5.45 6.01 Sngl + 191331 193625 2295 2 0 70 47 643 0.915 51.17 6.02 PlyA + 194082 194087 6 1.05 7.03 PlyA - 194282 194277 6 1.05 7.02 Term - 204243 204083 161 1 2 17 47 175 0.885 3.42 7.01 Init - 204904 204862 43 1 1 78 102 37 0.587 4.73 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:16488663_16698599|GENSCAN_predicted_peptide_1|160_aa MACWQDGQQVGPGCRFLSTWAPLWGYLSFLMAWWLDSTSRHPKSQGVNALKFLKAGSRNW NCVIPSYSIGQAILEPRFRENYGIFYTRVFKDKISEKNHLPAFFLWPAAAEMGCGLNKLE KRDEKRPGNIYSTLKRPQVETKIDVSYEYRFLEFTTLSAX >gi568815595r:16488663_16698599|GENSCAN_predicted_CDS_1|480_bp atggcctgctggcaagatggccagcaagttggtcctggctgtcggttcctctccacatgg gcccctctttggggctacttgagcttcctcatggcatggtggctggattccacgagcagg caccccaaaagtcagggagtaaatgctctcaagtttcttaaagctgggtccagaaactgg aactgtgttattccgtcatattctattggtcaagccatcctggagcccagattcagggaa aattacggcatcttttatactagggtgtttaaagacaaaatctcagaaaaaaatcacctc cccgcctttttcctttggccagcagctgctgaaatgggttgcggattgaacaagttagag aaacgtgatgaaaaacggcctgggaatatttattcaactttgaagaggcctcaggtggaa accaagatagatgtgtcctatgaataccgcttcctggagttcacgactctgagtgctgnn >gi568815595r:16488663_16698599|GENSCAN_predicted_peptide_2|383_aa MTAGNPDDRITHNHTGQLRASTESQTDSRERAWFYNLGLHAVLAAKGSDTGGEGHQPWPR TGVRVPDNGASRNTQPPPPRPDPYPSWRPLSLEKWVKTREENPLQKRRSPFSVLGTLGTR VPVLLVVPALPRSARHVCSYTSSTSVPRANPPPRAHAEGQSGTASTLPHPGRAHPGNLDS ATSSLLFFSEDPSGSASEPAEPTTPNRLLRSASPYIAPGMTPLHPFPAKKSWPKIQASQG SAQVPIRRGPGGAALAAPEPRQRATCREAPRAVPTTRQRPGPWGQTEGPVPVDTPSPVAP PPPPTVWDRRRRRRGRGLPEAACSVPGGKFVPYRGGGAGRTWAPYLRTLWTAVQLLGFVQ EEHYRPSVEEGNRLFILPIDAGQ >gi568815595r:16488663_16698599|GENSCAN_predicted_CDS_2|1152_bp atgacagcagggaacccagatgacaggatcacccataaccacacagggcagctgagggca agcacagagtcacagacagacagcagggagcgtgcctggttctacaaccttggcctccac gcggttcttgctgccaaaggcagcgacaccggaggtgaagggcaccagccctggcctagg actggggtgcgcgtgcctgacaacggggcttcccgaaatacacaaccacctcctcctcgc ccagacccatatccatcctggcgtcccttgagtctagagaagtgggttaagacaagggaa gaaaatcccttacagaagaggagatccccgttttcagtgctggggactctcggcacccgc gtccctgtgcttctggtggttccggcgcttcctcggagcgcgcggcatgtctgctcctac acgtccagcacctctgtccccagagcaaacccacctcccagggcacacgcagaggggcag tcaggcaccgcctccaccctgccccacccaggccgcgcgcaccccggaaacctggactcc gcgaccagctccctcctgttcttttccgaggaccccagcggcagcgcttctgagcctgca gagcctacgacccccaacaggctgctccgaagtgcctctccctacatcgctccaggaatg actcccctacacccgttccccgcaaaaaaaagttggcccaagatccaggcgtcccaagga tcagcccaggtccccatccgacgcgggccagggggtgctgctctggcagctccggagccc cggcaaagagcaacctgcagggaggcgccacgcgcggttcccactacccgccagagaccc ggtccatggggccaaacggaggggcctgtgccggtcgacaccccaagccctgtcgcgccc cccccgccgcctaccgtatgggaccggcggaggagaaggcgcggtcgcgggctccctgag gcagcgtgttccgtcccgggaggaaagtttgtgccatacagaggaggcggcgcggggagg acttgggcgccgtatctgagaaccctgtggacagctgttcagctcttgggatttgtgcag gaggagcattatagaccttcagtagaggaaggaaacaggctgttcatccttcccattgac gctggacaatga >gi568815595r:16488663_16698599|GENSCAN_predicted_peptide_3|284_aa MAATAPSITLSPSNIQMTETESLCDFFFFEIKENSSRSTSSRLLIDYNCGTCPILNSSLT RETELQFWTQTDQGLHLETSMGSRFQELSAAQFLNRNHFLLTKKKVVITVELVTNSCCHN CHLLLLASRSFTAAPQGDVGIAERIHCSQGALADGARAEEVFVDQWWRAQDSKDISEAQQ WVAMETSSESQVLATAVDSPVDEHMWKSCGFPVFHHQEMGQHGHLWVIIVIPPIPRSAYL PGICGTPLAVCPAAPSPSAMIENFLRPSPEADASAMLPVQPSEP >gi568815595r:16488663_16698599|GENSCAN_predicted_CDS_3|855_bp atggctgcaacagctccaagcatcacattatctcccagcaacatccaaatgacagaaaca gaaagtctctgtgatttcttcttttttgagattaaggaaaactcttctagaagtacctcc agcagacttctcattgactacaattgtggcacatgccccatcctgaactcatcactaaca agggaaacagaactacaattttggacacagactgatcaaggtttacaccttgaaacttca atggggtccagattccaagaactatctgctgctcagttcctgaacagaaatcattttcta ttaacaaagaagaaggtggtaatcactgttgagttggtgaccaacagctgttgccacaat tgtcatttacttctgttagcctcaagatctttcacagcagccccccagggggatgtggga atcgctgagaggattcactgcagtcagggagcccttgctgatggggctagggcagaggag gtcttcgtggaccaatggtggagagcacaggacagcaaggacatctcagaggcacaacag tgggtggcgatggaaaccagctcagaaagccaagtcttggcaacagctgttgatagtcct gttgatgaacacatgtggaaaagctgtggctttccagttttccaccatcaggaaatggga caacatggacacttgtgggttatcattgtcattccacctatacccaggagcgcttatctg ccaggaatatgtggcactccacttgctgtgtgccctgctgccccttcaccttctgccatg attgaaaacttcctgaggccctcaccagaagcagatgccagtgccatgcttcctgtacag ccttcagaaccatga >gi568815595r:16488663_16698599|GENSCAN_predicted_peptide_4|722_aa MLENVEVALELVNGQSLEEFGGLRRRQRPHWPLRSFLNSLGMPLRAMNLLFPLPGLLPHH YSHKIVDRDSPPRTAPSSGVRQQTWVPFVKKLTVHFTRARKAEERSRGRWKSCQPSCGST PISSFIKPIHFCLIIKAAFNLSTHGSRPLPLISTVPQHSLLLVFALSRSPSADVFLSATS AKASSWSSSGCCPSKLHYHVQEGKEEGTTGKGLSLQEALAFYSESDTLGAPPRASACVSF ISSVSFGPLDVTKARKIKRSAFTASVVETGSYNKNILNWVAYKQQKFVSHSSGDRQVQDQ GVSMPDGPPFGAPQPCHPLLVFLFSSSLAPLTTRSRAAGSSGPHSSPRSGAPSTASAPPG AGRELLPAAIMSWERRPGSGAAYTERQARDAKGGKVGVGPEDALPRRPRARATAFLRHPF PGSGSLLAEVWAAGDLLMAAPSCGGDRKARLTPSLPHESTANPETPNSTISREASTQSSS AATSQGYILPEGKIMPNTVFVGGIDVRMDETEIRSFFARYGSVKEVKIITDRTGVSKGYG FVSFFNDVDVQKIVEFQNVWTNPNTETYMQPTTTMNPITQYVQKSVDRSIQTVVSCLFNP ENRLRNSVVTQDDYFKSRRKTIRRVRSDTYGRLASGTRARRQGRSRWLQQSGISILDRMV TAGLTQKVTNEQRLEIGQVGNHTDILGKSIADRGDSQCKGSEEKACLVIFRNSKLVSVVE VE >gi568815595r:16488663_16698599|GENSCAN_predicted_CDS_4|2169_bp atgctggaaaatgtggaagtggctttggaactggttaatggtcagagcttggaagagttt ggaggtctcagaagaagacagaggccacactggcctctgcggagttttctgaactcacta ggcatgccactcagagctatgaatttgctattccctctccctggactactccctcatcac tatagccataagatagtggatagagatagtcctccccggactgctccctcctcaggtgtt cggcagcagacatgggtgccttttgtcaaaaagctaaccgtgcacttcactagagccaga aaagctgaagaaagatccaggggcaggtggaaaagctgccagcccagctgcggttccaca ccaataagctcatttatcaagcccatccacttctgcctgatcatcaaggccgccttcaac ctgtccacccatgggagcagacccttaccactcatcagcacagtgcctcagcattctctt ctccttgtctttgcactctccaggtctcccagtgctgatgtcttcctctctgctacttct gccaaagccagttcatggtcctccagtggctgctgcccctccaagcttcactaccatgtc caagaaggaaaagaggaagggaccacgggcaaggggctttctcttcaagaggctctggct ttttattcagaaagtgacacactaggggctcccccaagggcttctgcctgtgtctcattt atatcaagtgtgtcatttggtcccctagatgtaacaaaggctagaaaaatcaaacgttca gcttttacagcctctgtagttgagacggggtcttacaataaaaatatcttaaactgggtg gcttataaacaacagaaatttgtatctcacagttctggagaccggcaagtccaagatcag ggtgtcagcatgcctgacggtccgcctttcggggctcctcagccttgtcacccgctcttg gttttccttttctcttcatctttggctcctttgaccactcgaagccgcgcagcgggttcc agcggacctcacagcagccccagaagtggtgcgccaagcacagcctctgctcctcctgga gccggtcgggaactgctgcctgccgccatcatgagttgggagaggcggcctggcagtggc gcggcctacacagagaggcaggcgcgggacgccaagggcggcaaggtgggggtgggcccc gaggacgcgctgcctcgccggccacgtgcaagggccacggccttcttgaggcacccattt cccggttcgggttctctactggcagaggtttgggctgcgggggacttgctgatggcggct ccctcgtgtggcggcgacagaaaagctcgcctgacgccatctttgccgcacgagtctact gcaaatcctgaaactccaaactcaaccatctccagagaggccagcacccagtcctcatca gctgcaaccagccaaggctatattttaccagaaggcaaaatcatgccaaacactgttttt gttggaggaattgatgttaggatggatgaaactgagattagaagcttctttgctagatat ggttcagtgaaagaagtgaagataatcactgatcgaactggtgtgtccaaaggctatgga tttgtttcattttttaatgacgtggatgtgcagaagatagtagaatttcagaatgtctgg actaatccaaacactgaaacttatatgcagcccacaaccacgatgaatcctataactcag tatgttcagaaatctgtggaccgaagcatacaaacggtggtatcttgtctgtttaatcca gagaacagactgagaaactctgttgttactcaagatgactacttcaagtcaaggagaaag acaataaggagagtaaggagtgacacatatggaaggttagcaagtggtactagagcaagg aggcaagggagatccagatggcttcagcagagtgggatttccattttagatagaatggtc acagcaggccttactcagaaagttacaaatgagcaaagacttgaaataggtcaagtagga aatcatactgatatcttggggaagagcattgcagacagaggggacagccagtgcaaaggc tctgaggaaaaagcatgcctagtgatcttcaggaacagcaagttggtcagtgtggttgaa gtagagtga >gi568815595r:16488663_16698599|GENSCAN_predicted_peptide_5|137_aa MNCLPETQGQKYKLFSSSRPMDYRGRGAHVRLPKVDETTKIGKKQSRKTGNSKKQSASPP PKECSSSLATEQNWTENDFDELREEGFRQSNYSEIQEEIQTKGKEVKNFEKNLDECITRI TNTEKCLKELMELKTKA >gi568815595r:16488663_16698599|GENSCAN_predicted_CDS_5|414_bp atgaactgccttcctgagacacagggccagaaatacaagctgttcagctcctcgaggccc atggactatcgtggaagaggtgcacatgtgagattaccaaaagtagatgaaaccacaaag atagggaaaaaacagagcagaaaaactggaaactctaaaaagcagagcgcctctcctcct ccaaaggaatgcagttcctcactggcaacggaacaaaactggacagagaatgactttgat gagttgagagaagaaggcttcagacaatcaaactactctgagatacaggaggaaattcaa accaaaggcaaagaagttaaaaactttgaaaaaaatttagacgaatgtatcactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggcttga >gi568815595r:16488663_16698599|GENSCAN_predicted_peptide_6|764_aa MDKFLDTYTLPRLNQEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEEL VPFLLKLFQSIEKEGIPPNSFYEASIILIPKLGRDTTKKENFRPISLMNIDAKILNKILA NRIQQHIKKLFRHDQVGFIPGMQGWLNIRKSINVIQHINRTKVKNHMIISIDAEKAFDKI QQPFMLKTLNKFGIDGMYLKIIRAIYDKTTANIILNGQKLEAFPLKTGTRQGCPLSPLLF NIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIIYLENPIFSAQNLLKLISNFSKL SGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKGIKYVGIQLTRDVKDLFKENYKPL LKEIKEDTNKWKNIPCSWVGRINIVKMAILSKVIYRFNAIPIKLPMTFFTELEKTTLNFI WNQKRACITKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIT LHIYKYLIIDKPDKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTSYTKINSRWIKDLN VRPKTIKTLEENLGITIQDIGMGKDFMSKTPKAMATKAKIDKWDLFKLKSFCTTKETTIR VNRQPTKWEKIFATYSSDKGLISRIYNELKQIYKKKTNNPIKKWVKDMNRHFSKEDIYAA KRHMKNCSSSLAIKEMQIKTTMRYHLTPVRMAIIKKSGNNRCWRGCGEIGTLLHCWWDCK LVQPLWKSVWRFLRDLELEIPFDPVIPLLDIYPKDYKSCCYKDT >gi568815595r:16488663_16698599|GENSCAN_predicted_CDS_6|2295_bp atggataaatttctggacacatacactctcccaagactaaaccaggaagaagttgaatct ctgaatagaccaataacaggatctgaaattgtggcaataatcaatagcttaccaaccaaa aagagtccaggaccagatggattcacagccgaattctaccagaggtataaggaggaactg gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcccccctaactca ttttatgaggccagcatcatcctgataccaaagctgggcagagacacaaccaaaaaagaa aattttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactggca aaccgaatccagcagcacatcaaaaagcttttccgccatgatcaagtgggcttcatccct gggatgcaaggctggctcaatatacgcaaatcaataaatgtaatccagcatataaacaga accaaagtcaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaaatt caacaacccttcatgctaaaaactctcaataaattcggtattgatgggatgtatctcaaa ataataagagctatctatgacaaaaccacagccaatatcatactgaatgggcaaaaactg gaagcattccctttgaaaactggcacaagacagggatgccctctctcaccactcctattc aacatagtgttggaagttctggccagggcaattaggcaggagaaggaaataaagggtatt caattaggaaaagaggaagtcaaattgtccctgtttgcagacgacatgattatatatcta gaaaaccccattttctcagcccaaaatctccttaagctgataagcaacttcagcaaactc tcaggatacaaaatcaatgtgcaaaaatcacaagcattcttatacaccaataacagacaa acagagagccaaatcatgagtgaactcccattcacaattgcttcaaagggaataaaatac gtaggaatccaacttacaagggacgtgaaggacctcttcaaggagaactacaaaccactg ctcaaggaaataaaagaggatacaaacaaatggaagaacattccatgctcatgggtagga agaatcaatatcgtgaaaatggccatactgtccaaggtaatttatagattcaatgccatc cccatcaagctaccaatgactttcttcacagaattggaaaaaactactttaaacttcata tggaaccaaaaaagagcctgcatcaccaagtcaatcctaagccaaaagaacaaagctgga ggcatcacgctacctgacttcaaactatactacaaggctacagtaaccaaaacagcatgg tactggtaccaaaacagagatatagatcaatggaacagaacagagccctcagaaataaca ctgcatatctacaaatatctgatcattgacaaacctgacaaaaacaagcaatggggaaag gattccctatttaataaatggtgctgggaaaactggctagccatatgtagaaagctgaaa ctggatcccttccttacatcttatacaaaaattaattcaagatggattaaagacttaaac gttagacctaaaaccataaaaaccctagaagaaaacctaggcattaccattcaggacata ggcatgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaatt gacaaatgggatctatttaaactaaagagcttctgcacaacaaaagaaactaccatcaga gtgaacaggcaacctacaaaatgggagaaaatttttgcaacctactcatctgacaaaggg ctaatatccagaatctacaatgaactcaaacaaatttacaagaaaaaaacaaacaacccc atcaaaaagtgggtgaaggatatgaacagacacttctcaaaagaagacatttatgcagcc aaaagacacatgaaaaactgctcatcatcactggccatcaaagaaatgcaaatcaaaacc acaatgagataccatctcacaccagttagaatggcgatcattaaaaagtcaggaaacaac aggtgctggagaggatgtggagaaataggaacacttttacactgttggtgggactgtaaa ctagtccaaccattgtggaagtcagtgtggcgattcctcagggatctagaactagaaata ccatttgacccagtcatcccattactggatatatacccaaaggattataaatcatgctgc tataaagacacatga >gi568815595r:16488663_16698599|GENSCAN_predicted_peptide_7|67_aa MTTAMTRALVVNGKVVLEESEALVCRNMKMELEQANERECKVLKKIWGLAKLMDSMLKCL QRKIDEL >gi568815595r:16488663_16698599|GENSCAN_predicted_CDS_7|204_bp atgactacagccatgaccagggccctagttgttaatgggaaagttgtgcttgaggaatcc gaagcccttgtgtgccgcaacatgaagatggagttggagcaggccaatgagagggagtgc aaagtgctgaagaaaatctggggcttggccaaactgatggactccatgttaaagtgcttg cagaggaaaattgatgagttgtga