GENSCAN 1.0	Date run: 4-Nov-116	Time: 02:10:12

Sequence gi568815596f:28652125_28899300 : 247176 bp : 42.44% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   3379   3540  162  0  0   84  -16   175 0.405   4.50
 1.02 Intr +  12037  12146  110  0  2   18   59   110 0.251  -0.74
 1.03 Intr +  12300  12467  168  0  0   34   63   169 0.651   7.14
 1.04 Term +  14737  14929  193  1  1   84   32   172 0.250   7.11
 1.05 PlyA +  16013  16018    6                              -0.45

 2.06 PlyA -  18767  18762    6                               1.05
 2.05 Term -  20035  19799  237  1  0   48   49   105 0.309  -2.22
 2.04 Intr -  20311  20084  228  1  0   82  100    41 0.348   1.84
 2.03 Intr -  21314  21247   68  0  2  144   69    39 0.442   5.31
 2.02 Intr -  26562  26185  378  1  0   73   49   245 0.378  13.11
 2.01 Init -  30869  30542  328  2  1   46   37   199 0.852   8.13
 2.00 Prom -  34658  34619   40                              -1.35

 3.00 Prom +  42302  42341   40                              -7.45
 3.01 Init +  42986  43294  309  1  0   86  105    84 0.098   7.26
 3.02 Intr +  46431  46611  181  2  1   14   75   101 0.081  -0.08
 3.03 Intr +  56313  56453  141  1  0   81   98    45 0.619   4.20
 3.04 Intr +  57250  57425  176  2  2   45   83   148 0.191   8.74
 3.05 Intr +  76624  76692   69  0  0   77  113    17 0.015   1.56
 3.06 Intr +  99914 100316  403  1  1   32    2   367 0.003  15.58
 3.07 Intr + 124727 124858  132  0  0   52   68    92 0.655   3.30
 3.08 Intr + 126685 126915  231  2  0  109  101   129 0.997  13.12
 3.09 Intr + 129614 129718  105  0  0  110   59    42 0.869   2.77
 3.10 Intr + 131783 131854   72  0  0   77  111   106 0.999  10.26
 3.11 Intr + 136534 136685  152  2  2   90   80    70 0.889   5.36
 3.12 Intr + 141739 141873  135  0  0  104   80   149 0.999  15.54
 3.13 Term + 147075 147179  105  2  0   34   49   183 0.821   6.43
 3.14 PlyA + 149125 149130    6                               1.05

 4.02 PlyA - 150039 150034    6                               1.05
 4.01 Sngl - 159377 158703  675  2  0   56   45   308 0.999  19.23
 4.00 Prom - 163829 163790   40                              -4.35

 5.00 Prom + 165449 165488   40                              -2.45
 5.01 Init + 168637 168647   11  0  2   37   77    10 0.086  -5.25
 5.02 Intr + 177024 177195  172  0  1   53   94   104 0.378   6.52
 5.03 Intr + 188048 188345  298  1  1   50   86   215 0.440  13.02
 5.04 Term + 196186 196214   29  2  2   77   54    36 0.066  -3.64
 5.05 PlyA + 196525 196530    6                               1.05

 6.07 PlyA - 197721 197716    6                               1.05
 6.06 Term - 199164 198901  264  0  0   14   44   205 0.234   3.22
 6.05 Intr - 209184 208994  191  1  2   18   92   101 0.870   1.88
 6.04 Intr - 212995 212893  103  1  1   74  110    72 0.860   6.83
 6.03 Intr - 214268 214221   48  2  0   60   53    89 0.498   0.56
 6.02 Intr - 218286 217455  832  2  1   73   90   427 0.724  31.87
 6.01 Init - 223331 223186  146  2  2   35   80   184 0.895  11.94
 6.00 Prom - 232608 232569   40                              -7.15

 7.00 Prom + 237703 237742   40                              -9.25
 7.01 Init + 240593 240595    3  1  0  113   22     0 0.184  -4.05
 7.02 Intr + 242509 242799  291  0  0  113   98   347 0.874  34.61
 7.03 Term + 246038 246106   69  1  0  116   46    11 0.375  -3.34
 7.04 PlyA + 246547 246552    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:28652125_28899300|GENSCAN_predicted_peptide_1|210_aa
MAKLLPAPASSYTPHVLGTGLWRTQGGHILCSLEHNMFSAIQTRLQIAPSQQDVGTESCL
GYRYWNTVTAQYAELNCCKEHGCALVKHRPRRAGTQRKRETELSVFADEQRGARTQLRLL
VPRERIKLLTLKANESRLCSSVWEWPDNGTGAPALAVLPPTGPDTFHHPEPILDKSVASR
GCRSYRSSHTSIATDATCPPTVFTPSWSSP
>gi568815596f:28652125_28899300|GENSCAN_predicted_CDS_1|633_bp
atggccaagctgttgcctgcccccgcctcctcctatacacctcatgtactggggacaggc
ctatggaggacccaaggaggacacattctctgctccctggaacacaacatgttttcagcc
atccagacccgacttcaaatagcaccttctcagcaggatgtgggcacagaaagctgttta
ggttacaggtactggaatacagtgacggcacagtacgcggagctgaattgctgcaaggaa
catggctgtgctttggtcaagcacaggccgaggcgtgctggaacccagagaaagagagag
acagagctgtccgtctttgcagatgaacagaggggagccaggacacagctcaggctgctc
gtgcccagagaaagaattaagctgctgaccctgaaggcaaacgagagccggctgtgcagc
tctgtgtgggagtggccagacaatggaactggggctccggcattggcagtcctgcctcct
acaggccctgacacctttcaccatcctgagccaattctggataagagtgtggccagtcgt
ggatgccggagttaccgttccagccacacatctattgcaacagatgccacatgtccacca
actgtctttacaccatcctggtcatcaccctag
>gi568815596f:28652125_28899300|GENSCAN_predicted_peptide_2|412_aa
MPKLLAVSQWDLLSDQGRSGPRTPLTSSPVCPPLGAPHLGTASFKDPDATQGFGHPPPSN
TGIGDPVSFLSHQFLRKIGIDDSSFFKISVSLFHLLWALLHLESDFPQGGDRVLGAVGHA
AKVYGKLHPWAIVLPSLKANLFPFPRMACSAISIYCLAQHPVPRTPWRILRPQTMKCLIR
SKRKSTGVTGVHVDAIPQVTLKWLWLLLHVDRSQLAAVRILTMLPKRRGAESAPADHSHW
GKQAAMLWATLQRGPRSVLPLTGKNMECLVFCSCVSLVRMMASSFIHGELPLIKPSDLVR
LIHCHENSIGETAPMIQLSPPGPTLDMWGLLQFKIADFVQCIIHTTAYSNPGPTPAENFA
STVVDDLLFCNPDDLSLVPLSFDLSALLDTVDHPCTLTVCYVGLQETGLISC
>gi568815596f:28652125_28899300|GENSCAN_predicted_CDS_2|1239_bp
atgccgaagctcttagctgtttcacaatgggacctcctcagtgaccaaggccgctctggt
cctcggaccccattgacaagcagcccagtttgccctcctttgggggcccctcatctggga
acagcatcctttaaggaccctgatgctactcagggttttggtcatcctccaccatctaat
acgggaattggggaccctgtttcatttctgtctcatcagtttctcaggaagattggcata
gatgacagctcctttttcaaaatctctgtctctctgttccaccttctgtgggccctgctt
catcttgagtctgatttccctcagggtggtgaccgtgtccttggtgctgtgggccacgct
gccaaggtttatgggaagctacacccctgggctattgtgttgccctccctgaaggcaaat
ctctttcccttcccacgaatggcatgttccgccatctctatttattgtcttgcccagcat
ccagttcccagaactccctggagaattctcagacctcaaacaatgaagtgtctcattcgg
agtaaaaggaagagcacaggagtcaccggagtccacgtggacgcaattccacaagtgacc
ttgaaatggctgtggctgctccttcatgttgaccgaagccagctggcagcggtcaggatt
ctgaccatgttgccaaaaagaagaggggcagaaagtgctccagcagatcattcgcactgg
ggcaagcaagctgccatgttgtgggcaactttacagagaggcccacgtagcgtgctccca
cttacgggtaagaacatggagtgtttggttttctgttcctgtgttagtttggtgaggatg
atggcttccagcttcatccatggggaactcccacttataaaaccatcagatcttgtgaga
cttattcactgccacgagaacagtataggggaaaccgcccccatgattcaattatctcca
cctggccccacccttgacatgtggggattattacaattcaagattgctgactttgtgcag
tgcataatccacactactgcatatagcaaccctggccccactcctgctgaaaatttcgcc
tccactgttgttgatgacctgctgttttgcaaccctgatgatctttccttagttcccctt
tcctttgatctctcagctttacttgacactgtggaccacccctgcaccctcacagtttgt
tatgtgggattgcaggagacaggactgatatcttgctga
>gi568815596f:28652125_28899300|GENSCAN_predicted_peptide_3|736_aa
MGEKAGIQAGVHTTEGEISILRGICKDLVTVKACLCTKKKKKKKEEGREREGKREGKRER
EKERKPARRKLSFFPINQLSGAQVAKLQGKWNSLCSLKGALGEENEHEGLKYLCKTHNNS
VLEPRVRLGCAGPKAQFLSTVSRSSLLVPCVLENDGCGKQPQADFAKEQEPNKREKELSM
IVDSCIIMKRDQDLAGCSGSHYIPNALGGQVPALKLEIKRRAPKALAFHPKPPKAVGRIQ
NYVSPAGPGRSLSGERNRHREQRDGGGKVKKFADPWTRVSTSYPLGSFGFLKVRERRAVA
AASAAEKPLFPLLGRRVCADKMADGELNVDSLITRLLEGECAPGRGTEGGRAPPPTPASP
SAAGTRGDPFPAPRRVSGSAARRTNRGGGEEALGAGERPLGARSGEIGGEGAVPADPRGP
GPPAEGLRGCRPGKIVQMTEAEVRGLCIKSREIFLSQPILLELEAPLKICGDIHGQYTDL
LRLFEYGGFPPEANYLFLGDYVDRGKQSLETICLLLAYKIKYPENFFLLRGNHECASINR
IYGFYDECKRRFNIKLWKTFTDCFNCLPIAAIVDEKIFCCHGGLSPDLQSMEQIRRIMRP
TDVPDTGLLCDLLWSDPDKDVQGWGENDRGVSFTFGADVVSKFLNRHDLDLICRAHQVVE
DGYEFFAKRQLVTLFSAPNYCGEFDNAGGMMSVDETLMCSFQILKPSEKKAKYQYGGLNS
GRPVTPPRTANPPKKR
>gi568815596f:28652125_28899300|GENSCAN_predicted_CDS_3|2211_bp
atgggggagaaagccggtattcaggcaggagtccacactacagagggtgaaatcagcatc
ctcagaggaatttgcaaggaccttgttacagtcaaggcttgcttatgtaccaaaaagaaa
aagaaaaaaaaagaagagggaagggagagagaaggaaagagagagggaaagagggagaga
gaaaaagaaaggaaacctgctcggagaaagttaagcttttttccaataaatcagttgtca
ggggctcaggttgctaagctacagggaaagtggaacagcctgtgctcgctaaaaggagct
ttgggggaggagaatgagcatgaagggctcaaatacttgtgtaagacacacaacaattca
gtgttggagccaagagtcagacttggatgtgctggacctaaagctcagtttctttccact
gtgtcacgttcatctctcttggtaccttgcgtccttgaaaatgatggatgtgggaagcag
ccacaggcagactttgcaaaggaacaagaacccaacaaaagggaaaaagaactctccatg
attgttgacagctgtataataatgaaaagagaccaggacctggccgggtgcagtggctca
cactatattcccaacgctttgggaggccaagtgccggccttaaaacttgaaatcaaacgc
agggctcccaaggcgctggcctttcatccaaaaccaccgaaggctgtgggcaggattcag
aattatgtttcaccagctggtccaggccgcagtttgtctggggagagaaacagacacagg
gaacaaagagatggaggtgggaaggtgaaaaagtttgctgacccatggactagagtatcc
acatcttatccactggggtcttttgggttcttgaaggtgagagaacgccgagccgtcgcc
gcagcctccgccgccgagaagcccttgttcccgctgctgggaaggagagtctgtgccgac
aagatggcggacggggagctgaacgtggacagcctcatcacccggctgctggagggtgag
tgcgcgcctggccgcgggacagagggaggtcgggcaccgccgccgacccctgcgtccccg
tctgccgccggaacgcgaggggacccctttcccgccccgagacgagtctctgggagcgcg
gcgcggcggacgaaccgaggagggggcgaggaggctctgggcgcgggggagcggcctctg
ggagcgcggtcaggggagatcgggggagagggggccgttcccgcggaccctcgggggcca
ggcccgccggccgaaggcttacgaggatgtcgtccaggaaagattgtgcagatgactgaa
gcagaagttcgaggcttatgtatcaagtctcgggagatctttctcagccagcctattctt
ttggaattggaagcaccgctgaaaatttgtggagatattcatggacaatatacagattta
ctgagattatttgaatatggaggtttcccaccagaagccaactatcttttcttaggagat
tatgtggacagaggaaagcagtctttggaaaccatttgtttgctattggcttataaaatc
aaatatccagagaacttctttctcttaagaggaaaccatgagtgtgctagcatcaatcgc
atttatggattctatgatgaatgcaaacgaagatttaatattaaattgtggaagaccttc
actgattgttttaactgtctgcctatagcagccattgtggatgagaagatcttctgttgt
catggaggattgtcaccagacctgcaatctatggagcagattcggagaattatgagacct
actgatgtccctgatacaggtttgctctgtgatttgctatggtctgatccagataaggat
gtgcaaggctggggagaaaatgatcgtggtgtttcctttacttttggagctgatgtagtc
agtaaatttctgaatcgtcatgatttagatttgatttgtcgagctcatcaggtggtggaa
gatggatatgaattttttgctaaacgacagttggtaaccttattttcagccccaaattac
tgtggcgagtttgataatgctggtggaatgatgagtgtggatgaaactttgatgtgttca
tttcagatattgaaaccatctgaaaagaaagctaaataccagtatggtggactgaattct
ggacgtcctgtcactccacctcgaacagctaatccgccgaagaaaaggtga
>gi568815596f:28652125_28899300|GENSCAN_predicted_peptide_4|224_aa
MSYGPGTETQQLRSQNSGADDLGDKKRCLMGHKEVGFIKKTPQISIPPTIKAAGTRGDGS
ACPGPSLRADGRASGAASGIGPARPSPGTFTRYAAGRRAAKAPTCAATASLARSLSPEPA
AGSACVVAAKAAEGAHGRRREDEAGALPSHLRRAVGSPASEPRDRAHSKLEIRLRGPFPG
ASTGTCGSPGLRGRGPGNGGQGSVAALLASDGCSSVASQQRYPL
>gi568815596f:28652125_28899300|GENSCAN_predicted_CDS_4|675_bp
atgagctacgggcctggtactgagacacagcagctcagatctcaaaattctggggcagac
gatcttggggacaaaaagaggtgtttaatgggacataaggaggtaggatttatcaaaaag
accccccaaatctccattcctcccacaataaaggcggcaggcacacgtggagacgggagc
gcctgcccagggccctccctccgagcagacggccgagcttcgggagcagcctccggtatc
ggccctgcccgtccttcccctggaaccttcacccgctacgccgccgggcggagggcggcc
aaagccccaacctgcgcggccactgcctccctcgccaggtccctcagcccagagcccgct
gcggggagcgcgtgtgtcgtcgccgcgaaggcagctgagggcgcccacgggaggcggcgt
gaggacgaggctggagcgctgccttctcatctaaggcgggcggtggggtcgccggcgagc
gaacccagggaccgggcacactcgaaactggagattcgcctgcgaggccccttcccgggg
gcgagcacaggtacctgcggaagcccggggctgcgcgggagagggccgggcaacggcggt
caaggctccgtcgcagcgctcctggcctcagacggttgctcgtcggtcgctagccagcag
cggtacccgctctaa
>gi568815596f:28652125_28899300|GENSCAN_predicted_peptide_5|169_aa
MSGRYLANTVEEDEEETKYEIFPWALGKNWRKLFPNFLKLRDQLWDRIDYRAIVSRRCCE
EVMAIAPTHYIWQRERSVHHSGAVRNYNRDEVQLPRGPSATPVDCSLCGKKRRYVRLGLS
SSSSLSSHTAGVTEKHSQDSYNSLSMDIIGDPSQAYTGSEGYTNSFTGI
>gi568815596f:28652125_28899300|GENSCAN_predicted_CDS_5|510_bp
atgagtggaaggtatctggctaatacagttgaagaagatgaagaagaaaccaagtacgaa
atttttccatgggctttagggaaaaactggagaaaattgttccctaatttcttaaagtta
agggaccagctctgggatagaattgactatagggctattgtaagcaggcgatgttgtgag
gaggttatggccattgcaccaacccattatatctggcaaagagaacgttctgttcatcac
agtggagctgtcagaaactacaacagagatgaagttcagctgccccggggacctagtgcc
acaccagtagattgttcactctgtggtaaaaaaagaagatatgttagactgggattgtct
tcatcatcatctttatccagtcatacagcaggggtgacagaaaaacattctcaggactca
tacaactcactgtcaatggacataataggtgatccttctcaagcttatactggttctgaa
ggatacaccaattccttcacgggcatatga
>gi568815596f:28652125_28899300|GENSCAN_predicted_peptide_6|527_aa
MFEKVLNFSEFGFAVSSNTAVNPRGEVLQNPDSSLAATGDKVKKQEKSRRSRGAVEPHAA
AEPSGCCAMRATGKEGVALGLRHSSATAPSRNTMLMAWCRGPVLLCLRQGLGTNSFLHGL
GQEPFEGARSLCCRSSPRDLRDGEREHEAAQRKAPGAESCPSLPLSISDIGTGCLSSLEN
LRLPTLREESSPRELEDSSGDQGRCGPTHQGSEDPSMLSQAQSATEVEERHVSPSCSTSR
ERPFQAGELILAETGEGETKFKKLFRLNNFGLLNSNWGAVPFGKIVGKFPGQILRSSFGK
QYMLRRPALEDYVVLMKRGTAITFPKSVEKLSSTKRVPSAQKDINMILSMMDINPGDTVL
EAGSGSGGMSLFLSKAVGSQGRVISFEVRKDHHDLAKKNYKHWRDSWKLSHVEEWPDNVD
FIHKDISGATEDIKSLTFDAVIELLDGIRTCELALSCEKISEVIVRDWLVCLAKQKNGIL
AQKVESKINTDVQLDSQEKIGVKGELFQEDDHGELQFYFMHAVMNGE
>gi568815596f:28652125_28899300|GENSCAN_predicted_CDS_6|1584_bp
atgtttgagaaggttttaaacttctctgaatttggatttgcagtttctagtaacacagca
gtaaatcctagaggagaagttctacagaatccagattcaagtttagcagcaactggggat
aaagtgaaaaagcaggagaaaagcaggcgcagtcgcggagctgtagagccccacgcagct
gcagagccatcgggctgctgcgccatgcgcgcgactgggaaagaaggggtcgcgctaggc
ttgcgtcactcgtctgcgacggcgccttcgcgaaacactatgctaatggcatggtgccgc
ggtcctgtcttgctgtgcctgcggcaggggctcggaaccaattcattcctgcacggcctg
gggcaggagcccttcgagggagctcggtcactgtgttgcaggtcctcgcctagagacctg
cgagatggagaaagagagcacgaggcggcacaaaggaaagccccaggagcagagtcttgc
ccatctctccctctgagcatctcggacattgggactggatgtctttcgtcactggaaaac
ctcagactgccgacgctgcgggaagagtcatcccctcgagagctcgaggactcgagcgga
gaccagggccggtgcggtcccacacaccagggatccgaggatccttcgatgctctcgcag
gcccagtccgctaccgaggtcgaagagcgtcacgtctccccttcttgttcaacttccaga
gagagaccctttcaggctggggaactgattttagctgagactggggagggagaaacaaaa
tttaagaaattatttaggttgaacaacttcggactcttaaatagtaactggggggcagtc
ccgttcggcaagatcgtggggaagttccccggccagatactgaggagttccttcggtaag
cagtacatgctgaggaggccagccttggaagactatgtagtattgatgaaaagagggact
gccataacattcccaaagtctgtggaaaaactgtcttccacgaaacgggtccctagtgcc
caaaaggatattaatatgattctctcaatgatggatatcaacccaggtgatactgttttg
gaagctggctcaggctctggtggaatgagcttatttttatccaaagcagttggatcacaa
ggacgagtcataagttttgaggtacgaaaagaccaccatgatctggctaagaagaattac
aaacactggcgtgattcatggaaattaagtcatgtagaagagtggccagacaatgtggat
tttattcataaggacatttcaggagcaaccgaagacataaaatctttaacatttgacgca
gttattgaacttttagatggaattcgcacctgtgaacttgctctttcatgtgaaaagata
agcgaggtcattgtcagagattggttggtttgccttgcaaaacagaaaaatggaatttta
gctcaaaaagtagaatctaaaatcaacacagatgtacaactagattctcaagagaaaatt
ggagttaaaggtgagctgtttcaagaggatgaccatggtgagcttcagttttactttatg
catgcagtaatgaatggagaatga
>gi568815596f:28652125_28899300|GENSCAN_predicted_peptide_7|120_aa
MAPPRAGAPAHGRTRGCSGARAAMAAGGGGSCDPLAPAGVPCAFSPHSQAYFALASTDGH
LRVWETANNRLHQEYVPSAHLSGTCTCLAWAPARLQAKTLSAIGYHSKCVYTALLSLQLR
>gi568815596f:28652125_28899300|GENSCAN_predicted_CDS_7|363_bp
atggctccgccccgcgccggtgcgcctgcgcacggacgaacacgtggctgcagcggggcc
agagcagcaatggcggcgggcggcggcggtagctgcgaccccctggcccctgctggggtc
ccttgcgccttctccccgcacagccaggcctacttcgctttggcctctaccgacggtcac
ttacgagtatgggagacggccaacaaccggctgcaccaggagtacgtgccttccgcgcac
ctcagtggtacctgcacctgtctggcctgggcgccagcgcggctgcaggccaagaccttg
tctgctattggctatcacagtaaatgtgtttatactgctttgctgtcactgcaattaaga
tga