GENSCAN 1.0 Date run: 5-Nov-116 Time: 01:58:18 Sequence gi568815588f:112850757_113265631 : 414875 bp : 44.19% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1737 1793 57 2 0 93 59 51 0.288 4.01 1.02 Intr + 6982 7094 113 0 2 71 37 74 0.121 -0.22 1.03 Intr + 32087 32207 121 2 1 109 42 57 0.012 3.70 1.04 Intr + 48197 48271 75 1 0 96 48 64 0.386 2.91 1.05 Term + 68201 68407 207 1 0 86 48 173 0.992 10.44 1.06 PlyA + 70153 70158 6 1.05 2.00 Prom + 86061 86100 40 -2.96 2.01 Init + 100001 100189 189 1 0 65 75 231 0.608 18.51 2.02 Intr + 100451 100517 67 1 1 31 113 35 0.509 -1.22 2.03 Intr + 100727 100851 125 0 2 35 103 114 0.837 7.90 2.04 Term + 101606 101695 90 1 0 84 48 28 0.175 -3.88 2.05 PlyA + 102710 102715 6 1.05 3.05 PlyA - 103249 103244 6 1.05 3.04 Term - 114073 113928 146 2 2 122 41 211 0.917 17.87 3.03 Intr - 126875 126746 130 2 1 87 91 -15 0.225 -1.03 3.02 Intr - 130462 130323 140 2 2 33 91 80 0.510 2.98 3.01 Init - 136368 136362 7 0 1 55 123 12 0.506 1.74 3.00 Prom - 136628 136589 40 -4.36 4.00 Prom + 138090 138129 40 -3.46 4.01 Init + 138132 138186 55 2 1 74 97 43 0.461 5.25 4.02 Intr + 171060 171176 117 1 0 75 107 44 0.209 5.44 4.03 Term + 185185 185435 251 2 2 60 43 236 0.114 12.17 4.04 PlyA + 190567 190572 6 1.05 5.07 PlyA - 191080 191075 6 1.05 5.06 Term - 199526 199450 77 0 2 99 42 53 0.789 -0.20 5.05 Intr - 201370 201245 126 2 0 74 108 47 0.782 5.95 5.04 Intr - 208871 208808 64 2 1 77 58 32 0.015 -2.61 5.03 Intr - 219178 219006 173 2 2 91 48 115 0.113 7.46 5.02 Intr - 238978 238856 123 2 0 95 82 43 0.840 4.96 5.01 Init - 239328 239271 58 0 1 48 96 40 0.789 2.29 5.00 Prom - 247658 247619 40 -6.06 6.00 Prom + 250405 250444 40 0.94 6.01 Init + 264501 264506 6 2 0 74 100 10 0.790 1.11 6.02 Term + 270533 270661 129 1 0 121 53 81 0.960 6.08 6.03 PlyA + 271852 271857 6 1.05 7.00 Prom + 272170 272209 40 -5.46 7.01 Init + 274815 274958 144 2 0 48 33 102 0.015 0.82 7.02 Intr + 290428 290560 133 0 1 112 96 64 0.813 9.82 7.03 Intr + 292367 292491 125 0 2 53 80 32 0.432 -0.80 7.04 Intr + 293167 293228 62 0 2 120 16 38 0.407 -2.67 7.05 Intr + 295255 295341 87 1 0 85 109 87 0.957 9.59 7.06 Intr + 300242 300367 126 2 0 118 88 -8 0.782 1.99 7.07 Intr + 300969 301128 160 0 1 70 73 132 0.991 9.89 7.08 Intr + 301577 301684 108 1 0 87 42 125 0.610 8.28 7.09 Intr + 306885 307196 312 2 0 77 85 68 0.175 1.78 7.10 Intr + 307265 307313 49 1 1 89 115 62 0.959 7.25 7.11 Intr + 309164 309236 73 0 1 76 89 35 0.713 0.86 7.12 Term + 314799 315216 418 0 1 104 33 194 0.885 10.25 7.13 PlyA + 316594 316599 6 1.05 8.00 Prom + 320406 320445 40 -0.46 8.01 Init + 326186 326245 60 1 0 74 80 18 0.337 0.85 8.02 Intr + 326789 326894 106 1 1 49 55 62 0.393 -1.21 8.03 Term + 328106 328161 56 0 2 116 40 67 0.531 2.42 8.04 PlyA + 328550 328555 6 -0.45 9.04 PlyA - 329630 329625 6 1.05 9.03 Term - 332533 332379 155 2 2 66 39 93 0.704 0.28 9.02 Intr - 334902 334800 103 0 1 -18 90 152 0.748 4.55 9.01 Init - 344067 343993 75 0 0 82 93 30 0.278 3.99 9.00 Prom - 345478 345439 40 -6.76 10.05 PlyA - 345943 345938 6 1.05 10.04 Term - 351307 351195 113 2 2 93 49 113 0.892 6.62 10.03 Intr - 379497 379399 99 1 0 117 75 46 0.026 6.28 10.02 Intr - 393084 392968 117 1 0 104 62 68 0.846 6.24 10.01 Intr - 413778 413665 114 1 0 104 86 22 0.147 4.02 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 18177 18150 28 2 1 148 42 16 0.916 0.55 S.002 Term + 93503 93618 116 0 2 131 53 91 0.802 8.63 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:112850757_113265631|GENSCAN_predicted_peptide_1|190_aa MGPTKEEAASLDREVKAPEHMGTDSTVKIHAANRQVTGGSGEENARTATDSFPLRESSSS TKHKSPALALVKAMNVKGVYGHTLTHSSLHSWDSTCRASLVPAARIYTDVNLAISTSRSG KEVEQSEPLTTGLERKILKSFDPALRGAFYQELEGTFQGVNPFDRKYKMQDLQGTKRMQY MNLEKMMMMK >gi568815588f:112850757_113265631|GENSCAN_predicted_CDS_1|573_bp atggggcccactaaagaggaagccgcaagtctggacagagaagtgaaggctcctgagcac atggggactgatagtacggtgaagatccacgcagcaaataggcaggtgactggggggagt ggagaggagaatgcccgcacagccacagacagcttcccactcagagaaagttcctcctcc accaagcacaaatctccagccctggccctagtaaaggccatgaatgtaaaaggtgtttat ggtcatactctcactcactcatctcttcattcttgggattctacctgcagagcctcatta gtgcccgcagctcggatttacacagatgtaaatctggccattagcacctcacgctctggc aaggaggtggagcagtcagagcccctaaccactggcttggaaagaaaaatcctgaagtca tttgatccagctctaagaggtgcattctaccaggaactagaaggcaccttccagggagtt aacccttttgatcgcaagtacaaaatgcaagaccttcaaggcactaagaggatgcaatat atgaacttggaaaagatgatgatgatgaaatga >gi568815588f:112850757_113265631|GENSCAN_predicted_peptide_2|156_aa MPQLNGGGGDDLGANDELISFKDEGEQEEKSSENSSAERDLADVKSSLVNESETNQNSSS DSEAERRPPPRSESFRDKSRESLEEAAKRQDGGLFKGPPYPGYPFIMIPDLTSPYLPNGS LSPTARTGRERPGRRGGLRCPAHWLRALPGRAGQQG >gi568815588f:112850757_113265631|GENSCAN_predicted_CDS_2|471_bp atgccgcagctgaacggcggtggaggggatgacctaggcgccaacgacgaactgatttcc ttcaaagacgagggcgaacaggaggagaagagctccgaaaactcctcggcagagagggat ttagctgatgtcaaatcgtctctagtcaatgaatcagaaacgaatcaaaacagctcctcc gattccgaggcggaaagacggcctccgcctcgctccgaaagtttccgagacaaatcccgg gaaagtttggaagaagcggccaagaggcaagatggagggctctttaaggggccaccgtat cccggctaccccttcatcatgatccccgacctgacgagcccctacctccccaacggatcg ctctcgcccaccgcccgaaccggccgcgagcgccctgggcgccgtggcgggctccgctgc ccggcgcactggctgcgggctctcccgggccgcgcagggcagcagggctga >gi568815588f:112850757_113265631|GENSCAN_predicted_peptide_3|140_aa MLERNSFTNRIFCEETFLICFHHYENVATATSDKKTQSKAGCLKHTGGLVCPIIGFIHLV EVQFTLANITPVQATFSAPVAGARQICHRIITVIQPPPTTTTTTITTTTITTTTTITTTT TTIIIIIIIIIISVGGRGGE >gi568815588f:112850757_113265631|GENSCAN_predicted_CDS_3|423_bp atgctcgagagaaacagcttcacaaatcgtatcttctgtgaggagacatttctcatctgc tttcaccattatgaaaatgtagcaacagcaacaagtgacaagaaaacacaaagcaaagca ggctgcctaaaacatactggtggtttggtctgccccataattgggtttatccatcttgtt gaagtacagtttacacttgctaacatcacgccagttcaagctacattttctgctccagtg gcaggagcaaggcagatttgtcatcggatcataacagtgattcaacccccccccaccacc accaccaccaccatcaccaccaccaccatcaccaccaccaccaccatcaccaccaccacc accaccatcatcatcatcatcatcatcatcattatctcagttgggggaaggggcggagaa taa >gi568815588f:112850757_113265631|GENSCAN_predicted_peptide_4|140_aa MPLNIIPVIQIRKQNFKEEHGQCRRRKSTLVFGLNQRLAMVPKQGAKWLTLVFMPCAEKV CQPCPEQIKGQIPLAPSICGAESLKGNTGNTGSHLVFYQGCEVSWQLCRHIIIIITIIII IIIIIIIIIIIICPLSFLLV >gi568815588f:112850757_113265631|GENSCAN_predicted_CDS_4|423_bp atgcctttaaacatcatccccgttatacagataagaaaacagaatttcaaagaagaacat ggccagtgcaggaggaggaaatccacactggtctttggactgaaccagaggctggcgatg gtcccgaaacagggtgccaagtggctgacccttgtttttatgccttgcgctgaaaaagtt tgccaaccctgccctgaacaaataaagggacaaattccacttgccccgtccatctgtgga gcagagtcactgaaaggaaatactggaaatactggaagccacttggtgttttatcaagga tgtgaggtttcctggcaactttgtcgccatatcatcatcatcatcaccatcatcatcatc atcatcatcatcatcatcatcatcatcatcatcatctgccctttaagttttctgcttgtt tag >gi568815588f:112850757_113265631|GENSCAN_predicted_peptide_5|206_aa MLPVLGEQHLRVKPFTAASGVFMAMISLSAVSTPRHHGTQNDSDPTRIQGSQWMGPALER VISLTPTSHVIQWEPLSYAHQSILATSILEYLVLQRNLNQADESALADLNCTKQTREPFS DCQFPLGTLKPDPRTNTVHRKRTEPSSNNEVVIISQTEINNCMLSLPPIFHDLKNNASSN TTPEQHRRCPCERNKQLVPSEQEDKE >gi568815588f:112850757_113265631|GENSCAN_predicted_CDS_5|621_bp atgcttcctgtccttggagagcaacatctgcgagtaaagccattcacggctgcctcaggc gtttttatggctatgatatccctatcagcggtcagcacacccagacatcatggcacccaa aatgacagtgacccaacaagaattcagggatcgcagtggatgggccctgccctagagaga gtgatctccctgacaccgaccagccatgtgatccagtgggagccgctatcttacgctcac caaagcattcttgccactagtattcttgaatatctggttttacaaaggaatctcaatcag gccgatgaatcagctcttgcagatctcaactgtaccaagcagacacgagaacctttctca gactgtcaatttccactaggcacgttaaagcctgacccgcgcaccaacacagtccaccga aaacgtacagaaccttcatcaaataatgaagttgtaataatcagtcaaacagaaataaac aactgtatgctcagcttgccaccaattttccatgatttaaaaaataatgcttctagtaat acaacacctgagcaacacagacggtgtccttgtgagagaaacaagcagcttgtgccctca gagcaggaagacaaagagtaa >gi568815588f:112850757_113265631|GENSCAN_predicted_peptide_6|44_aa MKLSVRSKAGGSLWLTFHDNMPLEIAVLLINPLDMTDLPQGLAL >gi568815588f:112850757_113265631|GENSCAN_predicted_CDS_6|135_bp atgaagctgtctgtgaggagcaaagcaggtggctccctgtggctgacttttcatgacaac atgccattagaaattgccgtcctgttgatcaatcctctcgacatgacagatttgccgcaa ggcctggctctttga >gi568815588f:112850757_113265631|GENSCAN_predicted_peptide_7|598_aa MDDSEFSNGPAPMTNTILIFINCFDNANFDAVVKGLPKLPAKASELRKSNKVPVVQHPHH VHPLTPLITYSNEHFTPGNPPPHLPADVDPKTEWTTGCAPVELADVRWHVLAESAHVCLV KLLAPATHAVPIQQESHGLRTLQIYPRITHYRLAPQGQPVYPITTGGFRHPYPTALTVNA SMSRFPPHMVPPHHTLHTTGIPHPAIVTPTVKQESSQSDVGSLHSSKHQDSKKEEEKKKP HIKKPLNAFMLYMKEMRAKVVAECTLKESAAINQILGRRWHALSREEQAKYYELARKERQ LHMQLYPGWSARDNYMGSQTACNAIIPLLSESSSCRPPFPFMDQMLWVRFDNGHRYMRNE SFPSPPANPFSAPKKRVSLQLGYNARAFGLKRGWRLDKYRGFVWMEMAVWSLPPAFSFQG KKKKRKRDKQPGETNDLSAPKKCRARFGLDQQNNWCGPCRRKKKCVRYIQGEGSCLSPPS SDGSLLDSPPPSPNLLGSPPRDAKSQTEQTQPLSLSLKPDPLAHLSMMPPPPALLLAEAT HKASALCPNGALDLPPAALQPAAPSSSIAQPSTSSLHSHSSLAGTQPQPLSLVTKSLE >gi568815588f:112850757_113265631|GENSCAN_predicted_CDS_7|1797_bp atggatgactcggagttcagcaacgggccggctccaatgacaaatacaatattaatattc attaactgctttgacaatgctaattttgatgcagtagtaaagggccttccaaagttaccg gccaaagcatcagagctgcgcaagtctaacaaagtgccagtggtgcagcaccctcaccat gtccaccccctcacgcctcttatcacgtacagcaatgaacacttcacgccgggaaaccca cctccacacttaccagccgacgtagaccccaaaacagagtggactacaggatgtgctcct gtggaactggcagatgtgcggtggcacgtacttgcagaatccgctcatgtgtgccttgtg aagctcttagcccctgcgacgcacgcggtgcccattcagcaggaatcccacggcctccgc accctccagatatatccccgtattacccactatcgcctggcaccgcaaggtcaaccagtg tacccaatcacgacaggaggattcagacacccctaccccacagctctgaccgtcaatgct tccatgtccaggttccctccccatatggtcccaccacatcatacgctacacacgacgggc attccgcatccggccatagtcacaccaacagtcaaacaggaatcgtcccagagtgatgtc ggctcactccatagttcaaagcatcaggactccaaaaaggaagaagaaaagaagaagccc cacataaagaaacctcttaatgcattcatgttgtatatgaaggaaatgagagcaaaggtc gtagctgagtgcacgttgaaagaaagcgcggccatcaaccagatccttgggcggaggtgg catgcactgtccagagaagagcaagcgaaatactacgagctggcccggaaggagcgacag cttcatatgcaactgtaccccggctggtccgcgcgggataactatatgggctctcagact gcttgcaacgccatcatcccactcctctctgaatcttcctcgtgccgtcccccattccct tttatggatcaaatgctgtgggtcaggtttgacaatggacataggtacatgagaaatgag agcttcccttcaccacccgccaatcccttctctgctcccaagaagcgtgtgtccctccag ctggggtacaatgcaagggcatttggcttgaagcggggttggaggttggacaaataccgg gggtttgtgtggatggagatggcagtatggtcacttcctcctgccttctccttccaggga aagaagaagaagaggaaaagggacaagcagccgggagagaccaatgacctgagcgctcct aagaaatgccgagcgcgctttggccttgatcaacagaataactggtgcggcccttgcagg agaaaaaaaaagtgcgttcgctacatacaaggtgaaggcagctgcctcagcccaccctct tcagatggaagcttactagattcgcctcccccctccccgaacctgctaggctcccctccc cgagacgccaagtcacagactgagcagacccagcctctgtcgctgtccctgaagcccgac cccctggcccacctgtccatgatgcctccgccacccgccctcctgctcgctgaggccacc cacaaggcctccgccctctgtcccaacggggccctggacctgcccccagccgctttgcag cctgccgccccctcctcatcaattgcacagccgtcgacttcttccttacattcccacagc tccctggccgggacccagccccagccgctgtcgctcgtcaccaagtctttagaatag >gi568815588f:112850757_113265631|GENSCAN_predicted_peptide_8|73_aa MDVHNYQGPIFGGSCWQPPEPAQCGLPDSEFERVIERQIKAALVAAFPDRADPHARCLYN EDKILAGLKRTFQ >gi568815588f:112850757_113265631|GENSCAN_predicted_CDS_8|222_bp atggacgtccataattaccagggccccatatttgggggttcctgttggcagccccctgag cctgctcaatgtgggctccctgacagcgagtttgaaagggttatagagaggcagataaag gccgcgcttgttgcagccttccctgacagggccgacccccacgccaggtgcctttacaac gaggataagatccttgctggcctaaagagaacattccagtag >gi568815588f:112850757_113265631|GENSCAN_predicted_peptide_9|110_aa MAALIYGCLIEAVFLKNNLAECNITLFNRCAEQRYRLPWRPSGLSLNYREIWTHQDVEND SGLSTRAVILAGHAEGLSSKWNVPETGKGKALELLCRVSTGSHKPKSWGQ >gi568815588f:112850757_113265631|GENSCAN_predicted_CDS_9|333_bp atggctgctctcatttatggctgtttaattgaggcagtctttctgaagaacaatttagca gaatgtaacataactctttttaaccgctgtgctgaacaacgataccggttgccctggaga ccatctggtctgagcctcaattaccgagagatctggactcaccaggatgtagaaaacgac tcaggactgtcaacaagggcagtcatcctggcgggacatgcagaggggctgtcctctaaa tggaacgttccagagactggcaaagggaaggctctagagctcctctgcagagtctccacg ggctcccacaagcccaagagttgggggcagtag >gi568815588f:112850757_113265631|GENSCAN_predicted_peptide_10|147_aa XAVKRPFVSRDYGKFLPSSLLGDLPLLIGSVFDCQQMEDSFWLVLSPGRRGAEYHGQVHS EAENNAQQLDLLLDLAAGIAFLIPAQKDDLGAAGSSPGMSPQPPPKSRLPGRRALSHGHH YHKAQEEYCQANINVHLRPKISPVSLW >gi568815588f:112850757_113265631|GENSCAN_predicted_CDS_10|444_bp ngtgcagtcaaaaggccctttgtctccagagattatgggaaattccttccttcctccctg cttggggacctgccgctcttaataggttcagtatttgattgtcaacagatggaagattca ttctggctggtcctttctcctggaagaagaggggctgagtaccatgggcaggtccactcg gaagcagagaacaatgcccagcaacttgacctcctcctagacctggcagcagggatagct ttcctgattcctgcccagaaggatgacctgggggcagcaggatctagccccggaatgtcc ccacagcctcctcccaagtccagacttccaggcagaagagccttatcccatggccaccac taccacaaggcccaggaggagtactgccaggctaacatcaatgttcacttaaggcccaag atctctccagtcagcttgtggtga