GENSCAN 1.0 Date run: 8-Nov-116 Time: 13:41:51 Sequence gi568815593r:151216820_151447460 : 230641 bp : 43.30% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 448 594 147 0 0 122 64 27 0.663 4.13 1.02 Intr + 6657 6930 274 2 1 74 55 130 0.706 5.41 1.03 Term + 7147 7295 149 2 2 70 52 80 0.405 0.66 1.04 PlyA + 8006 8011 6 1.05 2.03 PlyA - 8030 8025 6 1.05 2.02 Term - 29665 29514 152 2 2 37 36 169 0.366 4.67 2.01 Init - 31616 31370 247 2 1 42 16 179 0.459 3.86 2.00 Prom - 32463 32424 40 -8.26 3.00 Prom + 35107 35146 40 -6.56 3.01 Init + 36398 36478 81 1 0 47 113 204 0.832 17.77 3.02 Intr + 42178 42258 81 0 0 -8 89 121 0.095 2.43 3.03 Intr + 42936 43097 162 2 0 77 105 103 0.911 11.07 3.04 Intr + 49912 50094 183 0 0 110 100 110 0.999 14.38 3.05 Term + 50477 50632 156 1 0 95 38 164 0.997 10.03 3.06 PlyA + 52400 52405 6 1.05 4.26 PlyA - 56057 56052 6 1.05 4.25 Term - 60842 60574 269 0 2 111 32 283 0.903 20.66 4.24 Intr - 64364 64195 170 1 2 91 76 201 0.998 18.79 4.23 Intr - 67391 67225 167 2 2 122 84 127 0.980 14.46 4.22 Intr - 67892 67794 99 2 0 83 95 73 0.993 7.81 4.21 Intr - 70645 70427 219 1 0 47 58 297 0.976 21.10 4.20 Intr - 71651 71567 85 1 1 155 103 2 0.997 8.22 4.19 Intr - 76640 76545 96 1 0 111 62 81 0.946 6.92 4.18 Intr - 79449 79361 89 0 2 144 55 100 0.998 11.07 4.17 Intr - 81864 81774 91 2 1 123 91 33 0.907 7.10 4.16 Intr - 86535 86408 128 0 2 44 97 61 0.366 2.08 4.15 Intr - 87328 87290 39 1 0 95 86 11 0.188 0.02 4.14 Intr - 92842 92658 185 2 2 49 43 193 0.285 10.41 4.13 Intr - 93347 93285 63 0 0 50 61 87 0.039 0.89 4.12 Intr - 93551 93529 23 1 2 38 117 19 0.032 -3.01 4.11 Intr - 100269 100002 268 1 1 118 97 468 0.949 47.29 4.10 Intr - 105396 105227 170 2 2 68 84 274 0.960 24.59 4.09 Intr - 108633 108467 167 0 2 71 30 172 0.352 8.46 4.08 Intr - 116503 116405 99 1 0 114 79 18 0.847 3.81 4.07 Intr - 118728 118510 219 0 0 126 109 344 0.704 38.80 4.06 Intr - 122325 122241 85 2 1 123 80 17 0.001 4.22 4.05 Intr - 126164 126069 96 1 0 76 86 56 0.469 3.32 4.04 Intr - 126779 126691 89 2 2 129 78 87 0.638 10.57 4.03 Intr - 127448 127358 91 1 1 110 99 99 0.990 13.20 4.02 Intr - 130437 130408 30 2 0 97 90 17 0.507 0.05 4.01 Init - 130641 130478 164 0 2 75 92 165 0.685 14.81 4.00 Prom - 135292 135253 40 -4.66 5.04 PlyA - 137132 137127 6 1.05 5.03 Term - 144168 143960 209 1 2 16 32 192 0.033 3.90 5.02 Intr - 148686 148650 37 0 1 89 36 20 0.011 -5.26 5.01 Init - 149807 149652 156 2 0 67 36 280 0.602 18.67 5.00 Prom - 150578 150539 40 -4.06 6.00 Prom + 154360 154399 40 -3.46 6.01 Init + 163894 164121 228 0 0 33 -15 192 0.482 1.77 6.02 Intr + 164210 164359 150 1 0 42 40 167 0.041 7.66 6.03 Term + 165132 165566 435 2 0 66 45 295 0.060 18.29 6.04 PlyA + 166418 166423 6 1.05 7.04 PlyA - 168124 168119 6 1.05 7.03 Term - 170466 170293 174 0 0 79 46 81 0.617 0.76 7.02 Intr - 171048 170996 53 1 2 74 116 30 0.830 3.03 7.01 Init - 172950 172818 133 0 1 84 44 65 0.806 2.00 7.00 Prom - 173074 173035 40 -2.46 8.00 Prom + 174156 174195 40 -4.66 8.01 Init + 195695 195866 172 1 1 58 33 192 0.102 10.30 8.02 Intr + 200654 200795 142 0 1 111 78 12 0.178 1.91 8.03 Term + 203616 203715 100 0 1 68 39 59 0.085 -3.40 8.04 PlyA + 203991 203996 6 1.05 9.03 PlyA - 206538 206533 6 1.05 9.02 Term - 208578 208506 73 2 1 159 49 39 0.917 4.48 9.01 Init - 214662 214463 200 0 2 100 99 106 0.911 11.28 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 42172 42258 87 0 0 53 89 107 0.885 7.94 S.002 Init - 93345 93285 61 0 1 57 61 80 0.923 3.61 S.003 Term + 123199 123549 351 0 0 -37 41 509 0.950 28.19 S.004 Sngl - 144121 143960 162 1 0 74 32 178 0.837 5.39 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:151216820_151447460|GENSCAN_predicted_peptide_1|189_aa MQLLVSKSNGPCPGTSLFTVLPLLLPRWVPKGGTGFLLSFLIAGNLGLGVPRDADTASLW NTLRAALRLSAFNPRGEEAEGPERSSHLSMATQRTAPDRKQRAPATDTRHLPTRGSGNQL PFPGPGAVFVNSAAWCGSGRAHPPPGAPGRRFPTPGPPEEPAIRAPLPAPRPPAGAAGET LKLNLEALS >gi568815593r:151216820_151447460|GENSCAN_predicted_CDS_1|570_bp atgcagctgctggtgtccaaaagcaatgggccctgtccaggcacctccttgtttacagtg ctgcctcttctgctgccaagatgggttccaaagggaggcactgggttcttgctgtcattt ttgattgctggcaacttggggctgggggtccccagggatgctgatactgcgagtctgtgg aacactcttcgagcagccctgagattatcagcctttaacccccggggggaggaagccgaa ggcccagagaggtcaagtcacttgtccatggccacacagcgaacagccccagaccgaaag cagagggcacctgccacagacacacgacatctccccacccgaggcagcgggaatcagctg cctttccctggcccgggtgccgtgtttgtaaactcggctgcttggtgcggctcgggccgg gcccatcctcctccgggggctcccggacgccgcttcccaactccggggcccccagaggag cctgcgatccgggccccgctgcccgctccgcgcccgccggctggggctgccggcgagacc ctgaaactgaacctggaagcgctgtcttga >gi568815593r:151216820_151447460|GENSCAN_predicted_peptide_2|132_aa MAAIIGDALNAQRASKGNPKDYKDNTSKGKSCFKCKKSGHWAKECTKPPPSSSPQAPAVN AKAPVTNPGTGELIASAPTEGLVQGLDWVHLSRIKPAIPEDPDQEPEVSISHYTCEPVEA LKFLFKRQPKDE >gi568815593r:151216820_151447460|GENSCAN_predicted_CDS_2|399_bp atggcagccatcattggtgatgccctgaatgcccaaagagcatctaagggaaacccgaag gactataaagataataccagcaaaggcaagtcttgcttcaaatgcaagaaaagcgggcat tgggcaaaggaatgtactaagcccccgccctccagctcccctcaggcccctgctgtcaac gcaaaggcaccagtcacgaaccctggcactggagaattgattgcctctgctcccactgag gggctggtacaaggattagattgggtacatctttcaaggatcaagccagcaataccagaa gatccggaccaggaacctgaagtttccatcagccactacacctgtgaacctgtggaagcc ctgaagttcctgtttaaaagacagccaaaagatgagtaa >gi568815593r:151216820_151447460|GENSCAN_predicted_peptide_3|220_aa MQSLMQAPLLIALGLLLAAPAQAHLKKSHADVVVVESGDLGDVEEEQEQLSTGVPSQLSS FSWDNCDEGKDPAVIRSLTLEPDPIIVPGNVTLSVMGSTSVPLSSPLKVDLVLEKEVAGL WIKIPCTDYIGSCTFEHFCDVLDMLIPTGEPCPEPLRTYGLPCHCPFKEGTYSLPKSEFV VPDLELPSWLTTGNYRIESVLSSSGKRLGCIKIAASLKGI >gi568815593r:151216820_151447460|GENSCAN_predicted_CDS_3|663_bp atgcagtccctgatgcaggctcccctcctgatcgccctgggcttgcttctcgcggcccct gcgcaagcccacctgaaaaagtcacatgcagatgtagtggtggtggaatcaggagacctg ggggatgtggaggaggaacaggagcagctcagcacaggggtgccatcccagctcagtagc ttttcctgggataactgtgatgaagggaaggaccctgcggtgatcagaagcctgactctg gagcctgaccccatcatcgttcctggaaatgtgaccctcagtgtcatgggcagcaccagt gtccccctgagttctcctctgaaggtggatttagttttggagaaggaggtggctggcctc tggatcaagatcccatgcacagactacattggcagctgtacctttgaacacttctgtgat gtgcttgacatgttaattcctactggggagccctgcccagagcccctgcgtacctatggg cttccttgccactgtcccttcaaagaaggaacctactcactgcccaagagcgaattcgtt gtgcctgacctggagctgcccagttggctcaccaccgggaactaccgcatagagagcgtc ctgagcagcagtgggaagcgtctgggctgcatcaagatcgctgcctctctaaagggcata taa >gi568815593r:151216820_151447460|GENSCAN_predicted_peptide_4|1066_aa MSVTKSTEGPQGAVAIKLDLMSPPESAKKLENKDSTFLDESPSESAGLKKTKGITLKLAL AKTLRVFQALIHLVKGNMGTGILGLPLAVKNAGILMGPLSLLVMGFIACHCMHILVKCAQ RFCKRLNKPFMDYGDTVMHGLEANPNAWLQNHAHWGRHIVSFFLIITQLGFCCVYIVFLA DNLKQVVEAVNSTTNNCYSNETVILTPTMDSRLYMLSFLPFLVLLVLIRNLRILTIFSML ANISMLVSLVIIIQYITQEIPDPSRLPLVASWKTYPLFFGTAIFSFESIGVVLPLENKMK NARHFPAILSLGMSIVTSLYIGMAALGYLRFGDDIKASISLNLPNCWLYQSVKLLYIAGI LCTYALQFYVPAEIIIPFAISRVSTRWALPLDLSIRLVMVCLTCLLAILIPRLDLVISLV GSVSGTALALIIPPLLEVTTFYSEGMSPLTIFKDALISILGFVGFVVGTYQALDELLKSE DSHPFSNSTTFVRSEAADLPGMKLQTFAESVTAHKGSVDPKNSGAQLASPSGSHTGDAGG ATCQSCAVGPHSSALGWSMGLGAVEQGAALIGEALAAQEPMKRPLQIVNSMRTEIVMSLL GRDYNSELNSLDNGPQSPSESSSSITSENVHPAGEAGLSMMQTLIHLLKCNIGTGLLGLP LAIKNAGLLVGPVSLLAIGVLTVHCMVILLNCAQHLSQRLQKTFVNYGEATMYGLETCPN TWLRAHAVWGRYTVSFLLVITQLGFCSVYFMFMADNLQQMVEKAHVTSNICQPREILTLT PILDIRFYMLIILPFLILLVFIQNLKVLSVFSTLANITTLGSMALIFEYIMEGIPYPSNL PLMANWKTFLLFFGTAIFTFEGVGMVLPLKNQMKHPQQFSFVLYLGMSIVIILYILLGTL GYMKFGSDTQASITLNLPNCWLYQSVKLMYSIGIFFTYALQFHVPAEIIIPFAISQVSES WALFVDLSVRSALVCLTCVSAILIPRLDLVISLVGSVSSSALALIIPALLEIVIFYSEDM SCVTIAKDIMISIVGLLGCIFGTYQALYELPQPISHSMANSTGVHA >gi568815593r:151216820_151447460|GENSCAN_predicted_CDS_4|3201_bp atgtctgtgacaaaaagtactgagggtccccagggagccgttgccatcaaattggacctt atgtcgcctcctgaaagtgccaagaagttggagaacaaggactctacattcttggatgaa agtccttcagagtcagcaggcttgaagaagaccaagggcataaccctgaagcttgcattg gcaaaaaccctgagagtgttccaggccttgattcacctggtgaaaggcaacatgggcaca gggatcctgggactacccctcgctgtgaagaacgcgggcatcctgatgggcccactcagt ctgctggtgatgggcttcattgcctgccactgtatgcacatcctggtcaagtgtgcccag cgcttctgtaagaggcttaacaagccctttatggactatggggacacggtgatgcatgga ctagaagccaaccccaacgcctggctccagaatcacgctcactggggaaggcatatcgtg agcttcttccttattatcacccaacttggcttctgctgtgtgtacattgtgtttttggct gataatttaaaacaggtagtggaagctgttaatagcacaaccaacaactgctattccaat gagacggtgattctgacccccaccatggactcgcgactctacatgctctccttcctgccc ttcctggtgctgctggtcctcatccggaacctcaggatcttgaccatcttctccatgctg gccaacatcagcatgctggtcagcttggtcatcatcatacagtacattacccaggaaatc ccagaccccagccggttgccactggtagcaagctggaagacctaccctctcttcttcgga acagccattttttcttttgaaagcattggtgtggttctgcctctggaaaacaagatgaag aatgcccgccacttcccagccatcctgtctttgggaatgtccatcgtcacttccctatac attggcatggcggctctgggctacctgcggtttggagatgacatcaaggccagcataagc cttaacctgcctaactgctggctgtaccagtctgtcaagcttctctacattgccggcatc ctgtgcacctatgccctgcagttctacgtccctgcagaaatcatcatcccctttgccatc tcccgggtgtcaacacgctgggcactgcctctggatctgtccattcgcctcgtcatggtc tgcctgacatgcctcctggccatcctcatcccccgcctggacctggtcatctccctggtg ggctccgtgagtggcaccgccctggccctcatcatcccaccgctcctggaggtcaccacg ttctactcagagggcatgagccccctcaccatcttcaaggacgccctgatcagcatcctg ggcttcgtgggctttgtggtggggacctaccaggccctggacgagctgctcaagtcagaa gactctcaccccttttccaactccaccacttttgttcggagtgaagctgcagaccttccc ggaatgaagctgcagaccttcgcagagagtgttacagctcataaaggcagtgtggaccca aaaaactcaggagcccagctggcttcacccagtggatcccacaccggggatgcaggtgga gctacctgccagtcctgcgccgtgggcccgcactcctcagcccttgggtggtcgatggga ctgggcgccgtggagcagggggcagcgctcatcggggaggctctggccgcacaggagccc atgaagcggcccctccagattgtaaattccatgagaacagagattgtaatgtcattgctt ggaagggactacaacagtgagctgaactccttggacaacggacctcagtcaccctcagag agcagcagtagcattacttcagagaatgtccatcctgctggagaagctggactatcgatg atgcaaactttgatccacttgttgaaatgcaacattggcacagggctcctggggcttccc ctggccataaagaatgccggcttgttggtcggtcctgtcagccttctggccatcggggtc ctcaccgtgcactgcatggtcatcctgttgaactgtgctcaacacctcagccagagactg cagaagacttttgtgaactatggagaggccacgatgtacggccttgaaacctgcccgaac acctggctgagggcccatgcagtgtggggaaggtacactgtcagcttcttattagtcatc acccagctgggcttctgcagtgtttattttatgtttatggcagacaatttacaacagatg gtggaaaaagcccacgtgacctccaacatctgccagcccagggagattctgacgctgacc cccatcctggacattcgtttctacatgctgataatcctgcccttcctgatcctgttggtg tttatccagaacctcaaggtgctgtccgtcttctcgacattggccaacatcaccaccctt gggagcatggctctgatctttgagtatatcatggaggggattccatatcccagcaaccta cccttgatggcaaactggaagaccttcttgctgttctttggtacagccatcttcacattt gaaggcgtcggtatggttctgcctctcaaaaaccagatgaagcatccacagcagttttct tttgttctgtacttggggatgtccattgtcatcatcctctatatcttactggggacactg ggctacatgaagtttgggtcagacacccaggccagcatcaccctcaacttgcccaattgc tggttgtaccagtcagtcaagctgatgtactctatcggcatcttcttcacctatgccctc cagttccacgtcccagctgagatcatcatcccgtttgccatctcccaagtgtcagagagc tgggcactgtttgtagacctgtctgtccgctcagccttggtctgtctaacctgtgtctca gccatcctcatcccccgcctggacttggtcatctccctggtaggctccgtgagcagcagc gccctggctctcatcatcccagccctcctggagatcgtcatcttttactctgaggacatg agctgtgtcaccattgccaaggacatcatgattagcatcgtgggccttttagggtgtata tttgggacataccaagccctctatgagttgccccaacccatcagccattccatggccaac tccacaggtgtccatgcataa >gi568815593r:151216820_151447460|GENSCAN_predicted_peptide_5|133_aa MVLTQMSLVSSSTLPLIILCLLEMTTYYSECMSSLIITKDALISILGFVGFLKLVTKSLV SLLIEPIKEIKKEIVDVTKKMGSEGFQDMDLGEIQEQIDTMPEELTKDDWMEMSASQPVS AEEEEEAAAVSEN >gi568815593r:151216820_151447460|GENSCAN_predicted_CDS_5|402_bp atggtccttacccagatgagcttggtgagcagcagcaccctgcccctcatcatcctgtgc ctcctggagatgactacctactactcagagtgcatgagctccctcatcatcaccaaggac gccctgatcagcatcctgggctttgtgggatttttgaagcttgttaccaagagtttggtt tccctgctgattgagccaatcaaggaaataaaaaaagagattgtggatgtgacaaaaaag atggggagtgaagggtttcaagatatggatcttggagaaattcaagagcaaatagacacc atgccagaggaattaacaaaagatgactggatggagatgagtgcttctcaaccggtgtca gctgaggaggaagaagaagcagcagcagtgtcagaaaactaa >gi568815593r:151216820_151447460|GENSCAN_predicted_peptide_6|270_aa MSIDLIQDEVGSPLQTDGGHCERKASGKTGNGSTRTGSSRPTASRTQQNKGKGCEESPRD GAAPDAASIGNAPDFARREVSVDREKQLALRHQPHDHVTTSAILQDLSLAGKDELAMAVW KELMVAAVSKVNPLGGYSTTMLKMAVLMPSPGALPVFALPVTGGTDVTPKESLLTVIHMV LTEHDPFKHSADSELKALVCMALNEQRLMSWVNLICKCGSLIETHYQPGSYIAHTGFESA LNLHSRLSSLKFSLPVDLAMRQLKNIKDAF >gi568815593r:151216820_151447460|GENSCAN_predicted_CDS_6|813_bp atgtccatcgacctcatccaagatgaagtgggaagccccttgcagacagatggtggacat tgtgagcgcaaggcctctgggaagacaggaaatggttccaccagaactggcagcagcaga cccactgcctccaggacacagcaaaacaaaggcaagggatgtgaagaaagtccaagggac ggggctgcacctgatgccgcaagcattggcaatgctccagattttgctaggcgggaggtg tcagtggacagagagaagcagctagccttgaggcaccagccacacgaccatgtcaccacc tctgccatcctccaggacctctctctggcaggcaaggatgagctggccatggctgtgtgg aaggagctgatggtggctgcagtcagcaaggtgaaccccttggggggttacagtactacc atgctaaaaatggcagtgcttatgcccagcccgggagctctcccagtctttgcccttcct gttacaggaggcactgatgtaacccccaaagagagcctactgacagtcatccacatggtg ctgacagagcacgaccctttcaagcacagtgcagactctgaattgaaggctttggtgtgc atggcactgaatgagcagcgtctgatgtcctgggtgaacctcatctgcaagtgcgggtca ctcatcgagactcactaccagcctgggagctacatagcacacacaggctttgagagtgcc ctcaacctgcacagtcgcctcagcagcctcaagttcagcctccctgtagatctggccatg cgccagctcaaaaatatcaaagatgccttttga >gi568815593r:151216820_151447460|GENSCAN_predicted_peptide_7|119_aa MAYYAAIKKDEFMSFVGTWMKLETIILSKLLQGQKTKHRMFSLNGHGFPGPLNRNLRKRR KKSLLVSVRGSFLIPSALHRLYQSVELLYLGGICLTYPLQFYVSAKIIVPVTVSWVCKC >gi568815593r:151216820_151447460|GENSCAN_predicted_CDS_7|360_bp atggcatactatgcagccataaaaaaggatgagttcatgtcctttgtagggacatggatg aagctggaaaccattattctcagcaaactattgcaaggacaaaaaaccaaacaccgcatg ttctcactcaatgggcatggcttcccaggtcccttaaataggaatttgcgcaagagaaga aaaaagagcctgctggtgtctgtgcgtggatctttcctgatcccctctgccttgcacagg ttgtaccagtctgttgagctcctctacttgggtggcatctgccttacctaccccctgcag ttctatgtctctgccaagatcattgtgcctgtcactgtttcctgggtgtgcaagtgctga >gi568815593r:151216820_151447460|GENSCAN_predicted_peptide_8|137_aa MGWGIYLEEMLEDEDEASQREDFVQHLESRQRTEALEEHQQHQQGKCEHCVDVTSTVGLG YSCYVLAKRLAAFCPQPRDLWNFKLERHDLGYLVEEISNSKALKSDFPMTRCCFPETMHT AGLPANDGDDRACRPLE >gi568815593r:151216820_151447460|GENSCAN_predicted_CDS_8|414_bp atggggtggggaatctacctggaggagatgctggaggatgaggacgaggcttcccagcgt gaagactttgttcagcatctagaaagtaggcagcgcacagaggctctggaggaacaccag cagcaccagcaagggaaatgtgaacactgtgtagatgtgacatccacggtgggcctaggt tactcttgctatgttttagcaaagagactggcagcattttgcccccaacctagagatctg tggaactttaaacttgagagacatgatttagggtatctggtggaagaaatttctaacagc aaagcactcaagagtgattttcctatgacacgctgctgcttcccagagaccatgcacact gctggtcttcctgctaatgatggggatgatcgtgcatgccgccccttggagtaa >gi568815593r:151216820_151447460|GENSCAN_predicted_peptide_9|90_aa MAAPHPSHAQYVELNSEVSTCLGLWVPPPAASPWMMQPKEYPASPEGGFSNSSTRSSCCQ DRASRTSWLQTLIYIVKGNIDLQLLSLPQP >gi568815593r:151216820_151447460|GENSCAN_predicted_CDS_9|273_bp atggctgctccacacccctcacatgcccaatacgtggagctaaacagtgaggtatccacc tgcctgggcctgtgggtgcctccaccagctgccagtccctggatgatgcagcccaaagaa taccctgccagccccgagggagggttcagcaactcctccacccggagctcctgctgccag gacagggccagcaggacctcgtggttgcaaaccctgatctacatagtaaaaggcaacatt gacttgcagctcctgagcctgccccaaccatga