GENSCAN 1.0    Date run: 8-Nov-116    Time: 13:33:25

Sequence gi568815578f:46625040_46833831 : 208792 bp : 45.83% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  12840  13035  196  1  1  112  100    59 0.647   7.97
 1.02 Term +  13433  13493   61  2  1   78   48    61 0.394  -1.62
 1.03 PlyA +  14422  14427    6                              -0.45

 2.04 PlyA -  15525  15520    6                               1.05
 2.03 Term -  15896  15747  150  2  0   38   37   130 0.010   0.81
 2.02 Intr -  26451  26272  180  0  0   98   92   319 0.917  33.36
 2.01 Init -  27148  27113   36  1  0   64   67    14 0.246  -3.09
 2.00 Prom -  30683  30644   40                              -0.96

 3.00 Prom +  37809  37848   40                              -3.16
 3.01 Init +  39836  39907   72  1  0   96   19    67 0.155   1.67
 3.02 Term +  55354  55512  159  0  0   38   48   149 0.170   3.84
 3.03 PlyA +  56293  56298    6                               1.05

 4.03 PlyA -  56662  56657    6                               1.05
 4.02 Term -  62192  61714  479  0  2   73   43   334 0.986  22.40
 4.01 Init -  64375  64093  283  1  1  106   95   433 0.615  43.00
 4.00 Prom -  66057  66018   40                              -6.06

 5.00 Prom +  71093  71132   40                              -6.06
 5.01 Init +  72997  73033   37  0  1   94   81    16 0.093   1.68
 5.02 Intr +  84516  84701  186  1  0  116  106    20 0.443   6.26
 5.03 Intr +  89260  89331   72  2  0   79   59    45 0.106   0.08
 5.04 Intr + 100002 101285 1284  1  0  120  111   798 0.876  72.60
 5.05 Intr + 101825 101947  123  0  0   67   97   157 0.967  15.06
 5.06 Intr + 104314 104449  136  2  1  110   70   144 0.998  14.43
 5.07 Term + 108717 108795   79  0  1  112   55   114 0.995   7.74
 5.08 PlyA + 111112 111117    6                               1.05

 6.04 PlyA - 111142 111137    6                               1.05
 6.03 Term - 119016 118531  486  0  0   76   41   253 0.810  14.10
 6.02 Intr - 121799 121678  122  0  2    3   68    90 0.252  -1.29
 6.01 Init - 127030 126604  427  1  1   57   53   238 0.325  13.67
 6.00 Prom - 132844 132805   40                              -4.06

 7.06 PlyA - 133447 133442    6                               1.05
 7.05 Term - 138154 138062   93  1  0   73   34    93 0.478   0.23
 7.04 Intr - 139837 139799   39  1  0  112   98   -16 0.295   0.22
 7.03 Intr - 143033 142967   67  1  1   43   99    88 0.537   4.31
 7.02 Intr - 145913 145822   92  2  2   40   62    64 0.167  -2.21
 7.01 Init - 148210 148151   60  1  0   90   80    23 0.492   2.97
 7.00 Prom - 150614 150575   40                              -4.56

 8.00 Prom + 153878 153917   40                              -4.46
 8.01 Init + 155470 155575  106  0  1   94   76    18 0.400   1.58
 8.02 Intr + 158789 158973  185  0  2   84   98    97 0.975   9.81
 8.03 Intr + 160527 160577   51  2  0  101   55    33 0.103   0.40
 8.04 Intr + 167504 167746  243  1  0   89   52    95 0.082   3.69
 8.05 Term + 168895 168954   60  0  0   74   42    42 0.050  -4.00
 8.06 PlyA + 169147 169152    6                               1.05

 9.06 PlyA - 169380 169375    6                               1.05
 9.05 Term - 170661 170510  152  1  2   87   47    67 0.179   0.57
 9.04 Intr - 176678 176578  101  1  2   77   81    86 0.618   6.55
 9.03 Intr - 185144 185102   43  0  1  126   80    10 0.565   1.30
 9.02 Intr - 186617 186448  170  1  2   87   44   134 0.057   8.49
 9.01 Init - 203117 203044   74  2  2   64   98    34 0.043   2.54

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  98179  98224   46  0  1   91   94    17 0.937   3.68

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578f:46625040_46833831|GENSCAN_predicted_peptide_1|85_aa
XGDSGGHTTPGEAGSFSRKGKPGSGEVQVPYQEKSQEQEAVDVWFSGGENVLSEQASVSP
INPKESKGKAAVLTAADRGAVSSVQ

>gi568815578f:46625040_46833831|GENSCAN_predicted_CDS_1|258_bp
ngaggtgattcagggggacataccaccccaggtgaggcaggctccttctctaggaaggga
aagccagggtctggagaagtacaggttccatatcaagaaaaaagtcaggagcaggaagcg
gttgatgtctggtttagtgggggtgagaatgtcctctcagaacaggcctcagtctcccct
atcaaccccaaagaaagcaagggaaaggcggctgtcctgactgctgcggacagaggggcc
gtcagctccgtccagtga

>gi568815578f:46625040_46833831|GENSCAN_predicted_peptide_2|121_aa
MGICCTDYFVIQALSRRRGQAGQSRAGPYRQAIALMAALAAAAKKVWSARRLLVLLFTPL
ALLPVVFALPPKVCKQAGVTSSDAAGSKIVFPDPTQQDSALWTDEQNSLTPPDLEDTKSA
P

>gi568815578f:46625040_46833831|GENSCAN_predicted_CDS_2|366_bp
atggggatttgttgcacagattatttcgtcatccaggctttaagccggcgccggggccag
gcggggcagtcccgggccggcccgtaccgccaggcgatcgcgctgatggcggcgctggca
gcagcggccaagaaggtgtggagcgcgcggcggctgctggtgctgctgttcacgccgctc
gcgctgctgccggtggtcttcgccctcccgcccaaggtgtgtaaacaggctggcgtgact
tccagtgatgcagcagggagcaagatcgtgtttcctgacccaactcaacaggacagtgct
ctctggactgatgagcagaattctctgacaccgccagaccttgaagacacgaagtcagca
ccctag

>gi568815578f:46625040_46833831|GENSCAN_predicted_peptide_3|76_aa
MTGLSIIKELECRDKVTASQQGLKCSNCSTYLLIWMLVVDMDVRPRSLKEAPIAPEAGRV
VSSQLLWSAPPGINSQ

>gi568815578f:46625040_46833831|GENSCAN_predicted_CDS_3|231_bp
atgacaggtcttagcatcatcaaggaactggagtgcagagacaaggtcacagccagccag
caaggactcaaatgctccaactgctccacgtacctccttatatggatgcttgtggtggac
atggatgtgcgacccagatcccttaaggaagcacctattgccccagaggcagggagagtg
gtcagcagtcagctcctgtggtcagctcctccagggatcaactcacagtag

>gi568815578f:46625040_46833831|GENSCAN_predicted_peptide_4|253_aa
MAAARATTPADGEEPAPEAEALAAARERSSRFLSGLELVKQGAEARVFRGRFQGRAAVIK
HRFPKGYRHPALEARLGRRRTVQEARALLRCRRAGISAPVVFFVDYASNCLYMEEIEGSV
TVRDYIQSTMETEKTPQGLSNLAKTIGQVLARMHDEDLIHGDLTTSNMLLKPPLEQLNIV
LIDFGLSFISALPEDKGVDLYVLEKAFLSTHPNTETVFEAFLKSYSTSSKKARPVLKKLD
EVRLRGRKRSMVG

>gi568815578f:46625040_46833831|GENSCAN_predicted_CDS_4|762_bp
atggcggcggccagagctactacgccggccgatggcgaggagcccgccccggaggctgag
gctctggccgcagcccgggagcggagcagccgcttcttgagcggcctggagctggtgaag
cagggtgccgaggcgcgcgtgttccgtggccgcttccagggccgcgcggcggtgatcaag
caccgcttccccaagggctaccggcacccggcgctggaggcgcggcttggcagacggcgg
acggtgcaggaggcccgggcgctcctccgctgtcgccgcgctggaatatctgccccagtt
gtcttttttgtggactatgcttccaactgcttatatatggaagaaattgaaggctcagtg
actgttcgagattatattcagtccactatggagactgaaaaaactccccagggtctctcc
aacttagccaagacaattgggcaggttttggctcgaatgcacgatgaagacctcattcat
ggtgatctcaccacctccaacatgctcctgaaaccccccctggaacagctgaacattgtg
ctcatagactttgggctgagtttcatttcagcacttccagaggataagggagtagacctc
tatgtcctggagaaggccttcctcagtacccatcccaacactgaaactgtgtttgaagcc
tttctgaagagctactccacctcctccaaaaaggccaggccagtgctaaaaaaattagat
gaagtgcgcctgagaggaagaaagaggtccatggttgggtag

>gi568815578f:46625040_46833831|GENSCAN_predicted_peptide_5|638_aa
MAEGERGADVPHGLGAWLADVALAALRAGGQGRRDRGGGGPESLSGGSGVGDSGGGCAPG
PSAPPARRRVPLAMGPRNLLIDWIWIMDTTLGLGTEGGGHSPPVLPLCASVSLLGGLTFG
YELAVISGALLPLQLDFGLSCLEQEFLVGSLLLGALLASLVGGFLIDCYGRKQAILGSNL
VLLAGSLTLGLAGSLAWLVLGRAVVGFAISLSSMACCIYVSELVGPRQRGVLVSLYEAGI
TVGILLSYALNYALAGTPWGWRHMFGWATAPAVLQSLSLLFLPAGTDETATHKDLIPLQG
GEAPKLGPGRPRYSFLDLFRARDNMRGRTTVGLGLVLFQQLTGQPNVLCYASTIFSSVGF
HGGSSAVLASVGLGAVKVAATLTAMGLVDRAGRRALLLAGCALMALSVSGIGLVSFAVPM
DSGPSCLAVPNATGQTGLPGDSGLLQDSSLPPIPRTNEDQREPILSTAKKTKPHPRSGDP
SAPPRLALSSALPGPPLPARGHALLRWTALLCLMVFVSAFSFGFGPVTWLVLSEIYPVEI
RGRAFAFCNSFNWAANLFISLSFLDLIGTIGLSWTFLLYGLTAVLGLGFIYLFVPETKGQ
SLAEIDQQFQKRRFTLSFGHRQNSTGIPYSRIEISAAS

>gi568815578f:46625040_46833831|GENSCAN_predicted_CDS_5|1917_bp
atggcagaaggtgaaaggggagcagacgtgccacatggcctcggggcctggctggccgac
gtggcgttggcggcgctgcgcgcgggagggcagggcaggagggacagaggcgggggcggg
ccggaaagtttgtccggcggcagcggcgttggggactccggcgggggatgcgcgcccggc
ccctcagcgcccccagcacgccgccgagtcccgctcgccatggggcccaggaatttgctg
attgattggatctggatcatggacaccaccctggggctgggcactgagggtggaggccac
tccccacctgtcctgcctttgtgtgcctctgtgtctttgctgggtggcctgacctttggt
tatgaactggcagtcatatcaggtgccctgctgccactgcagcttgactttgggctaagc
tgcttggagcaggagttcctggtgggcagcctgctcctgggggctctcctcgcctccctg
gttggtggcttcctcattgactgctatggcaggaagcaagccatcctcgggagcaacttg
gtgctgctggcaggcagcctgaccctgggcctggctggttccctggcctggctggtcctg
ggccgcgctgtggttggcttcgccatttccctctcctccatggcttgctgtatctacgtg
tcagagctggtggggccacggcagcggggagtgctggtgtccctctatgaggcaggcatc
accgtgggcatcctgctctcctatgccctcaactatgcactggctggtaccccctgggga
tggaggcacatgttcggctgggccactgcacctgctgtcctgcaatccctcagcctcctc
ttcctccctgctggtacagatgagactgcaacacacaaggacctcatcccactccaggga
ggtgaggcccccaagctgggcccggggaggccacggtactcctttctggacctcttcagg
gcacgcgataacatgcgaggccggaccacagtgggcctggggctggtgctcttccagcaa
ctaacagggcagcccaacgtgctgtgctatgcctccaccatcttcagctccgttggtttc
catgggggatcctcagccgtgctggcctctgtggggcttggcgcagtgaaggtggcagct
accctgaccgccatggggctggtggaccgtgcaggccgcagggctctgttgctagctggc
tgtgccctcatggccctgtccgtcagtggcataggcctcgtcagctttgccgtgcccatg
gactcaggcccaagctgtctggctgtgcccaatgccaccgggcagacaggcctccctgga
gactctggcctgctgcaggactcctctctacctcccattccaaggaccaatgaggaccaa
agggagccaatcttgtccactgctaagaaaaccaagccccatcccagatctggagacccc
tcagcccctcctcggctggccctgagctctgccctccctgggccccctctgcccgctcgg
gggcatgcactgctgcgctggaccgcactgctgtgcctgatggtctttgtcagtgccttc
tcctttgggtttgggccagtgacctggcttgtcctcagcgagatctaccctgtggagata
cgaggaagagccttcgccttctgcaacagcttcaactgggcggccaacctcttcatcagc
ctctccttcctcgatctcattggcaccatcggcttgtcctggaccttcctgctctacgga
ctgaccgctgtcctcggcctgggcttcatctatttatttgttcctgaaacaaaaggccag
tcgttggcagagatagaccagcagttccagaagagacggttcaccctgagctttggccac
aggcagaactccactggcatcccgtacagccgcatcgagatctctgcggcctcctga

>gi568815578f:46625040_46833831|GENSCAN_predicted_peptide_6|344_aa
MGGVPEEQIRKSYQLRGEWKWMASKQQMPASVTPPLTESLWDSLKCMSDTVRKRQENEKK
TWMFSIFELMQAGDNSCQSCKAEHPEKWNRRIQAPVEGINESRGVWMGLFLSSVWLKLQT
PGVTQWVASTLPSTSAGIAKSNGSDGDDGDGGDDEKGQILQSLQDLLTDWMWECKRGVRG
DCELLSQFSFVPENPMILDQVMTKTTNKTPGEHEGRATLQGTSRSDSLSGSVVTSDSPIW
GGVQAELDTGCSVATGNVQSDFQHLFPLLLPTCLMVSTSSPLSHRVWGDWPLYWVEQCHK
LHVYLAPVNVTLFGNKILADVIKSHWIRVGPPSNDLGLYKKREM

>gi568815578f:46625040_46833831|GENSCAN_predicted_CDS_6|1035_bp
atgggtggggtccctgaggagcagatccgtaagagctatcagcttagaggcgaatggaag
tggatggcgtccaagcaacagatgcccgcttctgtgactccaccccttacggaaagtcta
tgggactctctgaaatgtatgagtgatactgttagaaagcggcaagaaaatgaaaagaaa
acgtggatgttttccatatttgagttgatgcaggctggagacaacagctgccagagctgc
aaggcagaacaccctgagaagtggaaccggaggatccaggcgcctgttgaagggataaat
gagagccgtggcgtgtggatgggactctttctgtcttccgtttggctgaagctccaaacg
ccaggagtaacccagtgggtagcaagcacccttccctccacctctgcaggaattgcaaaa
agcaatgggagtgatggtgatgatggtgatggaggagatgatgagaaaggtcaaattctg
cagagcctgcaggatttgctaacagattggatgtgggagtgcaaaagaggagttaggggt
gactgcgagctcctctcgcagttcagttttgtgcccgagaaccccatgatcctcgatcag
gtcatgaccaagaccacaaataaaacccctggtgaacacgagggcagggccacactgcag
ggcaccagccggagtgacagcctgtctggctcggtagtgactagtgactcccccatctgg
gggggtgtgcaagcggagctggacacaggatgcagtgtggccacgggaaatgtgcagagt
gatttccagcatctctttcctcttctcttgccaacttgccttatggtgtccacttcttcc
ccactcagccatagggtctggggtgattggcccctgtactgggttgaacagtgtcataaa
cttcatgtctacctggcccctgtgaatgtgactttatttggaaataagattttggcagat
gtaatcaagtcacactggattcgggtgggccctccatccaatgacttgggtctttataag
aagagggaaatgtaa

>gi568815578f:46625040_46833831|GENSCAN_predicted_peptide_7|116_aa
MPLSNEQEHLSASFSTCGVQIFPTQQAYFCCRDFVCAVLVTWNILFLTFLRSLSAKRNQS
TPTHGSLLVEPMQRTTLQLKITCSGKVLGRYVNCISSCELLSKDRKALLWITFCAF

>gi568815578f:46625040_46833831|GENSCAN_predicted_CDS_7|351_bp
atgcctctgagcaatgagcaagagcatctctccgccagcttttccacctgcggtgtccag
atctttccaacacagcaggcatacttctgctgcagggacttcgtctgtgctgttctggtc
acctggaatatacttttcctcaccttcctcaggtcactatcggccaaacgcaaccagagc
acacccacccatgggagcctgttggtggagcccatgcagaggacaactctccagttaaaa
atcacttgctcagggaaggtactgggtcgctatgtcaactgtatttcttcttgtgagctg
ctgtcaaaagatcggaaagctctgctctggatcaccttctgtgccttctga

>gi568815578f:46625040_46833831|GENSCAN_predicted_peptide_8|214_aa
MERTIPPFPWGGTAQEGFGLDTRARGLRFQEACSTVSIPSAISINGFPSPISINPKVFAM
TSRAPNKPKRFCPLTLLSHPRSIHPTPVTSQTTFPNTAPGATGGLFTLAAVAPASTLPSE
LPMATCFSSFSSGSDVTTSEKALLTTLSKMHFAPQTSKFSLYSFPPWHQHYLIIFLSVAS
GVLVDDLSTPTGGKRLLKCHPVKKIPLTTHKNMQ

>gi568815578f:46625040_46833831|GENSCAN_predicted_CDS_8|645_bp
atggagaggaccattcccccttttccatggggtggcactgcccaagagggcttcgggcta
gatacccgggcgcgcggcttgcgttttcaggaagcttgtagcacagtcagcatcccatct
gcaatcagcatcaatggcttcccatctccaatcagcatcaatcctaaggtctttgccatg
acttctagggctccaaataaaccaaagcgcttttgtccattgaccctactgagccatcct
cgttccatccaccccaccccagttaccagtcagacaacatttcctaacacagctccagga
gccaccgggggcttgttcacgcttgctgctgtggctccagcaagcacccttccttcagag
cttcccatggctacctgcttctcgtccttcagctctgggtcagacgtcaccacctcagag
aaggccttgttgaccaccttgtctaagatgcactttgcccctcagacatctaaattttca
ctctactcatttcctccatggcatcagcattatctgatcattttcttgtcagttgcatct
ggcgtgcttgttgatgatctgtccaccccaaccggaggaaaaagacttctcaaatgtcac
cctgtcaagaagatccctctgaccacccataagaacatgcagtaa

>gi568815578f:46625040_46833831|GENSCAN_predicted_peptide_9|179_aa
MAGNQHDMSLSSWCAQSRGVQALKSPRRRLWCLLSAPRRHLRRAGVPGPLSNLGAANTGP
IVAPASPAGSCARAAICERGSEQADRAMFRQHLTNSFLPHGCKAAAITPNITPDVEAERK
QEEMKSNDSYPAPYMCPITRIYSLHHLTITTNRFLNGAVIGRLDETICQCADSPCIAGH

>gi568815578f:46625040_46833831|GENSCAN_predicted_CDS_9|540_bp
atggctggaaaccagcacgacatgtctttgtcctcgtggtgtgcacagtctagaggtgtt
caggcactcaaaagtccccggaggagactttggtgcctgctgtccgccccccggcgccat
ctgcgccgcgcaggtgtgcccggccccctgtccaacctgggggctgcgaacactgggccg
attgtggcccccgcgagcccggcgggcagctgcgcccgagctgccatctgcgagcgcggc
tctgagcaggcagatcgtgccatgttccggcagcatttgacaaacagcttcttgcctcat
ggttgcaaggcggctgctataactccaaacatcacacctgatgtcgaggcagaaagaaag
caggaggaaatgaagagtaatgacagttaccctgcaccgtacatgtgcccaattacgagg
atttacagcttacatcatctaactataaccaccaataggtttctcaatggggctgtcatt
ggccgtttggatgaaacaatttgtcagtgtgctgacagtccatgcattgcaggacattga