GENSCAN 1.0 Date run: 5-Nov-116 Time: 20:01:36 Sequence gi568815578r:46586756_46789414 : 202659 bp : 45.72% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.12 PlyA - 204 199 6 1.05 1.11 Term - 1408 1300 109 0 1 102 53 88 0.716 4.68 1.10 Intr - 2500 2405 96 0 0 115 55 99 0.936 8.42 1.09 Intr - 5774 5649 126 1 0 89 82 61 0.890 5.39 1.08 Intr - 9533 9402 132 1 0 40 52 189 0.418 10.26 1.07 Intr - 13282 13216 67 2 1 79 94 82 0.963 5.86 1.06 Intr - 13866 13833 34 0 1 53 115 28 0.775 -0.20 1.05 Intr - 18329 18274 56 0 2 51 93 64 0.526 1.90 1.04 Intr - 23854 23691 164 0 2 89 94 208 0.975 21.12 1.03 Intr - 26970 26705 266 0 2 109 110 526 0.716 53.21 1.02 Intr - 28914 28736 179 1 2 43 -4 181 0.143 4.04 1.01 Init - 40267 40144 124 1 1 65 57 122 0.722 7.13 1.00 Prom - 42365 42326 40 -5.06 2.05 PlyA - 42743 42738 6 1.05 2.04 Term - 53546 53455 92 0 2 117 49 51 0.024 1.98 2.03 Intr - 56066 56012 55 2 1 96 52 55 0.041 1.25 2.02 Intr - 64735 64556 180 1 0 98 92 319 0.919 33.36 2.01 Init - 65432 65397 36 2 0 64 67 14 0.246 -3.09 2.00 Prom - 68967 68928 40 -0.96 3.00 Prom + 76093 76132 40 -3.16 3.01 Init + 78120 78191 72 2 0 96 19 67 0.155 1.67 3.02 Term + 93638 93796 159 1 0 38 48 149 0.170 3.84 3.03 PlyA + 94577 94582 6 1.05 4.03 PlyA - 94946 94941 6 1.05 4.02 Term - 100476 99998 479 1 2 73 43 334 0.986 22.40 4.01 Init - 102659 102377 283 2 1 106 95 433 0.615 43.00 4.00 Prom - 104341 104302 40 -6.06 5.00 Prom + 109377 109416 40 -6.06 5.01 Init + 111281 111317 37 1 1 94 81 16 0.093 1.68 5.02 Intr + 122800 122985 186 2 0 116 106 20 0.443 6.26 5.03 Intr + 127544 127615 72 0 0 79 59 45 0.106 0.08 5.04 Intr + 138286 139569 1284 2 0 120 111 798 0.876 72.60 5.05 Intr + 140109 140231 123 1 0 67 97 157 0.967 15.06 5.06 Intr + 142598 142733 136 0 1 110 70 144 0.998 14.43 5.07 Term + 147001 147079 79 1 1 112 55 114 0.995 7.74 5.08 PlyA + 149396 149401 6 1.05 6.04 PlyA - 149426 149421 6 1.05 6.03 Term - 157300 156815 486 1 0 76 41 253 0.810 14.10 6.02 Intr - 160083 159962 122 1 2 3 68 90 0.252 -1.29 6.01 Init - 165314 164888 427 2 1 57 53 238 0.325 13.67 6.00 Prom - 171128 171089 40 -4.06 7.07 PlyA - 171731 171726 6 1.05 7.06 Term - 176438 176346 93 2 0 73 34 93 0.478 0.23 7.05 Intr - 178121 178083 39 2 0 112 98 -16 0.295 0.22 7.04 Intr - 181317 181251 67 2 1 43 99 88 0.536 4.31 7.03 Intr - 189345 189223 123 2 0 102 40 44 0.013 0.70 7.02 Intr - 195241 195185 57 0 0 107 101 6 0.007 1.80 7.01 Init - 197145 197063 83 0 2 77 60 70 0.005 3.54 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 136463 136508 46 1 1 91 94 17 0.937 3.68 S.002 Intr + 197073 197257 185 1 2 84 98 97 0.983 9.81 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:46586756_46789414|GENSCAN_predicted_peptide_1|450_aa MKPDNVQQTTMNSVIRDGLPGGETFEVRAEQQGASQPEIEGVNDNPTFPETNNLELPLVS LFASHITTDLSAKLSDVILEIDREFNYFSLPTTASNGPWWPEGRCLFVILLMAVYWCTEA LPLSVTALLPIVLFPFMGILPSNKVCPQYFLDTNFLFLSGLIMASAIEEWNLHRRIALKI LMLVGVQPARLILGMMVTTSFLSMWLSNTASTAMMLPIANAILKSLFGQKEVRKDPSQES EENTGLFNSDAEQTGPGEGCGKLLAFHNFGNGCSAAVRRNGLHTVPTEMQFLASTEAKED EYRRNIWKGFLISIPYSASIGGTATLTGTAPNLILLGQLKSFFPQCDVVNFGSWFIFAFP LMLLFLLAGWLWISFLYGGLSFRGWRKNKSEIRTNAEDRARAVIREEYQNLGPIKFAEQA VFILFCMFAILLFTRDPKFIPGWASLFNPG >gi568815578r:46586756_46789414|GENSCAN_predicted_CDS_1|1353_bp atgaagccagataatgtgcagcagaccacaatgaacagtgtgatcagagatggacttcct ggaggggagacctttgaagtgagggctgaacagcaaggagccagccagccagagatagaa ggggtaaatgacaaccccaccttccctgagaccaacaacttggaattgcccttggtttct ctctttgcctcacatatcacaacagatctatcagccaagctgtcagacgtgatcttggaa attgatcgagaattcaactacttctcactacctacaactgccagcaatgggccctggtgg ccggaaggccgctgcttgtttgtcatcctgctcatggcggtgtactggtgcacggaggcc ctgccgctctcagtgacggcgctgctgcccatcgtcctcttccccttcatgggcatcttg ccctccaacaaggtctgcccccagtacttcctcgacaccaacttcctcttcctcagtggg ctgatcatggccagcgccattgaggagtggaacctgcaccggcgaatcgccctcaagatc ctgatgcttgttggagtccagccggccaggctcatcctggggatgatggtgaccacctcg ttcttgtccatgtggctgagcaacaccgcctccactgccatgatgcttcccattgccaat gccatcctgaaaagtctctttggccagaaggaggttcgaaaggaccccagccaggagagt gaagagaacacagggctcttcaactcagatgcagagcaaacggggccaggcgagggctgt ggcaaactgctggcctttcacaactttggcaatggctgctcagctgctgtgcggagaaac ggcctacacactgtgcccacggagatgcagtttctcgccagcacagaagcgaaggaggat gaatatcgtcggaacatctggaagggcttcctcatctccatcccctactcagccagtatt gggggcacagccacactcacgggcacagcccctaacctcatcctgcttggccagctcaag agtttctttccgcagtgtgacgtggtgaatttcggctcctggttcattttcgccttccct cttatgctgttgttcctgttggcaggctggctctggatctccttcctgtacgggggactg agcttcaggggctggaggaagaataaatctgagataagaaccaatgcagaagatagggct cgagctgtaattcgggaagaataccagaacctggggcccatcaagtttgccgaacaggct gttttcatccttttctgcatgtttgccatcctcctcttcacccgggacccgaagttcatc cctggctgggccagcctcttcaatcctgggtga >gi568815578r:46586756_46789414|GENSCAN_predicted_peptide_2|120_aa MGICCTDYFVIQALSRRRGQAGQSRAGPYRQAIALMAALAAAAKKVWSARRLLVLLFTPL ALLPVVFALPPKKVADPGHSGNNQRLIREPGPYQPSAQREDFDFHCTQLPYKKVPTPGRL >gi568815578r:46586756_46789414|GENSCAN_predicted_CDS_2|363_bp atggggatttgttgcacagattatttcgtcatccaggctttaagccggcgccggggccag gcggggcagtcccgggccggcccgtaccgccaggcgatcgcgctgatggcggcgctggca gcagcggccaagaaggtgtggagcgcgcggcggctgctggtgctgctgttcacgccgctc gcgctgctgccggtggtcttcgccctcccgcccaagaaagttgctgaccctggacactcc gggaataatcagaggctaattcgagagccaggaccttaccagccctctgcgcagagagag gacttcgatttccactgtactcagctcccctacaagaaagtgccgacacctgggaggctg tag >gi568815578r:46586756_46789414|GENSCAN_predicted_peptide_3|76_aa MTGLSIIKELECRDKVTASQQGLKCSNCSTYLLIWMLVVDMDVRPRSLKEAPIAPEAGRV VSSQLLWSAPPGINSQ >gi568815578r:46586756_46789414|GENSCAN_predicted_CDS_3|231_bp atgacaggtcttagcatcatcaaggaactggagtgcagagacaaggtcacagccagccag caaggactcaaatgctccaactgctccacgtacctccttatatggatgcttgtggtggac atggatgtgcgacccagatcccttaaggaagcacctattgccccagaggcagggagagtg gtcagcagtcagctcctgtggtcagctcctccagggatcaactcacagtag >gi568815578r:46586756_46789414|GENSCAN_predicted_peptide_4|253_aa MAAARATTPADGEEPAPEAEALAAARERSSRFLSGLELVKQGAEARVFRGRFQGRAAVIK HRFPKGYRHPALEARLGRRRTVQEARALLRCRRAGISAPVVFFVDYASNCLYMEEIEGSV TVRDYIQSTMETEKTPQGLSNLAKTIGQVLARMHDEDLIHGDLTTSNMLLKPPLEQLNIV LIDFGLSFISALPEDKGVDLYVLEKAFLSTHPNTETVFEAFLKSYSTSSKKARPVLKKLD EVRLRGRKRSMVG >gi568815578r:46586756_46789414|GENSCAN_predicted_CDS_4|762_bp atggcggcggccagagctactacgccggccgatggcgaggagcccgccccggaggctgag gctctggccgcagcccgggagcggagcagccgcttcttgagcggcctggagctggtgaag cagggtgccgaggcgcgcgtgttccgtggccgcttccagggccgcgcggcggtgatcaag caccgcttccccaagggctaccggcacccggcgctggaggcgcggcttggcagacggcgg acggtgcaggaggcccgggcgctcctccgctgtcgccgcgctggaatatctgccccagtt gtcttttttgtggactatgcttccaactgcttatatatggaagaaattgaaggctcagtg actgttcgagattatattcagtccactatggagactgaaaaaactccccagggtctctcc aacttagccaagacaattgggcaggttttggctcgaatgcacgatgaagacctcattcat ggtgatctcaccacctccaacatgctcctgaaaccccccctggaacagctgaacattgtg ctcatagactttgggctgagtttcatttcagcacttccagaggataagggagtagacctc tatgtcctggagaaggccttcctcagtacccatcccaacactgaaactgtgtttgaagcc tttctgaagagctactccacctcctccaaaaaggccaggccagtgctaaaaaaattagat gaagtgcgcctgagaggaagaaagaggtccatggttgggtag >gi568815578r:46586756_46789414|GENSCAN_predicted_peptide_5|638_aa MAEGERGADVPHGLGAWLADVALAALRAGGQGRRDRGGGGPESLSGGSGVGDSGGGCAPG PSAPPARRRVPLAMGPRNLLIDWIWIMDTTLGLGTEGGGHSPPVLPLCASVSLLGGLTFG YELAVISGALLPLQLDFGLSCLEQEFLVGSLLLGALLASLVGGFLIDCYGRKQAILGSNL VLLAGSLTLGLAGSLAWLVLGRAVVGFAISLSSMACCIYVSELVGPRQRGVLVSLYEAGI TVGILLSYALNYALAGTPWGWRHMFGWATAPAVLQSLSLLFLPAGTDETATHKDLIPLQG GEAPKLGPGRPRYSFLDLFRARDNMRGRTTVGLGLVLFQQLTGQPNVLCYASTIFSSVGF HGGSSAVLASVGLGAVKVAATLTAMGLVDRAGRRALLLAGCALMALSVSGIGLVSFAVPM DSGPSCLAVPNATGQTGLPGDSGLLQDSSLPPIPRTNEDQREPILSTAKKTKPHPRSGDP SAPPRLALSSALPGPPLPARGHALLRWTALLCLMVFVSAFSFGFGPVTWLVLSEIYPVEI RGRAFAFCNSFNWAANLFISLSFLDLIGTIGLSWTFLLYGLTAVLGLGFIYLFVPETKGQ SLAEIDQQFQKRRFTLSFGHRQNSTGIPYSRIEISAAS >gi568815578r:46586756_46789414|GENSCAN_predicted_CDS_5|1917_bp atggcagaaggtgaaaggggagcagacgtgccacatggcctcggggcctggctggccgac gtggcgttggcggcgctgcgcgcgggagggcagggcaggagggacagaggcgggggcggg ccggaaagtttgtccggcggcagcggcgttggggactccggcgggggatgcgcgcccggc ccctcagcgcccccagcacgccgccgagtcccgctcgccatggggcccaggaatttgctg attgattggatctggatcatggacaccaccctggggctgggcactgagggtggaggccac tccccacctgtcctgcctttgtgtgcctctgtgtctttgctgggtggcctgacctttggt tatgaactggcagtcatatcaggtgccctgctgccactgcagcttgactttgggctaagc tgcttggagcaggagttcctggtgggcagcctgctcctgggggctctcctcgcctccctg gttggtggcttcctcattgactgctatggcaggaagcaagccatcctcgggagcaacttg gtgctgctggcaggcagcctgaccctgggcctggctggttccctggcctggctggtcctg ggccgcgctgtggttggcttcgccatttccctctcctccatggcttgctgtatctacgtg tcagagctggtggggccacggcagcggggagtgctggtgtccctctatgaggcaggcatc accgtgggcatcctgctctcctatgccctcaactatgcactggctggtaccccctgggga tggaggcacatgttcggctgggccactgcacctgctgtcctgcaatccctcagcctcctc ttcctccctgctggtacagatgagactgcaacacacaaggacctcatcccactccaggga ggtgaggcccccaagctgggcccggggaggccacggtactcctttctggacctcttcagg gcacgcgataacatgcgaggccggaccacagtgggcctggggctggtgctcttccagcaa ctaacagggcagcccaacgtgctgtgctatgcctccaccatcttcagctccgttggtttc catgggggatcctcagccgtgctggcctctgtggggcttggcgcagtgaaggtggcagct accctgaccgccatggggctggtggaccgtgcaggccgcagggctctgttgctagctggc tgtgccctcatggccctgtccgtcagtggcataggcctcgtcagctttgccgtgcccatg gactcaggcccaagctgtctggctgtgcccaatgccaccgggcagacaggcctccctgga gactctggcctgctgcaggactcctctctacctcccattccaaggaccaatgaggaccaa agggagccaatcttgtccactgctaagaaaaccaagccccatcccagatctggagacccc tcagcccctcctcggctggccctgagctctgccctccctgggccccctctgcccgctcgg gggcatgcactgctgcgctggaccgcactgctgtgcctgatggtctttgtcagtgccttc tcctttgggtttgggccagtgacctggcttgtcctcagcgagatctaccctgtggagata cgaggaagagccttcgccttctgcaacagcttcaactgggcggccaacctcttcatcagc ctctccttcctcgatctcattggcaccatcggcttgtcctggaccttcctgctctacgga ctgaccgctgtcctcggcctgggcttcatctatttatttgttcctgaaacaaaaggccag tcgttggcagagatagaccagcagttccagaagagacggttcaccctgagctttggccac aggcagaactccactggcatcccgtacagccgcatcgagatctctgcggcctcctga >gi568815578r:46586756_46789414|GENSCAN_predicted_peptide_6|344_aa MGGVPEEQIRKSYQLRGEWKWMASKQQMPASVTPPLTESLWDSLKCMSDTVRKRQENEKK TWMFSIFELMQAGDNSCQSCKAEHPEKWNRRIQAPVEGINESRGVWMGLFLSSVWLKLQT PGVTQWVASTLPSTSAGIAKSNGSDGDDGDGGDDEKGQILQSLQDLLTDWMWECKRGVRG DCELLSQFSFVPENPMILDQVMTKTTNKTPGEHEGRATLQGTSRSDSLSGSVVTSDSPIW GGVQAELDTGCSVATGNVQSDFQHLFPLLLPTCLMVSTSSPLSHRVWGDWPLYWVEQCHK LHVYLAPVNVTLFGNKILADVIKSHWIRVGPPSNDLGLYKKREM >gi568815578r:46586756_46789414|GENSCAN_predicted_CDS_6|1035_bp atgggtggggtccctgaggagcagatccgtaagagctatcagcttagaggcgaatggaag tggatggcgtccaagcaacagatgcccgcttctgtgactccaccccttacggaaagtcta tgggactctctgaaatgtatgagtgatactgttagaaagcggcaagaaaatgaaaagaaa acgtggatgttttccatatttgagttgatgcaggctggagacaacagctgccagagctgc aaggcagaacaccctgagaagtggaaccggaggatccaggcgcctgttgaagggataaat gagagccgtggcgtgtggatgggactctttctgtcttccgtttggctgaagctccaaacg ccaggagtaacccagtgggtagcaagcacccttccctccacctctgcaggaattgcaaaa agcaatgggagtgatggtgatgatggtgatggaggagatgatgagaaaggtcaaattctg cagagcctgcaggatttgctaacagattggatgtgggagtgcaaaagaggagttaggggt gactgcgagctcctctcgcagttcagttttgtgcccgagaaccccatgatcctcgatcag gtcatgaccaagaccacaaataaaacccctggtgaacacgagggcagggccacactgcag ggcaccagccggagtgacagcctgtctggctcggtagtgactagtgactcccccatctgg gggggtgtgcaagcggagctggacacaggatgcagtgtggccacgggaaatgtgcagagt gatttccagcatctctttcctcttctcttgccaacttgccttatggtgtccacttcttcc ccactcagccatagggtctggggtgattggcccctgtactgggttgaacagtgtcataaa cttcatgtctacctggcccctgtgaatgtgactttatttggaaataagattttggcagat gtaatcaagtcacactggattcgggtgggccctccatccaatgacttgggtctttataag aagagggaaatgtaa >gi568815578r:46586756_46789414|GENSCAN_predicted_peptide_7|153_aa MAKTLGLMLIGDGKPLMLIADGMLTVEGWDHLVAGKQARAPTDSTLCCTDEVGTDILILI SNDYASWSQLIGPKWTNQSPSLGMWNVTSLSAKRNQSTPTHGSLLVEPMQRTTLQLKITC SGKVLGRYVNCISSCELLSKDRKALLWITFCAF >gi568815578r:46586756_46789414|GENSCAN_predicted_CDS_7|462_bp atggcaaagaccttaggattgatgctgattggagatgggaagccattgatgctgattgca gatgggatgctgactgtagagggatgggaccatctagttgcaggaaaacaagctcgggct cccactgattctacattatgctgcacagatgaagtgggaacagatatattaatacttata tccaatgactatgcttcctggtcacagttgattggtccaaagtggaccaatcagagtcct tccctgggaatgtggaatgtgacgtcactatcggccaaacgcaaccagagcacacccacc catgggagcctgttggtggagcccatgcagaggacaactctccagttaaaaatcacttgc tcagggaaggtactgggtcgctatgtcaactgtatttcttcttgtgagctgctgtcaaaa gatcggaaagctctgctctggatcaccttctgtgccttctga