GENSCAN 1.0 Date run: 8-Nov-116 Time: 11:08:29 Sequence gi568815578f:35072155_35276810 : 204656 bp : 45.06% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 375 370 6 1.05 1.04 Term - 4031 3208 824 0 2 -62 43 377 0.042 11.46 1.03 Intr - 4411 4249 163 1 1 77 71 36 0.054 0.35 1.02 Intr - 6020 5892 129 2 0 87 88 71 0.067 7.89 1.01 Init - 20627 20460 168 2 0 109 96 162 0.957 16.64 1.00 Prom - 25299 25260 40 -2.56 2.07 PlyA - 26143 26138 6 1.05 2.06 Term - 43779 43279 501 0 0 44 41 469 0.927 32.28 2.05 Intr - 46565 46444 122 0 2 33 98 115 0.285 7.21 2.04 Intr - 51880 51736 145 1 1 109 99 242 0.999 27.26 2.03 Intr - 54221 54097 125 0 2 77 91 217 0.998 21.20 2.02 Intr - 59629 59488 142 1 1 125 66 237 0.901 25.13 2.01 Init - 62775 62584 192 0 0 54 82 183 0.785 13.27 2.00 Prom - 64109 64070 40 -2.16 3.06 PlyA - 66154 66149 6 -0.45 3.05 Term - 66346 66249 98 2 2 131 43 31 0.630 1.03 3.04 Intr - 70324 70219 106 1 1 88 94 118 0.977 12.19 3.03 Intr - 72864 72825 40 2 1 76 119 33 0.916 3.43 3.02 Intr - 74781 74671 111 2 0 103 72 194 0.942 18.79 3.01 Init - 75104 74998 107 2 2 32 66 178 0.945 7.72 3.00 Prom - 78170 78131 40 -7.36 4.00 Prom + 79516 79555 40 -8.46 4.01 Init + 81647 81719 73 1 1 82 78 61 0.761 3.73 4.02 Intr + 90514 90673 160 2 1 68 72 98 0.375 5.25 4.03 Intr + 92583 92673 91 0 1 74 51 15 0.291 -3.60 4.04 Intr + 99935 100070 136 1 1 113 101 105 0.998 14.44 4.05 Intr + 102548 102799 252 0 0 68 76 371 0.984 31.31 4.06 Intr + 104014 104292 279 2 0 73 86 419 0.983 37.65 4.07 Term + 104544 104659 116 1 2 90 32 163 0.958 9.63 4.08 PlyA + 105015 105020 6 1.05 5.00 Prom + 108277 108316 40 -5.16 5.01 Init + 111296 111368 73 1 1 80 65 108 0.967 8.93 5.02 Term + 117489 117517 29 1 2 51 41 68 0.399 -3.36 5.03 PlyA + 117710 117715 6 1.05 6.00 Prom + 124236 124275 40 -5.66 6.01 Init + 126850 127320 471 0 0 114 61 124 0.807 7.83 6.02 Term + 136255 136257 3 0 0 131 48 0 0.468 -2.80 6.03 PlyA + 136823 136828 6 1.05 7.05 PlyA - 140034 140029 6 1.05 7.04 Term - 140322 140309 14 1 2 127 28 16 0.195 -2.14 7.03 Intr - 146000 145844 157 2 1 56 91 51 0.070 1.78 7.02 Intr - 152798 152651 148 1 1 78 46 33 0.119 -1.66 7.01 Init - 154871 154489 383 2 2 61 56 426 0.927 31.04 7.00 Prom - 161919 161880 40 0.64 8.00 Prom + 162303 162342 40 -4.96 8.01 Init + 163839 164039 201 2 0 54 87 63 0.388 1.13 8.02 Intr + 174686 174834 149 1 2 108 54 180 0.357 15.63 8.03 Intr + 179751 179867 117 0 0 92 49 59 0.272 1.98 8.04 Intr + 182296 182600 305 1 2 91 103 256 0.965 23.63 8.05 Intr + 191637 191798 162 1 0 105 66 303 0.977 29.75 8.06 Intr + 195051 195265 215 1 2 82 97 226 0.935 21.33 8.07 Intr + 197606 197744 139 1 1 125 90 238 0.983 27.74 8.08 Intr + 199415 199685 271 0 1 97 74 405 0.695 36.60 8.09 Term + 202035 202455 421 0 1 85 53 807 0.698 71.66 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578f:35072155_35276810|GENSCAN_predicted_peptide_1|427_aa MAAAPVAAGSGAGRGRRSAATVAAWGGWGGRPRPGNILLQLRQGQLTGRGLVRAVQFTET FLTERDKQSKWSGIPQLLLKLHTTSHLHSDFVECQNILKPLLVIPRQTGSGVDLQQTLTD LQLRVLTVRRKTNKQKGHPHQNPICMSPSSKTKEAKNLDKRLDEWLTRINSIEKTLNDLM ELNTMARKLRDACTSFSSQFDQVEERVSVIEDQMNEMKREEKFREKRIKRNKQSLQEIWD CVKRPNLRLIGVPESDGENGTKLENTLQDIIQENFPNLARQVNIQIQEIQRTPQRYSSRR GTPRHIIVRFTKVEIKEKILRAAREKGRVTHKGKPIRLTAVLLAETLQARREWGPIFNIL KEKNFQPRVSYPAKLSFISEGEVKSFTDKQMLRDFVTTRPALQELLKEALNMERNNRYQL LQKHAKL >gi568815578f:35072155_35276810|GENSCAN_predicted_CDS_1|1284_bp atggcggcggcgccggtagcggctgggtctggagccggccgagggagacggtcggcagcc acagtggcggcttggggcggatggggcggccggccgcggcctggtaacattctgctgcag ctgcggcagggccagctgaccggccggggcctggtccgggcggtgcagttcactgagact tttttgacggagagggacaaacaatccaagtggagtggaattcctcagctgctcctcaag ctgcacaccaccagccacctccacagtgactttgttgagtgtcaaaacatcctcaagcct ctgctggtgatacccagacaaacagggtctggagtggacctccagcaaactctgacagac ctgcagctgagggtcctgactgttagaaggaaaactaacaaacagaaaggacatccacac caaaaccccatctgtatgtcaccatcatcaaagaccaaagaagctaaaaaccttgacaaa agattagacgaatggctaactagaataaacagcatagaaaagaccttaaatgacctgatg gagctgaacaccatggctcgaaaactacgtgacgcatgcacaagcttcagtagccaattc gatcaagtggaagaaagggtatcagtgattgaagatcaaatgaatgaaatgaagcgagaa gagaagtttagagaaaaaagaataaaaagaaacaaacaaagcctccaagaaatatgggac tgtgtgaaaagaccaaatctacgtctgattggtgtacctgaaagtgacggggagaatgga accaagttggaaaacactcttcaggatattatccaggagaacttccccaatctagcaagg caggtcaacattcaaattcaggaaatacagagaacgccacaaagatactcctcgaggaga ggaactccaagacacataattgtcagattcaccaaagttgaaattaaggaaaaaatatta agggcagccagagagaaaggtcgggttacccacaaagggaagcccatcagactaacagcg gttctcttggcagaaactctacaagccagaagagagtgggggccaatattcaacattctt aaagaaaagaattttcaacccagagtttcatatccagccaaactaagcttcataagtgaa ggagaagtaaaatcctttacagacaagcaaatgctgagagattttgtcaccaccaggcct gccttacaagagctcctgaaggaagcgctaaacatggaaaggaacaaccgataccaacta ctgcaaaaacatgccaaattgtaa >gi568815578f:35072155_35276810|GENSCAN_predicted_peptide_2|408_aa MPYGTVNLLHGVNPGETPVTCTAGIGTFIVEFATLSSLTGDPVFEDVARVALMRLWESRS DIGLVGNHIDVLTGKWVAQDAGIGAGVDSYFEYLVKGAILLQDKKLMAMFLEYNKAIRNY TRFDDWYLWVQMYKGTVSMPVFQSLEAYWPGLQSLIGDIDNAMRTFLNYYTVWKQFGGLP EFYNIPQGYTVEKREGYPLRPELIESAMYLYRATGDPTLLELGRDAVESIEKISKVECGF ATIKDLRDHKLDNRMESFFLAETVKYLYLLFDPTNFIHNNGSTFDAVITPYGECILGAGG YIFNTEAHPIDPAALHCCQRLKEEQWEVEDLMREFYSLKRSRSKFQKNTVSSGPWEPPAR PGTLFSPENHDQARERKPAKQKVPLLSCPSQPFTSKLALLGQVFLDSS >gi568815578f:35072155_35276810|GENSCAN_predicted_CDS_2|1227_bp atgccatatggaacagtgaacttacttcatggcgtgaacccaggagagacccctgtcacc tgtacggcagggattgggaccttcattgttgaatttgccaccctgagcagcctcactggt gacccggtgttcgaagatgtggccagagtggctttgatgcgcctctgggagagccggtca gatatcgggctggtcggcaaccacattgatgtgctcactggcaagtgggtggcccaggac gcaggcatcggggctggcgtggactcctactttgagtacttggtgaaaggagccatcctg cttcaggataagaagctcatggccatgttcctagagtataacaaagccatccggaactac acccgcttcgatgactggtacctgtgggttcagatgtacaaggggactgtgtccatgcca gtcttccagtccttggaggcctactggcctggtcttcagagcctcattggagacattgac aatgccatgaggaccttcctcaactactacactgtatggaagcagtttggggggctcccg gaattctacaacattcctcagggatacacagtggagaagcgagagggctacccacttcgg ccagaacttattgaaagcgcaatgtacctctaccgtgccacgggggatcccaccctccta gaactcggaagagatgctgtggaatccattgaaaaaatcagcaaggtggagtgcggattt gcaacaatcaaagatctgcgagaccacaagctggacaaccgcatggagtcgttcttcctg gccgagactgtgaaatacctctacctcctgtttgacccaaccaacttcatccacaacaat gggtccaccttcgacgcggtgatcaccccctatggggagtgcatcctgggggctgggggg tacatcttcaacacagaagctcaccccatcgaccctgccgccctgcactgctgccagagg ctgaaggaagagcagtgggaggtggaggacttgatgagggaattctactctctcaaacgg agcaggtcgaaatttcagaaaaacactgttagttcggggccatgggaacctccagcaagg ccaggaacactcttctcaccagaaaaccatgaccaggcaagggagaggaagcctgccaaa cagaaggtcccacttctcagctgccccagtcagcccttcacctccaagttggcattactg ggacaggttttcctagactcctcataa >gi568815578f:35072155_35276810|GENSCAN_predicted_peptide_3|153_aa MPFRLLIPLGLLCALLPQHHGAPGPDGSAPDPAHYRERVKAMFYHAYDSYLENAFPFDEL RPLTCDGHDTWGSFSLTLIDALDTLLILGNVSEFQRVVEVLQDSVDFDIDVNASVFETNI RGMTGDWETDRVTNGMSMGFGIKDTQPYHLLDG >gi568815578f:35072155_35276810|GENSCAN_predicted_CDS_3|462_bp atgcctttccggctgctcatcccgctcggcctcctgtgcgcgctgctgcctcagcaccat ggtgcgccaggtcccgacggctccgcgccagatcccgcccactacagggagcgagtcaag gccatgttctaccacgcctacgacagctacctggagaatgcctttcccttcgatgagctg cgacctctcacctgtgacgggcacgacacctggggcagtttttctctgactctaattgat gcactggacaccttgctgattttggggaatgtctcagaattccaaagagtggttgaagtg ctccaggacagcgtggactttgatattgatgtgaacgcctctgtgtttgaaacaaacatt cgaggtatgacaggtgactgggagacagacagagtgacaaatggaatgagcatgggcttt ggaatcaaagatactcagccctaccacctactagatgggtga >gi568815578f:35072155_35276810|GENSCAN_predicted_peptide_4|368_aa MASRPVREGGGGVSPPPGQPPRLGGSQFLASKGTKLETENEFEELTEVDFRRWVITNSSE LKEHVLTQCKEAKNLEKRDMDEAENHHSQQTNTKTENQTLHVITHKWETAASGARSRPEP GTQVRSLNFRMLTTLLPILLLSGWAFCSQDASDGLQRLHMLQISYFRDPYHVWYQGNASL GGHLTHVLEGPDTNTTIIQLQPLQEPESWARTQSGLQSYLLQFHGLVRLVHQERTLAFPL TIRCFLGCELPPEGSRAHVFFEVAVNGSSFVSFRPERALWQADTQVTSGVVTFTLQQLNA YNRTRYELREFLEDTCVQYVQKHISAENTKGSQTSRSYTSLVLGVLVGSFIIAGVAVGIF LCTGGRRC >gi568815578f:35072155_35276810|GENSCAN_predicted_CDS_4|1107_bp atggccagccgccccgtccgggagggaggtgggggggtcagccctccgcccggccagccg ccccgtctgggaggatcacaattccttgccagcaagggaacaaaactggaaacagagaat gagtttgaagaattgacagaagtagacttcagaaggtgggtaataacaaactcctccgag ctaaaggagcatgttctaacccaatgcaaggaagctaagaaccttgaaaaaagggacatg gatgaggctgaaaaccatcattctcagcaaactaacacaaaaacagaaaaccaaacactg catgttatcactcataagtgggagactgcagccagcggagcccgcagccggcccgagcca ggaacccaggtccggagcctcaacttcaggatgttgacaacattgctgccgatactgctg ctgtctggctgggccttttgtagccaagacgcctcagatggcctccaaagacttcatatg ctccagatctcctacttccgcgacccctatcacgtgtggtaccagggcaacgcgtcgctg gggggacacctaacgcacgtgctggaaggcccagacaccaacaccacgatcattcagctg cagcccttgcaggagcccgagagctgggcgcgcacgcagagtggcctgcagtcctacctg ctccagttccacggcctcgtgcgcctggtgcaccaggagcggaccttggcctttcctctg accatccgctgcttcctgggctgtgagctgcctcccgagggctctagagcccatgtcttc ttcgaagtggctgtgaatgggagctcctttgtgagtttccggccggagagagccttgtgg caggcagacacccaggtcacctccggagtggtcaccttcaccctgcagcagctcaatgcc tacaaccgcactcggtatgaactgcgggaattcctggaggacacctgtgtgcagtatgtg cagaaacatatttccgcggaaaacacgaaagggagccaaacaagccgctcctacacttcg ctggtcctgggcgtcctggtgggcagtttcatcattgctggtgtggctgtaggcatcttc ctgtgcacaggtggacggcgatgttaa >gi568815578f:35072155_35276810|GENSCAN_predicted_peptide_5|33_aa MIESFLRPHQKKMLVPCFLYILQKWGITEPADM >gi568815578f:35072155_35276810|GENSCAN_predicted_CDS_5|102_bp atgattgaaagcttcctgaggcctcaccagaagaagatgctggtgccatgcttcctgtac atcctgcagaagtggggcatcacggaacctgccgacatgtga >gi568815578f:35072155_35276810|GENSCAN_predicted_peptide_6|157_aa MPSQPPRTFIAKDNSLPGFKASKHRLTLLLGDNAAGDFKLKPMSMYHYENPRAIKNDVNS LPMLHKWNNKAWMTAYLFIAWFTGYFKPTVETYCSGKKIPLKILLLIGNARGHPRALMEM YKEIQVVFMLANTSFILQSMDQGVTSIFKSYLRNTGQ >gi568815578f:35072155_35276810|GENSCAN_predicted_CDS_6|474_bp atgcccagccagccacctaggacttttatagctaaagataactcattgcctggattcaaa gcttcaaagcacaggctaactctcttactaggggataatgcagctggtgattttaagttg aagccaatgtccatgtaccattatgaaaatcctagggccattaagaatgatgtgaattct ctgcctatgctccataaatggaacaacaaagcctggatgacagcatatctgttcatagca tggtttactggatattttaagcctactgttgagacctactgctcaggaaaaaagattcct ctcaaaatattactactcattggcaatgcacgtggtcacccaagagccctgatggagatg tacaaggagattcaggttgttttcatgcttgctaacacatcattcattctgcagtccatg gatcaaggagtaacttcgattttcaagtcttatttaagaaatacaggccagtga >gi568815578f:35072155_35276810|GENSCAN_predicted_peptide_7|233_aa MPRARAPAPRASPTCPAKGASASSARATATATAARFPAPAAAAAARAAPGRQQSAGSSSS SSRPGTRQRLQRGAWPGGGGGGGGPGAARPPRLLGIPAAGPAPGPPAFAAAAAASAASAP PPPRAPPRMGVRKETDLRDNGHTRRGKGVWSDGKSRKMDGSIFTETESSCGGTGLKETQP WAHLAQPMGQQEQQLFLQEVHLHSLHLQEPAHVQEPPVVQEQLGSISSRGPAT >gi568815578f:35072155_35276810|GENSCAN_predicted_CDS_7|702_bp atgccccgagcccgcgccccggccccgcgcgccagccccacctgcccggcgaagggcgcc tccgcctcgtccgcccgcgccaccgccaccgccaccgctgcccggttccctgcccccgcc gccgccgccgccgcccgcgcggcgcccgggaggcagcagagcgcgggcagcagcagcagc agcagccgcccagggacccgccagcggctccagcgcggggcctggcccggcggcggcggc ggcggcggcggccccggcgcggcgcggccgccccggctcctcggcatcccggcggcgggg cccgcgcccggcccgcctgccttcgccgccgccgccgccgcctcggccgccagcgcgccc ccgcctccgcgcgccccgccccggatgggggtgagaaaagagacagatttaagagacaat gggcatacaagaagagggaaaggagtttggagtgatggcaaatcgagaaagatggatggt agcatcttcacagaaacggagtcctcatgtggaggaactggcttgaaggagacacagccc tgggcacacttggcacagcccatggggcagcaggagcagcagctcttcttgcaggaggtg catttgcactctttgcacttgcaggagccggcgcacgtgcaggagccaccagtggtgcag gagcagttggggtccatttcaagccgcggacctgctacgtaa >gi568815578f:35072155_35276810|GENSCAN_predicted_peptide_8|659_aa MVIIWLVVASWEWAKDILQSAREFPLVHLPPTQEPMAANRQCCAIKVFVKKLSWALWAIL RALFRLANWLKSYGYLLPYDSRASALHSAKALQSAVSTMQQFYGIPVTGVLDQTTIEWMK KPRCGVPDHPHLSRRRRNKRYALTGQKWRQKHITYSIHNYTPKVGELDTRKAIRQAFDVW QKVTPLTFEEVPYHEIKSDRKEADIMIFFASGFHGDSSPFDGEGGFLAHAYFPGPGIGGD THFDSDEPWTLGNANHDGNDLFLVAVHELGHALGLEHSSDPSAIMAPFYQYMETHNFKLP QDDLQGIQKIYGPPAEPLEPTRPLPTLPVRRIHSPSERKHERQPRPPRPPLGDRPSTPGT KPNICDGNFNTVALFRGEMFVFKDRWFWRLRNNRVQEGYPMQIEQFWKGLPARIDAAYER ADGRFVFFKGDKYWVFKEVTVEPGYPHSLGELGSCLPREGIDTALRWEPVGKTYFFKGER YWRYSEERRATDPGYPKPITVWKGIPQAPQGAFISKEGCTACALLFTLPQSRCRKCLGVV MLGCISADYTYFYKGRDYWKFDNQKLSVEPGYPRNILRDWMGCNQKEVERRKERRLPQDD VDIMVTINDVPGSVNAVAVVIPCILSLCILVLVYTIFQFKNKTGPQPVTYYKRPVQEWV >gi568815578f:35072155_35276810|GENSCAN_predicted_CDS_8|1980_bp atggtgatcatctggcttgtggtggcctcctgggagtgggccaaggacattctgcagagc gccagggaattcccgctagtgcacttaccacccactcaagagccgatggcagccaacagg caatgctgcgccattaaggtgtttgtcaagaagctaagctgggcattgtgggcgatcctg agggccctttttagattggctaactggttaaagtcctatggctatctgcttccctatgac tcacgggcatctgcgctgcactcagcgaaggccttgcagtcggcagtctccactatgcag cagttttacgggatcccggtcaccggtgtgttggatcagacaacgatcgagtggatgaag aaaccccgatgtggtgtccctgatcacccccacttaagccgtaggcggagaaacaagcgc tatgccctgactggacagaagtggaggcaaaaacacatcacctacagcattcacaactat accccaaaagtgggtgagctagacacgcggaaagctattcgccaggctttcgatgtgtgg cagaaggtgaccccactgacctttgaagaggtgccataccatgagatcaaaagtgaccgg aaggaggcagacatcatgatcttttttgcttctggtttccatggcgacagctccccattt gatggagaagggggattcctggcccatgcctacttccctggcccagggattggaggagac acccactttgactccgatgagccatggacgctaggaaatgccaaccatgacgggaacgac ctcttcctggtggctgtgcatgagctgggccacgcgctgggactggagcactccagcgac cccagcgccatcatggcgcccttctaccagtacatggagacgcacaacttcaagctgccc caggacgatctccagggcatccagaagatctatggacccccagccgagcctctggagccc acaaggccactccctacactccccgtccgcaggatccactcaccatcggagaggaaacac gagcgccagcccaggccccctcggccgcccctcggggaccggccatccacaccaggcacc aaacccaacatctgtgacggcaacttcaacacagtggccctcttccggggcgagatgttt gtctttaaggatcgctggttctggcgtctgcgcaataaccgagtgcaggagggctacccc atgcagatcgagcagttctggaagggcctgcctgcccgcatcgacgcagcctatgaaagg gccgatgggagatttgtcttcttcaaaggtgacaagtattgggtgtttaaggaggtgacg gtggagcctgggtacccccacagcctgggggagctgggcagctgtttgccccgtgaaggc attgacacagctctgcgctgggaacctgtgggcaagacctactttttcaaaggcgagcgg tactggcgctacagcgaggagcggcgggccacggaccctggctaccctaagcccatcacc gtgtggaagggcatcccacaggctccccaaggagccttcatcagcaaggaaggatgtact gcctgtgccctccttttcacactgccccagagcaggtgccggaagtgtctgggagtggtg atgctgggctgtatttctgcagattacacctatttctacaagggccgggactactggaag tttgacaaccagaaactgagcgtggagccaggctacccgcgcaacatcctgcgtgactgg atgggctgcaaccagaaggaggtggagcggcggaaggagcggcggctgccccaggacgac gtggacatcatggtgaccatcaacgatgtgccgggctccgtgaacgccgtggccgtggtc atcccctgcatcctgtccctctgcatcctggtgctggtctacaccatcttccagttcaag aacaagacaggccctcagcctgtcacctactataagcggccagtccaggaatgggtgtga