GENSCAN 1.0 Date run: 6-Nov-116 Time: 12:13:56 Sequence gi568815581r:78576094_78774742 : 198649 bp : 47.08% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 161 156 6 1.05 1.04 Term - 519 467 53 1 2 103 31 70 0.223 0.39 1.03 Intr - 3989 3807 183 0 0 1 96 144 0.405 6.26 1.02 Intr - 15059 14937 123 0 0 61 91 67 0.748 4.86 1.01 Init - 15292 15277 16 1 1 114 78 8 0.426 2.84 1.00 Prom - 17237 17198 40 -4.36 2.25 PlyA - 20200 20195 6 1.05 2.24 Term - 25837 25679 159 1 0 94 44 82 0.551 2.34 2.23 Intr - 26405 26153 253 1 1 99 47 83 0.590 2.74 2.22 Intr - 28383 28304 80 0 2 60 86 60 0.127 1.35 2.21 Intr - 31365 31257 109 2 1 72 80 64 0.247 4.29 2.20 Intr - 33910 33729 182 1 2 69 28 119 0.218 2.67 2.19 Intr - 35251 35144 108 1 0 74 88 44 0.123 3.48 2.18 Intr - 40325 40202 124 1 1 73 37 92 0.131 3.19 2.17 Intr - 41158 40783 376 2 1 55 -11 215 0.063 2.37 2.16 Intr - 42115 41785 331 1 1 64 105 113 0.465 5.80 2.15 Intr - 42951 42856 96 0 0 115 31 69 0.473 4.11 2.14 Intr - 46772 46638 135 2 0 79 53 70 0.841 3.36 2.13 Intr - 47712 47461 252 0 0 63 47 122 0.617 3.23 2.12 Intr - 48222 48099 124 2 1 112 107 59 0.993 10.79 2.11 Intr - 52345 52225 121 2 1 67 95 86 0.993 6.75 2.10 Intr - 54108 53939 170 2 2 60 59 149 0.807 8.79 2.09 Intr - 61376 61232 145 0 1 101 97 59 0.651 7.44 2.08 Intr - 65391 65370 22 0 1 63 109 10 0.386 -2.28 2.07 Intr - 65605 65501 105 1 0 49 87 84 0.322 4.81 2.06 Intr - 82615 82437 179 2 2 49 85 70 0.162 2.44 2.05 Intr - 88139 88014 126 0 0 74 56 46 0.207 0.65 2.04 Intr - 88758 88662 97 0 1 64 99 5 0.255 -1.22 2.03 Intr - 91824 91582 243 0 0 47 32 185 0.424 6.49 2.02 Intr - 92280 92077 204 0 0 53 105 138 0.971 11.40 2.01 Init - 93104 92970 135 2 0 82 75 37 0.557 1.94 2.00 Prom - 96330 96291 40 -6.86 3.17 PlyA - 97628 97623 6 1.05 3.16 Term - 100076 99998 79 1 1 109 53 111 0.510 6.94 3.15 Intr - 101227 101061 167 1 2 106 44 43 0.395 0.46 3.14 Intr - 101493 101367 127 2 1 51 59 74 0.718 1.48 3.13 Intr - 104251 104097 155 1 2 99 97 175 0.997 18.37 3.12 Intr - 104949 104878 72 0 0 94 81 70 0.984 6.50 3.11 Intr - 116400 116324 77 1 2 124 101 22 0.756 6.33 3.10 Intr - 117918 117880 39 1 0 78 64 70 0.602 1.80 3.09 Intr - 122287 122176 112 1 1 42 111 84 0.981 6.05 3.08 Intr - 122875 122727 149 2 2 105 102 227 0.999 25.75 3.07 Intr - 124350 124238 113 2 2 52 66 180 0.997 12.22 3.06 Intr - 125658 125578 81 2 0 88 113 70 0.997 8.25 3.05 Intr - 126147 126029 119 0 2 75 94 202 0.999 18.76 3.04 Intr - 126511 126445 67 0 1 68 115 39 0.900 3.51 3.03 Intr - 132168 132104 65 0 2 100 113 74 0.957 8.62 3.02 Intr - 133639 133557 83 2 2 66 91 171 0.918 14.56 3.01 Init - 134257 134248 10 1 1 76 86 3 0.896 -0.34 3.00 Prom - 137008 136969 40 -6.16 4.00 Prom + 138785 138824 40 -7.66 4.01 Init + 139599 139675 77 2 2 81 113 24 0.075 4.78 4.02 Intr + 147864 148012 149 0 2 87 47 57 0.092 1.38 4.03 Intr + 149508 149608 101 1 2 69 54 78 0.468 2.33 4.04 Intr + 159439 159552 114 0 0 93 59 52 0.437 3.44 4.05 Intr + 165716 165907 192 1 0 77 84 75 0.573 5.69 4.06 Intr + 168290 168382 93 1 0 101 81 0 0.365 0.76 4.07 Term + 189335 189433 99 1 0 66 44 107 0.677 2.23 4.08 PlyA + 189748 189753 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 69340 69471 132 0 0 84 113 104 0.952 12.64 S.002 Term + 73959 74102 144 2 0 68 52 94 0.840 1.71 S.003 Term - 112441 112251 191 2 2 59 44 130 0.877 3.31 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:78576094_78774742|GENSCAN_predicted_peptide_1|124_aa MAEVTGALKASTLGPDPPVPLILKASQVILPVAKQRTSALSYTCHSAILSVQFRGTEYTY IVVKTPPPSISRTSSLSQIETLTLLPERGPDPDPKRGFLDLVQEEIQGNDDVTRPRVSEI LGGD >gi568815581r:78576094_78774742|GENSCAN_predicted_CDS_1|375_bp atggcagaggtgacaggtgctttgaaagcctccacgcttggccccgaccctccagtgccc ttaattctcaaagcgtcccaggtgattctgcccgtggctaagcagagaacctcagcctta agttacacctgccactcagccattttaagtgtacagttccgtggcactgagtacacttac atcgttgtaaaaacaccaccaccatccatctctagaacttcttcactatcccaaattgaa actctgaccctgttaccagaaaggggtccagatccagaccccaagagagggttcttggat ctcgtgcaagaagaaattcagggaaatgacgacgtgacgcgtccccgggtctcggagatc ctgggtggagattaa >gi568815581r:78576094_78774742|GENSCAN_predicted_peptide_2|1291_aa MEGEGEIRMSPKCWAWAADKDRLGLEIQVLFWRCQEEDDSYNGRQNEGENGVIISIDAGK ALDEMKLHQEIHPTGQAQKEPFQHKQGLLRDTLETVSSPPSQAPKKDWGKSSLGSGSRHM PLSNLVEDLSQYKAIYENTTANLILNGERLEAFPLRSGSRQGCLLLPLLLNMVLDVLARA IRQEKEGLQVGKEKPCSSHGTEGTWNAAALALLIPSHGSRPPLSQPASGAPFRSQEKGWR PLSSHIVLGTVGDSRHHTSQPEPIGLTQGGTWQPVAREQLHLGAGAIPGHPDPVTAYEAP ESHVFLNFPKSTTGACQKDVTRMNRLLLNRPLTTDDIVFRQWWQFLKDQMSQPDWLALFL MIKNSIGFSAGRPDEGFRASKTELYTREQLIQRKGRTVLPGQPWKGFSGGHHQGVGTPRC DVCGSPSPPCLLWNLHNDRCFGGQSACVRALHTLNVTGNKDTPRFQEKIEHVDPFYHMTH YQTKPELLVFGSGDEFFSTDGTMNFYDQLPGRKYLRAIPSAGHAIDFHETNTYLEIRFFL YLLKEQEFPSISWTKTEMSGAGGYLHSIETCLCLSEDGQKQTQEELRNCTGQVQPGGGTD HRTQEASRSQMQQIEAACLTACEMQGEEDVYVTEPIGHSMEGPQCGPGTAFMGIIGKPHG KAQADHLIGGQYPSLPADPGRACGTPAPGAYRKELEVPSAGWEASFIEVTSLDAGLAEIL STEEARMELLLTRADLLLVTYSWAKSEKPLPALGKADGAAQQAGFAHRLRYHPTQSARLC GPLIYILICQQDGVGYAVFAQRSDPRDICVGVCPGAACLQVQIGVSFAVGINALELLGLC LHKLPLPVSSEKSYCGDTEWGRQSTVSYHDTKLVAGPRGHEGGEGAPSTVQPMRRSDPLC PPVTSGPSQSPLAGKNRQHLRPLGVSVNSDRGSNSPEMSASSLFPAYTLTHVRSLVISVD TQEAVEDSIPRAPQMHQCPSNASARGVLWTFLGTMHAIDFHVKETLLAIDSFFPRLLMGQ TFPSVSRAKIEGPQPTRRPGCCYWHQLRQALDLSLAVGCRGPAEDMFWVHTEDYAQPPRI QKIGNGASYATLGELGQAQEEFLKGAGRREQEDEIKDNGKDEWCLESQQLHLLLEMDEDF PDENKQRLSTQSSLGPCGSANPRLLNLVGVDEKGSWEGPGSQHCPHSDFEVPGGQHAYWE RELSQMPRALKTGLDVSKGEKLPSSRHSFYHGNNCQNQHGRLSAVCKLEFMGTLYFSLLG IRERWAPGCPSAPSLVMTLESPDLDLRRFSL >gi568815581r:78576094_78774742|GENSCAN_predicted_CDS_2|3876_bp atggagggagaaggagaaatcaggatgagtcctaagtgttgggcctgggcagccgacaag gacagacttgggctggaaatccaggttctcttctggaggtgtcaagaagaggatgactct tataatggccggcagaatgaaggagaaaacggggtgatcatctccatcgatgcaggaaaa gcattggacgaaatgaaactccaccaggaaatacacccaacggggcaggcgcagaaggaa ccctttcaacacaagcaaggcctactgcgggacacacttgagacagtatcgtcccctcct tcccaggccccaaagaaggactggggcaagtccagcctgggttcaggcagcagacatatg cccttaagtaacctggtggaggacctgtcacagtacaaggccatctatgaaaacaccaca gccaacctcatactcaacggagagagattggaagcttttcctctaagatcaggatcaaga caaggatgcctgcttttgccacttctcctcaacatggtcctggatgttctagccagggca attaggcaagaaaaagaaggtctccaagttggaaaggaaaaaccctgctccagccatggg actgaggggacctggaatgctgctgccttagccctgctcattcccagccatgggtctcgc ccacccctctcccagccagcatctggtgcaccctttcggtcacaggaaaaggggtggaga ccactaagttctcacattgtgctgggcaccgtcggtgacagcagacatcacacatcacag cccgagcccattggcctcactcaaggaggaacgtggcagcctgtggcccgtgagcagctg cacttgggggctggggccattccagggcacccggatccggtgactgcatatgaggctcct gaatctcatgttttcctaaactttccaaaatcaacaacaggtgcctgccagaaggacgtt actcgaatgaacagactactgttgaatcgtccactgaccacagatgacatcgtgttcagg cagtggtggcagttcctcaaggaccaaatgtcccagccagattggttggccttgttcctg atgatcaagaattccataggcttctctgcaggaaggcctgatgaagggttcagagctagc aagacagaactctacacgagggagcaactaattcagagaaagggccggactgtccttccc ggacaaccctggaaaggcttctcgggggggcaccatcagggtgttggaacgccaaggtgt gatgtctgcggctcaccatctccaccctgcctgctgtggaatctccacaacgacaggtgc tttggaggccagagtgcatgcgtacgggccctgcacacgctgaatgtcactgggaacaag gacacaccccgcttccaggagaagatagaacatgtcgaccccttttaccatatgacacat tatcagaccaaaccagagctcttggttttcggatctggggatgaattctttagcacagat ggaaccatgaacttctatgatcaactgccaggaaggaaatacctcagagccatccccagt gcaggacatgctattgacttccatgagacaaacacctacttggaaatcaggttttttctg taccttttgaaggagcaggaatttcccagcatttcctggacaaagacggagatgtcgggg gcaggtgggtatttgcattccattgaaacctgtctgtgcttgtcagaggatggccagaag cagacccaggaggagcttcggaactgcacggggcaggtgcagcctgggggtggcacggac cacagaacacaagaggcatccaggagtcagatgcagcagatagaggccgcgtgcctcacc gcctgtgaaatgcagggagaggaggatgtttatgtcactgagcccatcgggcacagcatg gaaggtccgcagtgtggtcctgggactgccttcatgggcatcataggaaagccacacggc aaggcccaggctgaccatctaatcggaggacagtacccctcgcttcctgcagaccccggg cgggcatgtggcaccccagctcctggggcctacagaaaggagctggaggtgcctagtgca ggctgggaggcctcctttattgaggtgacatccctggatgctggcttggcggagattctt tccactgaggaggcaaggatggagttgctgctgacccgtgcagatttgctgctggtaaca tactcatgggcaaagtctgaaaaacctcttcctgccctgggcaaggcagatggggcagcc cagcaggctggctttgcccaccgtctccgatatcatccaacacagtcagccaggttgtgt ggccctctcatctacattctcatttgccaacaagatggtgtgggctatgctgtctttgca cagcgatctgatcccagggacatttgtgtgggtgtgtgcccgggagctgcatgtctgcag gttcagattggggtcagctttgctgttgggatcaatgcccttgaacttctaggcctgtgc ctgcacaagctgcccctccccgtgtcttcggagaagtcctactgcggggacacagagtgg gggaggcaaagcaccgtcagctaccatgacacgaagcttgttgctggacccagaggacat gaaggaggggaaggggctcccagtacagtccaacccatgcgccgctcagacccgctctgc cctcccgttacttctggcccatcacagagccccctagctggcaagaatcggcagcatttg aggcctcttggtgtcagtgtcaactcagaccgtggatccaacagccctgaaatgtctgca tcctcccttttcccagcgtatactctgacacatgtccgtagtcttgtcatctctgttgac actcaggaggctgtggaggacagtatccctcgagcaccccagatgcaccagtgcccctct aatgcttcagccagaggggtcctctggaccttcttagggacaatgcatgcaattgatttc catgtcaaagaaaccctactggccatcgacagctttttcccaagacttctgatggggcaa acatttcccagtgtttcccgggcaaagatagagggtccccagcccacacgcaggcccgga tgctgttactggcatcaacttcgccaggctctggacctatctttggctgtcggctgccga ggaccagcagaagacatgttctgggtacacacagaggactatgctcaaccgccaaggatt cagaaaataggaaatggagcttcttatgccactctgggagagcttgggcaggcccaggag gaatttctgaaaggagccgggaggagagaacaagaggacgagataaaagataatggaaaa gacgaatggtgtttagaaagccagcagctccatcttctcctggaaatggatgaagacttc ccagatgagaacaagcagaggctgtctactcagagctcgctaggaccctgcggctctgca aatccccggctgctcaaccttgtcggggtggatgagaaggggtcctgggaggggcctggg agccagcattgcccccactcagattttgaagtccctggtgggcaacatgcatattgggag agggagctgagccaaatgcctcgggccctcaagacaggcttagatgtctccaagggggaa aagcttccttcgtccaggcattccttctatcatggaaacaactgtcagaaccagcatggg agactttcagctgtttgtaagttggagttcatgggcactctgtacttttccttactagga atcagagaaaggtgggcccctgggtgtccttcagcaccaagcctggtcatgacgctggaa tctcccgacctggaccttcgcaggttctcgctgtga >gi568815581r:78576094_78774742|GENSCAN_predicted_peptide_3|504_aa MKKVPSDLTAEERQELENIRRRKQELLADIQRLKDEIAEVANEIENLGSTEERKNMQRNK QVAMGRKKFNMDPKKGIQFLIENDLLKNTCEDIAQFLYKGEGLNKTAIGDYLGERDEFNI QVLHAFVELHEFTDLNLVQALRQFLWSFRLPGEAQKIDRMMEAFAQRYCQCNNGVFQSTD TCYVLSFAIIMLNTSLHNPNVKDKPTVERFIAMNRGINDGGDLPEELLRNLYESIKNEPF KIPEDDGNDLTHTFFNPDREGWLLKLAICDEGKRESMIGGGRVKTWKRRWFILTDNCLYY FEYTTDKEPRGIIPLENLSIREVEDSKKPNCFELYIPDNKDQVIKACKTEADGRVVEGNH TVYRISAPTPEEKEEWIKCIKGTAVELRVSGNSGCHRVCRLGAEMCSLNMGSLFDSPEPR EVEGSATRTSLQNPPAYPKGLAYPGSQGRFTPTPPNMKAGVPAARFPRESTSGGQTASQA AISRDPFYEMLAARKKKVSSTKRH >gi568815581r:78576094_78774742|GENSCAN_predicted_CDS_3|1515_bp atgaagaaagttcccagtgacctgacagcagaggagcgtcaagaactggagaacatccga cggagaaaacaggagctgctggctgacattcagaggctgaaggatgagatagcagaagta gctaatgaaattgaaaacctgggatccacagaggaaaggaaaaacatgcagaggaacaaa caggtagccatgggcaggaaaaaatttaatatggaccctaaaaaggggatccagttctta atagagaacgacctcctgaagaacacttgtgaagacattgcccagttcttatataaaggc gaagggctcaacaagacagccatcggcgactacctaggggagagagatgagtttaatatc caggttcttcatgcatttgtggagctgcatgagttcactgatcttaatctcgtccaggca ctacggcagttcctgtggagcttccggctacccggagaggcccagaagatcgaccggatg atggaggcgtttgcccagcgatattgtcagtgcaataatggcgtgttccagtccacggat acttgttacgtcctctcctttgccatcatcatgttgaacaccagtctgcacaaccccaat gtcaaagataagcccactgtggagaggttcattgccatgaaccgaggcatcaatgatggg ggagacctgccggaggagctcctccggaatctctatgagagcataaaaaatgaacccttt aaaatcccagaagacgacgggaatgacctcactcacactttcttcaatccagaccgagaa ggctggctattgaaactcgctatttgtgatgagggcaagagagaaagcatgatcggaggt ggcagggtaaagacttggaagagacgctggttcattctgactgacaactgcctttactac tttgagtataccacggataaggagccccgtggaatcatccctttagagaatctgagtatc cgggaagtggaggactccaaaaaaccaaactgctttgagctttatatccccgacaataaa gaccaagttatcaaggcctgcaagaccgaggctgacgggcgggtggtggaggggaaccac actgtttaccggatctcagctccgacgcccgaggagaaggaggagtggattaagtgcatt aaaggtacagcagtagaactgcgtgtcagcggtaacagcggctgccatcgagtgtgtcgg ctaggcgctgagatgtgctctttgaatatgggatctctttttgactctccggaacccagg gaggtggagggcagtgccacacgcacctctctgcagaaccccccagcttacccgaaaggg ttggcctacccaggaagccaagggagattcaccccaacacctccaaacatgaaagcaggt gtcccggccgccagattccctcgtgaaagcacttcaggtggtcagaccgcttcccaagca gccatcagcagggaccctttctacgaaatgctcgcagcacggaaaaagaaggtctcctcc acgaagcgacactga >gi568815581r:78576094_78774742|GENSCAN_predicted_peptide_4|274_aa MPPEWCPLGHQGRQEQLPLSGHLALRDPMPSTSTRDTPAFSPRRTRLLLSTLAATNGYFF TRGCGIAFEDTCLLSEDSVIMCKAHYDSLFNMASFHYADRVDHSRGLPVIQKYRYGHLCM LPSQRPPRVLQLSVSPVDTVQDHLHLEDSLRQTAGWSRTGSRAGGKRKVSGATLKPDPGG QVSHCQEKEFGNMKRGKARKNPKVGMELKELLRPFLTSCQMGDSLGHTITQVSSKVPKNI CKSGHREIAEVFVKVREQVEEVSVFNKSVKASVE >gi568815581r:78576094_78774742|GENSCAN_predicted_CDS_4|825_bp atgccccccgagtggtgtcctctgggccatcagggacgccaggaacagctccctttgagt gggcacctggctctacgggatcccatgccctccacttccacgagggacactccagccttt tcaccaagaaggactcgcctcctactcagtacactcgcagccaccaatggttatttcttc accagaggctgtggcatcgcctttgaggacacctgcctattatcagaagactcagtcatc atgtgcaaggctcactatgactctctctttaacatggccagcttccactacgccgacaga gtggatcacagccggggtttacctgtgatacagaagtatcgttatggacacctgtgcatg cttccctcccagcgtcccccacgcgtgctgcagctttcagtgtcgcccgtggacactgtc caggatcatttgcacttagaggacagcctccggcagacagcaggatggagccggacagga agtagagctggaggcaaaaggaaggtgtcgggtgcaaccctcaagccagatcctgggggc caggtttcccactgtcaggaaaaggaatttggaaacatgaaaagggggaaggctagaaag aaccctaaggttggaatggaattgaaggagctgcttagacctttcctaacaagctgccaa atgggagacagcttagggcatacaatcacgcaagtgtcctccaaagtcccgaagaacatc tgcaagtctggacacagagaaatcgcagaagtctttgtcaaagttagagaacaagttgaa gaagtctccgtcttcaataaatcagtaaaagccagtgtggagtag