GENSCAN 1.0 Date run: 4-Nov-116 Time: 16:25:29 Sequence gi568815578r:31565952_31822218 : 256267 bp : 47.23% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 259 344 86 0 2 76 110 73 0.885 6.92 1.02 Term + 2127 2277 151 0 1 101 38 64 0.591 0.08 1.03 PlyA + 6228 6233 6 1.05 2.02 PlyA - 7576 7571 6 1.05 2.01 Sngl - 12925 12500 426 1 0 81 37 296 0.959 18.10 2.00 Prom - 17083 17044 40 -6.76 3.03 PlyA - 20345 20340 6 1.05 3.02 Term - 28984 28794 191 2 2 61 41 98 0.185 0.01 3.01 Init - 38981 38723 259 2 1 30 94 252 0.954 17.30 3.00 Prom - 39057 39018 40 -10.94 4.00 Prom + 39299 39338 40 -5.76 4.01 Init + 39437 39862 426 1 0 63 78 822 0.971 73.00 4.02 Term + 40633 40713 81 0 0 93 39 75 0.788 0.79 4.03 PlyA + 41923 41928 6 1.05 5.00 Prom + 43889 43928 40 -4.26 5.01 Init + 44730 44751 22 2 1 76 105 21 0.124 2.69 5.02 Intr + 67258 67485 228 2 0 78 58 69 0.386 0.74 5.03 Intr + 73982 74146 165 0 0 109 95 267 0.926 29.43 5.04 Intr + 77453 77584 132 0 0 47 113 163 0.998 15.32 5.05 Term + 78817 78953 137 2 2 81 38 257 0.998 18.18 5.06 PlyA + 80650 80655 6 1.05 6.03 PlyA - 81748 81743 6 1.05 6.02 Term - 100135 99998 138 1 0 150 47 254 0.999 25.46 6.01 Init - 103263 103258 6 0 0 82 54 4 0.304 -2.86 6.00 Prom - 105965 105926 40 -6.86 7.00 Prom + 107709 107748 40 -7.36 7.01 Init + 108531 108585 55 2 1 86 91 0 0.460 1.79 7.02 Intr + 113181 113241 61 1 1 74 42 44 0.106 -3.81 7.03 Intr + 113294 113414 121 2 1 14 110 107 0.538 6.00 7.04 Intr + 113536 113663 128 0 2 26 103 145 0.930 9.28 7.05 Intr + 142021 142148 128 1 2 61 94 55 0.150 3.72 7.06 Intr + 142908 143035 128 1 2 84 8 24 0.075 -5.60 7.07 Intr + 147500 147625 126 1 0 69 65 148 0.858 11.48 7.08 Term + 150937 151044 108 0 0 100 40 69 0.844 1.81 7.09 PlyA + 151061 151066 6 -0.45 8.05 PlyA - 152975 152970 6 1.05 8.04 Term - 154704 154603 102 0 0 71 50 31 0.339 -4.12 8.03 Intr - 156258 155704 555 0 0 15 86 628 0.000 48.34 8.02 Intr - 157963 157784 180 1 0 91 94 53 0.000 6.26 8.01 Init - 160631 160629 3 2 0 102 81 0 0.001 0.70 8.00 Prom - 161468 161429 40 -4.16 9.00 Prom + 172641 172680 40 -1.16 9.01 Init + 191526 191631 106 2 1 52 111 51 0.855 4.18 9.02 Intr + 194106 194228 123 1 0 94 110 61 0.998 9.46 9.03 Intr + 200605 200731 127 2 1 104 94 45 0.994 6.44 9.04 Intr + 205609 205731 123 1 0 7 106 107 0.929 4.10 9.05 Intr + 209916 210037 122 0 2 119 71 148 0.998 16.34 9.06 Intr + 211536 211687 152 1 2 84 95 165 0.998 16.68 9.07 Intr + 212862 213033 172 2 1 103 83 86 0.999 9.12 9.08 Intr + 216298 216439 142 2 1 81 98 71 0.934 6.81 9.09 Intr + 217754 217970 217 2 1 55 79 148 0.919 9.11 9.10 Intr + 226784 226879 96 1 0 89 45 127 0.996 8.71 9.11 Intr + 227897 228073 177 1 0 76 92 88 0.995 8.12 9.12 Intr + 228451 228597 147 0 0 94 91 97 0.984 11.03 9.13 Intr + 231453 231564 112 2 1 105 27 137 0.957 9.25 9.14 Intr + 232414 232688 275 2 2 68 63 193 0.433 12.06 9.15 Term + 235019 235129 111 1 0 129 37 72 0.909 4.76 9.16 PlyA + 235522 235527 6 1.05 10.00 Prom + 235556 235595 40 -7.76 10.01 Init + 242471 242608 138 1 0 61 87 60 0.064 3.37 10.02 Intr + 253242 253325 84 2 0 75 41 60 0.182 0.02 10.03 Intr + 253582 253681 100 0 1 108 50 62 0.653 4.08 10.04 Intr + 254175 254503 329 1 2 80 47 200 0.780 10.52 10.05 Intr + 255444 255731 288 2 0 30 59 225 0.336 11.24 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 53836 53728 109 0 1 71 103 40 0.903 4.09 S.002 Init - 156267 155704 564 0 0 49 86 638 0.952 54.74 S.003 Term + 157686 158098 413 1 2 57 36 282 0.964 15.30 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:31565952_31822218|GENSCAN_predicted_peptide_1|78_aa PALLYLVPACIGFPVLVALAKGEVTEMFSYESSAEILPHTPRLTHFPTVSGSPASLADSM QQKLAGPRRRRPQNPSAM >gi568815578r:31565952_31822218|GENSCAN_predicted_CDS_1|237_bp cctgccctcctatacctggtccccgcctgcatcggttttcctgtcctggtggcgctggcc aagggagaagtgacagagatgttcagctacgagtcctcggcggaaatcctgcctcatacc ccgaggctcacccacttccccacagtctcgggctccccagccagcctggccgactccatg cagcagaagctagctggccctcgccgccggcgcccgcagaatcccagcgccatgtaa >gi568815578r:31565952_31822218|GENSCAN_predicted_peptide_2|141_aa MGLAGPALGAAGRPAAPGSEGLSTRASSCRGCTGSPSSAGPPALRWISCRALAASPWDTA GDLQPAMPPRPMGSCAAQASPTSAAPSSMAPGPIDPPRAEECGRHTAWDWQAAPPAAPVR DPQGEASRAPESSGDLENLYV >gi568815578r:31565952_31822218|GENSCAN_predicted_CDS_2|426_bp atgggcttggcgggccccgcacttggagcggctggccggcccgccgccccaggcagtgag gggcttagcacccgggccagcagctgcagagggtgcaccgggtccccaagcagtgctggc ccaccggcgctgcgctggatttcttgccgggccttagctgcctccccgtgggacacggct ggggacctgcagcccgccatgcccccccgccccatgggctcctgtgcagcccaagcctcc cctacgagcgccgccccctcctccatggcgcccggtcccatcgaccccccaagggctgag gagtgcggtcggcacacggcctgggattggcaggcagctccacctgcagcccctgtgcgg gatccacagggtgaagccagccgggctcctgagtctagtggggacttggagaacctttat gtctag >gi568815578r:31565952_31822218|GENSCAN_predicted_peptide_3|149_aa MREIEARTEKRLALDSPESLGFRTLLEDASRTLIYKYQDRNLKRALLYICSPGLSQFLNN VAVRVPETRRAYPDPSRVGAINSGKSKQYTNAHTHTPNVAFICTTDTTDKPNSHTPIDTT CQMHNAQQYTIIYNQHTDLRAHYRNTKSA >gi568815578r:31565952_31822218|GENSCAN_predicted_CDS_3|450_bp atgagagaaattgaggcccgaacggagaagcggctggctctggatagcccagagagcctg ggtttccggaccctcttggaagatgcttctcggaccctgatctacaaatatcaggaccga aatttaaagcgggcgctcctttacatctgctcccctgggctttctcaattcctaaataat gttgctgttcgtgttcctgagacccggagggcctacccagatccctcccgggtgggagca attaattcgggaaagtctaagcaatatacaaatgcacacacccacacgcccaacgtggca tttatatgtacgactgacacaacagacaaacccaacagccacacaccaatagacacaacc tgccaaatgcacaacgcacaacagtacaccataatatacaaccaacacactgacctccgc gcacactaccgaaacaccaaatctgcataa >gi568815578r:31565952_31822218|GENSCAN_predicted_peptide_4|168_aa MKVASGSTATAAAGPSCALKAGKTASGAGEVVRCLSEQSVAISRCAGGAGARLPALLDEQ QVNVLLYDMNGCYSRLKELVPTLPQNRKVSKVEILQHVIDYIRDLQLELNSESEVGTPGG RGLPVRAPLSTLNGEISALTAELVLGGELEGYIWIVALTGLNECFRCL >gi568815578r:31565952_31822218|GENSCAN_predicted_CDS_4|507_bp atgaaagtcgccagtggcagcaccgccaccgccgccgcgggccccagctgcgcgctgaag gccggcaagacagcgagcggtgcgggcgaggtggtgcgctgtctgtctgagcagagcgtg gccatctcgcgctgcgccgggggcgccggggcgcgcctgcctgccctgctggacgagcag caggtaaacgtgctgctctacgacatgaacggctgttactcacgcctcaaggagctggtg cccaccctgccccagaaccgcaaggtgagcaaggtggagattctccagcacgtcatcgac tacatcagggaccttcagttggagctgaactcggaatccgaagttggaacccccgggggc cgagggctgccggtccgggctccgctcagcaccctcaacggcgagatcagcgccctgacg gccgagctggttctgggaggagaattggagggctacatctggattgttgctcttaccggc ctgaatgagtgtttccggtgtctttaa >gi568815578r:31565952_31822218|GENSCAN_predicted_peptide_5|227_aa MAAGKKGGLSLSQSSQHVGPVTTSGLNAVSGVPSTLGPPAVPGEDPYSSALGPRVACLKG QSVSSQVQDLERRLRSCPAGYIHPRGGGKMSPYTNCYAQRYYPMPEEPFCTELNAEEQAL KEKEKGSWTQLTHAEKVALYRLQFNETFAEMNRRSNEWKTVMGCVFFFIGFAALVIWWQR VYVFPPKPITLTDERKAQQLQRMLDMKVNPVQGLASRWDYEKKQWKK >gi568815578r:31565952_31822218|GENSCAN_predicted_CDS_5|684_bp atggctgccggaaagaaaggaggtctcagcctatcccagagctctcagcatgtcgggcct gtcacgacaagtggcctgaatgccgtgtccggagtcccttccacccttggaccccccgcg gttccaggagaagacccttattcctcggctctgggaccccgagtggcctgccttaaggga cagtccgtctcttcccaggttcaggaccttgaaaggaggctccgcagttgtcctgcaggc tatattcacccccgtggtggggggaagatgtccccctacaccaactgctatgcccagcgc tactaccccatgccagaagagcccttctgcacagaactcaacgctgaggagcaggccctg aaggagaaggagaagggaagctggacccagctgacccacgccgaaaaggtggccttgtac cggctccagttcaatgagacctttgcggagatgaaccgtcgctccaatgagtggaagaca gtgatgggttgtgtcttcttcttcattggattcgcagctctggtgatttggtggcagcgg gtctacgtatttcctccaaagccgatcaccttgacggacgagcggaaagcccagcagctg cagcgcatgctggacatgaaggtgaatcctgtgcagggcctggcctcccgctgggactat gagaagaagcagtggaagaagtga >gi568815578r:31565952_31822218|GENSCAN_predicted_peptide_6|47_aa MVDTFVELYGNNAAAESRKGQERFNRWFLTGMTVAGVVLLGSLFSRK >gi568815578r:31565952_31822218|GENSCAN_predicted_CDS_6|144_bp atggtggatacttttgtggaactctatgggaacaatgcagcagccgagagccgaaagggc caggaacgcttcaaccgctggttcctgacgggcatgactgtggccggcgtggttctgctg ggctcactcttcagtcggaaatga >gi568815578r:31565952_31822218|GENSCAN_predicted_peptide_7|284_aa MGQVHVLLWLRKAFILSPVHYPFKVQNSKMPEMVRRGLGQYPEHHHQEAWKQLTVPLDVA CGPPEISERHRDVWFDPEQFCDYLVAGPGARHFPPPSLVFRLQNENSCTYFTGLAAEFNG PWRQGRHQEYQEQLLPERVGGLQSPQKSLPNLFPSGFNITLRWGGGRRQVMFTLLRPELP KGFNQSFESAPGPESQNNNQGSLPDLQGWDRKQSMEVIAMAFPEELEKNLQFRGEHWRLP KKLCGSEFTAKQKDPVGLPDSGLPTGNKEDRLSDRGAEEKRIQE >gi568815578r:31565952_31822218|GENSCAN_predicted_CDS_7|855_bp atgggccaggtccatgtacttctgtggctgaggaaggccttcattctaagtccagtccat tacccctttaaggtgcagaacagtaagatgcctgagatggtgagaagagggctaggtcaa tacccagagcaccaccaccaggaggcctggaagcaactcactgtcccacttgatgtggcc tgtggaccaccagaaatttcagagcgtcacagagatgtgtggttcgatcctgagcagttc tgtgactacctggttgcaggacctggagcaagacactttccccctccaagcctggtgttc cggctgcaaaatgaaaacagctgtacttacttcacaggcctagcagcagagttcaatggt ccatggaggcaaggcagacaccaggaataccaggagcagctgctgcctgagagagtgggg ggcctccagagcccccagaagagtcttcctaacctctttccatcaggttttaacatcaca ctgagatggggaggtggcaggagacaggtgatgtttaccctgctgaggccagagttgcca aaaggcttcaatcaatcctttgaatctgcacctggccctgagtcccaaaataacaaccag ggcagcttacctgatctgcagggctgggacaggaagcagtccatggaagtcatcgcaatg gcatttcctgaagagctggagaaaaacctccagttccgaggggagcactggaggctgccc aaaaagctgtgtggctcagagttcactgctaaacaaaaagacccagtgggattgcctgac tcaggtctgccaactgggaacaaggaagacagactgtcggatagaggggcagaagagaag aggatccaggagtag >gi568815578r:31565952_31822218|GENSCAN_predicted_peptide_8|279_aa MGKLRPACSGETARAVSQVGRQPGRPARVRGAELGREELLRLEATGPMKGDVAPHGSRGS QSNRELVVDFLSYKLSQKGYSWSQFSDVEENRTEAPEGTESEMETPSAINGNPSWHLADS PAVNGATGHSSSLDAREVIPMAAVKQALREAGDEFELRYRRAFSDLTSQLHITPGTAYQS FEQVVNELFRDGVNWGRIVAFFSFGGALCVESVDKEMQVLVSRIAAWMATYLNDHLEPWI QENGGWCIVNLGDNVSTVTDTSKFPFQGGRVLLPITEPG >gi568815578r:31565952_31822218|GENSCAN_predicted_CDS_8|840_bp atggggaaactgaggccggcttgttcgggagagacggcgcgagcagtcagccaggtaggc cggcagccaggtaggccggcccgggtccgcggcgcggaactcggccgcgaagagctcttg cgtctggaagctaccgggccgatgaagggggatgtggccccccacggctcgcggggctcg cagagcaaccgggagctggtggttgactttctctcctacaagctttcccagaaaggatac agctggagtcagtttagtgatgtggaagagaacaggactgaggccccagaagggactgaa tcggagatggagacccccagtgccatcaatggcaacccatcctggcacctggcagacagc cccgcggtgaatggagccactggccacagcagcagtttggatgcccgggaggtgatcccc atggcagcagtaaagcaagcgctgagggaggcaggcgacgagtttgaactgcggtaccgg cgggcattcagtgacctgacatcccagctccacatcaccccagggacagcatatcagagc tttgaacaggtagtgaatgaactcttccgggatggggtaaactggggtcgcattgtggcc tttttctccttcggcggggcactgtgcgtggaaagcgtagacaaggagatgcaggtattg gtgagtcggatcgcagcttggatggccacttacctgaatgaccacctagagccttggatc caggagaacggcggctggtgtattgtaaatcttggcgataatgtgagcacagtgactgac accagcaaattccctttccagggagggagagttctcctgcccatcactgagccaggatga >gi568815578r:31565952_31822218|GENSCAN_predicted_peptide_9|733_aa MSQVKSSYSYDAPSDFINFSSLDDEGDTQNIDSWFEEKANLENKLLGKNGTGGLFQGKTP LRKANLQQAIVTPLKPVDNTYYKEAEKENLVEQSIPSNACSSLEVEAAISRKTPAQPQSS NNKKKPEEEGSAHQDTAEKNASSPEKAKGRHTVPCMPPAKQKFLKSTEEQELEKSMKMQQ EVVEMRKKNEEFKKLALAGIGQPVKKSVSQVTKSVDFHFRTDERIKQHPKNQEEYKEVNF TSELRKHPSSPARVTKGCTIVKPFNLSQGKKRTFDETVSTYVPLAQQVEDFHKRTPNRYH LRSKKDDINLLPSKSSVTKICRDPQTPVLQTKHRARAVTCKSTAELEAEELEKLQQYKFK ARELDPRILEGGPILPKKPPVKPPTEPIGFDLEIEKRIQERESKKKTEDEHFEFHSRPCP TKILEDVVGVPEKKVLPITVPKSPAFALKNRIRMPTKEDEEEDEPVVIKAQPVPHYGVPF KPQIPEARTVEICPFSFDSRDKERQLQKEKKIKELQKGEVPKFKALPLPHFDTINLPEKK VKNVTQIEPFCLETDRRGALKAQTWKHQLEEELRQQKEAACFKARPNTVISQEPFVPKKE KKSVAEGLSGSLVQEPFQLATEKRAKERQELEKRMAEVEAQKAQQLEEARLQEEEQKKEE LARLRRELVTGSMSTDEHKHASVLFYLYLTLYQTGSKVHKANPIRKYQGLEIKSSDQPLT VPVSPKFSTRFHC >gi568815578r:31565952_31822218|GENSCAN_predicted_CDS_9|2202_bp atgtcacaagttaaaagctcttattcctatgatgccccctcggatttcatcaatttttca tccttggatgatgaaggagatactcaaaacatagattcatggtttgaggagaaggccaat ttggagaataagttactggggaagaatggaactggagggctttttcagggcaaaactcct ttgagaaaggctaatcttcagcaagctattgtcacacctttgaaaccagttgacaacact tactacaaagaggcagaaaaagaaaatcttgtggaacaatccattccgtcaaatgcttgt tcttccctggaagttgaggcagccatatcaagaaaaactccagcccagcctcagagttct aacaacaaaaagaagccagaggaagaaggcagtgctcatcaagatactgctgaaaagaat gcatcttccccagagaaagccaagggtagacatactgtgccttgtatgccacctgcaaag cagaagtttctaaaaagtactgaggagcaagagctggagaagagtatgaaaatgcagcaa gaggtggtggagatgcggaaaaagaatgaagaattcaagaaacttgctctggctggaata gggcaacctgtgaagaaatcagtgagccaggtcaccaaatcagttgacttccacttccgc acagatgagcgaatcaaacaacatcctaagaaccaggaggaatataaggaagtgaacttt acatctgaactacgaaagcatccttcatctcctgcccgagtgactaagggatgtaccatt gttaagcctttcaacctgtcccaaggaaagaaaagaacatttgatgaaacagtttctaca tatgtgccccttgcacagcaagttgaagacttccataaacgaacccctaacagatatcat ttgaggagcaagaaggatgatattaacctgttaccctccaaatcttctgtgaccaagatt tgcagagacccacagactcctgtactgcaaaccaaacaccgtgcacgggctgtgacctgc aaaagtacagcagagctggaggctgaggagctcgagaaattgcaacaatacaaattcaaa gcacgtgaacttgatcccagaatacttgaaggtgggcccatcttgcccaagaaaccacct gtgaaaccacccaccgagcctattggctttgatttggaaattgagaaaagaatccaggag cgagaatcaaagaagaaaacagaggatgaacactttgaatttcattccagaccttgccct actaagattttggaagatgttgtgggtgttcctgaaaagaaggtacttccaatcaccgtc cccaagtcaccagcctttgcattgaagaacagaattcgaatgcccaccaaagaagatgag gaagaggacgaaccggtagtgataaaagctcaacctgtgccacattatggggtgcctttt aagccccaaatcccagaggcaagaactgtggaaatatgccctttctcgtttgattctcga gacaaagaacgtcagttacagaaggagaagaaaataaaagaactgcagaaaggggaggtg cccaagttcaaggcacttcccttgcctcattttgacaccattaacctgccagagaagaag gtaaagaatgtgacccagattgaacctttctgcttggagactgacagaagaggtgctctg aaggcacagacttggaagcaccagctggaagaagaactgagacagcagaaagaagcagct tgtttcaaggctcgtccaaacaccgtcatctctcaggagccctttgttcccaagaaagag aagaaatcagttgctgagggcctttctggttctctagttcaggaaccttttcagctggct actgagaagagagccaaagagcggcaggagctggagaagagaatggctgaggtagaagcc cagaaagcccagcagttggaggaggccagactacaggaggaagagcagaaaaaagaggag ctggccaggctacggagagaactggtaactgggagcatgagcactgacgaacacaaacat gcctctgttttattttacctgtaccttaccttgtaccagacaggatctaaggtgcataag gcaaatccaatacgcaagtaccagggtctggagataaagtcaagtgaccagcctctgact gtgcctgtatctcccaaattctccactcgattccactgctaa >gi568815578r:31565952_31822218|GENSCAN_predicted_peptide_10|313_aa MITFLMEGFWAKEVMVKSAECALHTGPSSVVLMFELLIRKAVAKRHSGHGKNAVSSELGP RVTGGGSPSCMAPGLERLELDKQQHTPPYLMATENGAVELGIQNPSTDKAPKGPTGERPL AAGKDPGPPDPKKAPDPPTLKKDAKAPASEKGDGTLAQPSTSSQGPKGEGDRGGGPAEGS AGPPAALPQQTATPETSVKKPKAEQGASGSQDPGKPRAVAAEGFTSVFSPSSSEKLLAKK PPSEASELTFEGVPMTHSPTDPRPAKAEEGKNILAESQKEVGEKTPGQAGQAKMQGDTSR GIEFQAVPSEKSE >gi568815578r:31565952_31822218|GENSCAN_predicted_CDS_10|939_bp atgatcacattcctcatggagggattctgggccaaggaggttatggtcaagagtgcggag tgtgctctccacactggcccctcttcggtggttcttatgtttgagctccttataagaaag gcagtggccaaaaggcactcgggccatgggaagaatgccgtgtcctctgaactggggcca agagtgacaggtggcggcagtcccagctgcatggcccctggtctagaaagacttgagtta gacaagcagcagcacacgcctccctacctcatggcgacagaaaatggagcagttgagctg ggaattcagaacccatcaacagacaaggcacctaaaggtcccacaggtgaaagacccctg gctgcagggaaagaccctggccccccagacccaaagaaagctccggatccacccaccctg aagaaagatgccaaagcccctgcctcagagaaaggggatggtaccctggcccaaccctca actagcagccaaggccccaaaggagagggtgacaggggcggggggcccgcggagggcagt gctgggcccccggcagccctgccccagcagactgcgacacctgagaccagcgtcaagaag cccaaggctgagcagggagcctcaggcagccaggatcctggaaagcccagggctgtggcc gctgagggcttcacctctgtgttctcaccttctagttctgagaagctgctggccaagaag cccccaagcgaggcatcagagctcacctttgaaggggtgcccatgacccacagccccacg gatcccaggccagccaaggcagaagaaggaaagaacatcctggcagagagccagaaggaa gtgggagagaaaaccccaggccaggctggccaggctaagatgcaaggggacacctcgagg gggattgagttccaggctgttccctcagagaaatccgag