GENSCAN 1.0 Date run: 4-Nov-116 Time: 02:25:11 Sequence gi568815595r:185317498_185518778 : 201281 bp : 41.18% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 12988 13113 126 0 0 69 92 27 0.005 1.16 1.02 Intr + 36359 36458 100 1 1 83 86 75 0.285 5.56 1.03 Term + 36715 36956 242 2 2 64 44 100 0.263 -1.60 1.04 PlyA + 37046 37051 6 1.05 2.03 PlyA - 40181 40176 6 1.05 2.02 Term - 46220 45779 442 1 1 62 43 182 0.564 4.84 2.01 Init - 46423 46317 107 1 2 83 67 62 0.774 3.34 2.00 Prom - 50987 50948 40 -6.75 3.03 PlyA - 51917 51912 6 1.05 3.02 Term - 60502 60300 203 2 2 69 47 193 0.599 9.87 3.01 Init - 70955 70880 76 2 1 90 57 61 0.175 2.52 3.00 Prom - 80331 80292 40 -4.85 4.04 PlyA - 81585 81580 6 1.05 4.03 Term - 100252 99998 255 1 0 26 37 372 0.078 20.50 4.02 Intr - 101194 100589 606 1 0 28 75 677 0.059 52.12 4.01 Init - 103354 103346 9 1 0 66 80 0 0.228 -2.53 4.00 Prom - 109157 109118 40 -6.95 5.00 Prom + 110003 110042 40 -6.75 5.01 Init + 111085 111559 475 0 1 110 121 381 0.988 39.28 5.02 Intr + 119950 120133 184 2 1 73 58 138 0.375 7.22 5.03 Intr + 125948 126139 192 2 0 18 78 109 0.055 0.59 5.04 Intr + 130292 130450 159 2 0 49 101 147 0.996 10.28 5.05 Intr + 132403 132561 159 1 0 62 83 108 0.983 5.88 5.06 Intr + 133790 133898 109 2 1 88 111 129 0.998 14.57 5.07 Intr + 146053 146162 110 0 2 64 76 165 0.999 11.06 5.08 Intr + 148250 148366 117 2 0 99 44 140 0.997 9.36 5.09 Intr + 149329 149466 138 1 0 103 93 183 0.919 18.96 5.10 Intr + 155478 156264 787 0 1 100 98 372 0.657 29.94 5.11 Intr + 159829 159899 71 0 2 100 83 125 0.998 10.26 5.12 Intr + 162735 163032 298 0 1 123 59 480 0.999 44.55 5.13 Term + 164858 164959 102 1 0 86 39 117 0.923 3.90 5.14 PlyA + 165155 165160 6 -1.75 6.09 PlyA - 165206 165201 6 1.05 6.08 Term - 165903 165853 51 0 0 121 48 62 0.077 1.95 6.07 Intr - 168769 168645 125 2 2 25 58 82 0.036 -1.72 6.06 Intr - 171550 171448 103 1 1 52 110 57 0.290 3.13 6.05 Intr - 175091 174914 178 1 1 55 66 133 0.554 6.80 6.04 Intr - 175268 175222 47 2 2 88 39 37 0.307 -4.91 6.03 Intr - 177818 177657 162 2 0 88 116 105 0.963 12.55 6.02 Intr - 179484 179331 154 2 1 87 65 182 0.919 14.95 6.01 Init - 181464 181346 119 0 2 99 96 157 0.845 15.32 6.00 Prom - 181998 181959 40 -8.55 7.00 Prom + 182118 182157 40 -7.25 7.01 Init + 184377 184520 144 2 0 19 61 138 0.487 4.37 7.02 Intr + 185947 186086 140 0 2 89 110 98 0.678 10.44 7.03 Intr + 188998 189071 74 1 2 81 67 65 0.871 1.83 7.04 Term + 190547 190926 380 0 2 48 54 209 0.358 7.47 7.05 PlyA + 192192 192197 6 1.05 8.04 PlyA - 192977 192972 6 1.05 8.03 Term - 194099 194018 82 1 1 84 55 79 0.764 0.39 8.02 Intr - 197024 196913 112 0 1 122 89 90 0.961 11.02 8.01 Intr - 199665 199570 96 1 0 67 73 54 0.631 0.86 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 125985 126139 155 2 2 54 78 125 0.882 7.60 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:185317498_185518778|GENSCAN_predicted_peptide_1|155_aa PLELINFSKSIASAGPSSSFTCWLRRRSGGLGGVLGKKPVWQVYLLVETDHNHTPPPPTP NGHSIKEKKDLIQALGDLDVSLRDRTYLSAKGLLLPLCVNRGQQRASPGSLTCRLQTGAT AIQVPREANAWGVYALPPFEAYRLGYRVMLSAALC >gi568815595r:185317498_185518778|GENSCAN_predicted_CDS_1|468_bp cctcttgagctgattaacttctccaagtcaatagcttctgctgggccaagttcctcattt acttgttggttgaggagaaggagtggagggctggggggtgtcctgggaaagaagccagtg tggcaggtctacctccttgttgaaactgaccataaccacaccccgcccccgccaacccca aatggtcattccatcaaagagaagaaagacttaattcaagctctgggggacttagatgtt tccctaagggacaggacatatctatctgctaagggtctgctcctacctctctgtgtcaac agagggcagcagagagcatcacccggtagcctaacctgtaggcttcaaaccggggcaacg gcaatccaagtgcctagggaagcaaatgcctggggagtgtacgcattgcccccgtttgaa gcctataggttaggctaccgggtgatgctctccgctgccctctgttga >gi568815595r:185317498_185518778|GENSCAN_predicted_peptide_2|182_aa MAPVIRAFEKWLNKYSSSPNCNRKILTVTQYETKASTFLVFWAFLLIPSTSSLREMNLPL RKAKQASEVTCRTVNEKCPRLAGRQAKENQLSVSEAPSCNGVSGFSPPLSLCLSHGVALN NISFLITSNHTFSVCPCEINTEETGKTECNTHNNEEFLHMLFHFLALKRCKVNFKSLAEL EK >gi568815595r:185317498_185518778|GENSCAN_predicted_CDS_2|549_bp atggctcccgtcataagagcttttgaaaaatggttgaacaagtattcatcttcaccaaat tgcaatcgcaagatactgactgttacacaatatgaaacaaaggcaagcacctttctggtg ttctgggcttttctgctaataccatccacatcatctctcagagaaatgaatctcccttta aggaaagccaagcaggcatccgaggtaacgtgcagaaccgtaaatgaaaaatgtccacgt ctagcaggcagacaagcaaaggaaaaccagctgtcagtttcagaggctcctagctgtaac ggggtttcgggtttctctccgcctctgtctctctgtctctctcacggtgtcgccctcaat aatatcagctttctaatcacctcaaatcacacattctctgtctgtccttgtgaaataaat actgaagaaacaggaaaaacagagtgcaatactcacaacaacgaagaattcctacacatg ctgttccattttcttgctttgaaaagatgtaaggtgaattttaaaagccttgcagaattg gaaaaataa >gi568815595r:185317498_185518778|GENSCAN_predicted_peptide_3|92_aa MARYSLGLLGSSNAPISSSRVAGTTGACYKCRKSGHWAKECPQPGIPPKPCPICAGPHWK SDCPTRPAATPRAPGTLAQGPFPDLLGLAAED >gi568815595r:185317498_185518778|GENSCAN_predicted_CDS_3|279_bp atggctcgctacagcctcggcctcctgggctcaagcaatgctcccatctcatcctcccga gtagctggaactacaggagcttgctacaagtgccggaaatctggccactgggccaaggaa tgcccgcagcccgggattcctcccaagccatgtcccatctgtgcgggaccccactggaaa tcggactgtccaactcgcccggcagccactcccagagcccctggaactctggcccaaggc cccttcccagatcttctcggcttagcggctgaagactga >gi568815595r:185317498_185518778|GENSCAN_predicted_peptide_4|289_aa MEEAPIRPDIVNFVHTNLRKNNRQPYAASELAGHQTSAESWGTGRAMARIPRVRGGGTHR SGQGAFGNMCRGGRRFAPTKTWRRWHRRVNTTQKRYAICSALAASALPALVMSKGHRIEE VPELPLVFEDKVEGYKKTKEAVLLLKKLKAWNDIEKVYASQRMRAGKGKMRNRRRIQRRG LCIIYNEDNGIIKAFRNIPGITLLNARNHKLRVDKAAAAAAALPAKSDEKAAVAGKKPVV GKKGKKAAVGVKKQKKPLVGKKAAATKKPAPEKKPAEKKPTTEEKKPAA >gi568815595r:185317498_185518778|GENSCAN_predicted_CDS_4|870_bp atggaggaggctcctattcgaccagatattgtgaactttgttcacaccaacttgcgcaaa aacaacagacagccctacgctgccagtgaattagcaggtcatcagaccagtgctgagtct tggggtactggcagagctatggctcgaattcccagagttcgaggtggtgggactcaccgc tctggccagggtgcttttggaaacatgtgtcgtggaggccgaaggtttgcaccaaccaaa acctggcgccgttggcatcgtagagtgaacacaacccaaaaacgatatgccatctgttct gccctggctgcctcagccctaccagcactggtcatgtctaaaggtcatcgtattgaggaa gttcctgaacttcctttggtatttgaagataaagttgaaggctacaagaagaccaaggaa gctgttttgctccttaagaaacttaaagcctggaatgatatcgaaaaggtctatgcctct cagcgaatgagagctggcaaaggcaaaatgagaaaccgtcgccgtatccagcgcaggggc ctgtgcatcatctataatgaggataatggtatcatcaaggccttcagaaacatccctgga attactctgcttaatgccaggaatcacaagctccgggtggataaggcagctgctgcagca gcagcactaccagccaaatcagatgaaaaggcggcggttgcaggcaagaagcctgtggta ggtaagaaaggaaagaaggctgctgttggtgttaagaagcagaagaagcctctggtggga aaaaaggcagcagctaccaagaaaccagcccctgaaaagaagcctgcagaaaagaaacct actacagaggaaaagaagcctgctgcataa >gi568815595r:185317498_185518778|GENSCAN_predicted_peptide_5|966_aa MANFQEHLSCSSSPHLPFSESKTFNGLQDELTAMGNHPSPKLLEDQQEKGMVRTELIESV HSPVTTTVLTSVSEDSRDQFENSVLQLREHDESETAVSQGNSNTVDGESTSGTEDIKIQF SRSGSGSGGFLEGLFGCLRPVWNIIGKAYSTDYKLQQQDTWEVPFEEISELQWLGSGAQG AVFLGKFRAEEVAIKKVREQNETDIKHLRKLKHPNIIAFKGVCTQAPCYCIIMEYCAHGQ LYEVLRAGRKITPRLLVDWSTGIASGMNYLHLHKIIHRDLKSPNVLVTHTDAVKISDFGT SKELSDKSTKMSFAGTVAWMAPEVIRNEPVSEKVDIWSFGVVLWELLTGEIPYKDVDSSA IIWGVGSNSLHLPVPSTCPDGFKILMKQTWQSKPRNRPSFRQTLMHLDIASADVLATPQE TYFKSQAEWREEVKKHFEKIKSEGTCIHRLDEELIRRRREELRHALDIREHYERKLERAN NLYMELSAIMLQLEMREKELIKREQAVEKKYPGTYKRHPVRPIIHPNAMEKLMKRKGVPH KSGMQTKRPDLLRSEGIPTTEVAPTASPLSGSPKMSTSSSKSRYRSKPRHRRGNSRGSHS DFAAILKNQPAQENSPHPTYLHQAQSQYPSLHHHNSLQQQYQQPPPAMSQSHHPRLNMHG QDIATCANNLRYFGPAAALRSPLSNHAQRQLPGSSPDLISTAMAADCWRSSEPDKGQAGP WGCCQADAYDPCLQCRPEQYGSLDIPSAEPVGRSPDLSKSPAHNPLLENAQSSEKTEENE FSGCRSESSLGTSHLGTPPALPRKTRPLQKSGDDSSEEEEGEVDSEVEFPRRQRPHRCIS SCQSYSTFSSENFSVSDGEEGNTSDHSNSPDELADKLEDRLAEKLDDLLSQTPEIPIDIS SHSDGLSDKECAVRRVKTQMSLGKLCVEERGYENPMQFEESDCDSSDGECSDATVRTNKH YSSATW >gi568815595r:185317498_185518778|GENSCAN_predicted_CDS_5|2901_bp atggccaactttcaggagcacctgagctgctcctcttctccacacttacccttcagtgaa agcaaaaccttcaatggactacaagatgagctcacagctatggggaaccacccttctccc aagctgctcgaggaccagcaggaaaaggggatggtacgaacagagctaatcgagagcgtg cacagccccgtcaccacaacagtgttgacgagcgtaagtgaggattccagggaccagttt gagaacagcgttcttcagctaagggaacacgatgaatcagagacggcggtgtctcagggg aacagcaacacggtggacggagagagcacaagcggaactgaagacataaagattcagttc agcaggtcaggcagtggcagtggtgggtttcttgaaggactatttggatgcttaaggcct gtatggaatatcattgggaaggcatattccactgattacaaattgcagcagcaagatact tgggaagtgccatttgaggagatctcagagctgcagtggctgggtagtggagcccaagga gcggtcttcttgggcaagttccgggcggaagaggtggccatcaagaaagtgagagaacag aatgagacggatatcaagcatttgaggaagttgaagcaccctaacatcatcgcattcaag ggtgtttgtactcaggccccatgttattgtattatcatggaatactgtgcccatggacaa ctctacgaggtcttacgagctggcaggaagatcacacctcgattgctagtagactggtcc acaggaattgcaagtggaatgaattatttgcacctccataaaattattcatcgtgatctc aaatcacctaatgttttagtgacccacacagatgcggtaaaaatttcagattttggtaca tctaaggaactcagtgacaaaagtaccaagatgtcatttgctggcacggtcgcatggatg gcgccagaggtgatacggaatgaacctgtctctgaaaaagttgatatatggtcttttgga gtggtgctttgggagctgctgacaggagagatcccttacaaagatgtagattcttcagcc attatctggggtgttggaagcaacagcctccaccttccagttccttccacttgccctgat ggattcaaaatccttatgaaacagacgtggcagagtaaacctcgaaaccgaccttctttt cggcagacactcatgcatttagacattgcctctgcagatgtacttgccaccccacaagaa acttacttcaagtctcaggctgaatggagagaagaagtgaaaaaacattttgagaagatc aaaagtgaaggaacttgtatacaccggttagatgaagaactgattcgaaggcgcagagaa gagctcaggcatgcgctggatattcgtgaacactatgagcggaagcttgagcgggcgaat aatttatacatggaattgagtgccatcatgctgcagctagaaatgcgggagaaggagctc attaagcgtgagcaagcagtggaaaagaagtatcctgggacctacaaacgacaccctgtt cgtcctatcatccatcccaatgccatggagaaactcatgaaaaggaaaggagtgcctcac aaatctgggatgcagaccaaacggccagacttgttgagatcagaagggatccccaccaca gaagtggctcccactgcatcccctttgtccggaagtcccaaaatgtccacttctagcagc aagagccgatatcgaagcaaaccacgccaccgccgagggaatagcagaggcagccatagt gactttgccgcaatcttgaaaaaccagccagcccaggaaaattcaccccatcccacttac ctgcaccaagctcaatcccaatacccttctcttcatcaccataattctctgcagcagcaa taccagcagccccctcctgccatgtcccagagtcaccatcccagactcaatatgcacgga caggacatagcaacctgcgccaacaacctgaggtatttcggcccagcagcagccctgcgg agcccactcagcaaccatgctcagagacagctgcccggctcgagccctgacctcatctcc acagccatggctgcagactgctggagaagttctgagcctgacaagggccaagctggtccc tggggctgttgccaggctgacgcttatgacccctgccttcagtgcaggccagaacagtat gggtccttagacataccctctgctgagccagtggggaggagccctgacctttccaagtca ccagcacataatcctctcttggaaaacgcccagagttctgagaaaacggaagaaaatgaa ttcagcggctgtaggtctgagtcatccctcggcacctctcatctcggcacccctccagcg ctacctcgaaaaacaaggcctctgcagaagagtggagatgactcctcagaagaggaagaa ggggaagtagatagtgaagttgaatttccacgaagacagaggccccatcgctgtatcagc agctgccagtcatattcaacctttagctctgagaatttctctgtgtctgatggagaagag ggaaataccagtgaccactcaaacagtcctgatgagttagctgataaacttgaagaccgc ttggcagagaagctagacgacctgctgtcccagacgccagagattcccattgacatatcc tcacactcggatgggctctctgacaaggagtgtgccgtgcgccgtgtgaagactcagatg tctctgggcaagctgtgtgtggaggaacgtggctatgagaaccccatgcagtttgaagaa tcggactgtgactcttcagatggggagtgttctgatgccacagttaggaccaataaacac tacagctctgctacctggtaa >gi568815595r:185317498_185518778|GENSCAN_predicted_peptide_6|312_aa MRPLLGLLLVFAGCTFALYLLSTRLPRGRRLGSTEEAGGRSLWFPSDLAELRELSEVLRE YRKEHQAYVFLLFCGAYLYKQGFAIPGSSFLNVLAGALFGPWLGLLLCCVLTSVGATCCY LLSSIFGKQLVVSYFPDKVALLQRKKRNGLSECSKPIGISGQPVQPARLLESSGFDDIFE TVQNVNSTSFKKLFVNNGTSLFTLAVGYSEPNLGPTFGKQSFLFPLPFNTEREPASENNT GSWGLTVQETRGEPVNPHKVYDDNSITDNQDENSKKTDQNYEVSGVGFLGRPSLKDEVVS RQKGNELQGNGT >gi568815595r:185317498_185518778|GENSCAN_predicted_CDS_6|939_bp atgcgcccgcttctcggcctccttctggtcttcgccggctgcaccttcgccttgtacttg ctgtcgacgcgactgccccgcgggcggagactgggctccaccgaggaggctggaggcagg tcgctgtggttcccctccgacctggcagagctgcgggagctctctgaggtccttcgagag taccggaaggagcaccaggcctacgtgttcctgctcttctgcggcgcctacctctacaaa cagggctttgccatccccggctccagcttcctgaatgttttagctggtgccttgtttggg ccatggctggggcttctgctgtgctgtgtgttgacctcggtgggtgccacatgctgctac ctgctctccagtatttttggcaaacagttggtggtgtcctactttcctgataaagtggcc ctgctgcagagaaagaagagaaatggactatcagagtgttccaaaccaatcggaatctca ggccagcctgttcagccagccagacttttggaatccagtggatttgatgacatctttgag acagtacagaatgtgaatagtacttcttttaaaaaactgtttgtcaacaatggcacttct ctctttactttggctgttgggtactcagaaccaaatttggggcccacttttggcaaacag tcattcttgttcccattgccgttcaatacagagagggagccggcatcagaaaacaacact gggagctggggactgacagtacaggagaccagaggggagccagttaatccacataaggtg tatgatgacaatagcataacagataatcaagatgagaattcaaaaaaaactgatcaaaat tacgaggtatcaggggtaggttttctaggaagaccatcactgaaagatgaagtggtcagc aggcagaaaggaaatgagctacaaggaaatggcacttga >gi568815595r:185317498_185518778|GENSCAN_predicted_peptide_7|245_aa MDEWMNTCSDEITCAKFTAVSAEQEQAFVCHHVRPCFFLCPSLPFDMQAAAAPIEATHLQ HRPTQSDKGQLQSSVNCSPFPGTCWIGKQVGEKNRKENKKVTVLIKTQTNKTKKKPGLPV PAAALLAVSPSPSPPAPPCSNSTLEKAEGQHVAFLPSLSKQPEREAGVDRERLSYCKLDK PKCKSIFTFIKHLIDVAPMGFQKTLHRFLPPKPTQQPHEVDVSFVPFYVWELEECGTENA ELSPR >gi568815595r:185317498_185518778|GENSCAN_predicted_CDS_7|738_bp atggatgagtggatgaacacatgttcagatgaaatcacttgtgcaaagtttacagctgtt agtgctgaacaggaacaagcctttgtctgccatcatgtgcggccatgctttttcctctgc ccttcactgccattcgacatgcaggctgcagcagcccccatcgaggccactcatttacag cacaggcccacacagtctgacaaaggacagctgcaatcctctgtcaactgcagtcctttc ccaggcacatgctggattgggaaacaggtgggagaaaagaacaggaaagagaacaaaaaa gtgactgtgctgatcaaaacacagaccaataaaactaagaagaaaccagggctgccggtc ccagcagccgccctcttagctgtcagtccatccccatctcctccagctcctccatgctcc aacagcactttagagaaggcagagggccagcacgtggctttcttgccttccctgagcaaa cagcccgagagagaagctggtgttgatagagagaggctctcttattgcaaattagataag cccaaatgtaaaagcatattcacatttatcaaacatctcatagatgtggcacctatggga tttcaaaaaacccttcatagatttttgcctcccaaacccacacagcagccccatgaggta gacgtcagtttcgttcccttctacgtgtgggaactggaggaatgtggcacggagaatgca gagttgagtccacggtga >gi568815595r:185317498_185518778|GENSCAN_predicted_peptide_8|96_aa XYYADNWKDHLRGKDPPMTKAFFDTAEESPFCMYHYFVDIITWNKNVRRGDITIKLRDKA GNTTESKINQISNRPKVQAQDSPNEVKVPCPSGEVS >gi568815595r:185317498_185518778|GENSCAN_predicted_CDS_8|291_bp ngctattatgctgataattggaaagaccatctaagggggaaagatcctccaatgacgaag gcattctttgacacagctgaggagagcccattctgcatgtatcattactttgtggatatt ataacatggaacaagaatgtaagaagaggggacattaccatcaaattgagagacaaagct ggaaacaccacagaatccaaaatcaatcagatctctaataggcccaaggtacaagctcag gattctccgaatgaagttaaggtcccttgcccatccggagaggtgagctga