GENSCAN 1.0 Date run: 6-Nov-116 Time: 21:57:15 Sequence gi568815590r:94781635_94994121 : 212487 bp : 41.87% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1535 1638 104 1 2 58 31 110 0.429 2.06 1.02 Intr + 2036 2168 133 2 1 78 64 110 0.972 7.33 1.03 Intr + 6260 6418 159 1 0 85 113 47 0.938 6.16 1.04 Intr + 41244 41547 304 2 1 78 96 110 0.418 6.04 1.05 Intr + 41727 41927 201 1 0 56 88 107 0.739 5.84 1.06 Intr + 43259 43433 175 0 1 58 63 68 0.677 -0.62 1.07 Intr + 46088 46159 72 2 0 114 75 25 0.721 1.40 1.08 Intr + 47341 47392 52 1 1 85 103 29 0.973 2.09 1.09 Intr + 50358 50540 183 2 0 50 66 179 0.989 10.96 1.10 Intr + 54890 54997 108 1 0 75 84 46 0.844 2.46 1.11 Intr + 56829 56984 156 2 0 92 110 70 0.996 8.89 1.12 Intr + 59857 59957 101 0 2 73 98 75 0.781 4.99 1.13 Intr + 68282 68457 176 2 2 48 71 34 0.381 -3.64 1.14 Intr + 69919 70052 134 2 2 45 92 49 0.605 0.44 1.15 Intr + 72171 72281 111 2 0 75 113 32 0.818 4.06 1.16 Intr + 75143 75344 202 1 1 87 99 183 0.978 17.24 1.17 Intr + 77877 77998 122 1 2 108 63 32 0.970 1.99 1.18 Intr + 83872 84066 195 0 0 62 84 139 0.982 9.59 1.19 Intr + 85506 85562 57 2 0 92 114 -6 0.593 0.56 1.20 Intr + 85642 85703 62 0 2 93 108 23 0.974 1.41 1.21 Intr + 90250 90368 119 1 2 89 111 96 0.994 11.19 1.22 Intr + 91740 91843 104 1 2 115 68 76 0.891 7.27 1.23 Term + 98484 98600 117 2 0 49 47 98 0.587 -0.64 1.24 PlyA + 98646 98651 6 1.05 2.05 PlyA - 98880 98875 6 -0.45 2.04 Term - 100111 99998 114 1 0 61 43 111 0.985 1.49 2.03 Intr - 100655 100498 158 0 2 106 84 57 0.936 5.91 2.02 Intr - 101258 101147 112 2 1 79 110 73 0.992 7.63 2.01 Init - 101908 101903 6 1 0 53 73 10 0.257 -3.64 2.00 Prom - 102328 102289 40 -7.55 3.00 Prom + 105992 106031 40 -4.45 3.01 Sngl + 110198 110473 276 1 0 36 32 337 0.986 18.03 3.02 PlyA + 110486 110491 6 1.05 4.07 PlyA - 111689 111684 6 -0.45 4.06 Term - 112976 112767 210 2 0 65 49 144 0.159 4.51 4.05 Intr - 114992 114818 175 1 1 62 45 98 0.189 1.92 4.04 Intr - 122042 121925 118 0 1 38 85 125 0.614 5.80 4.03 Intr - 124211 123894 318 0 0 40 61 134 0.521 0.91 4.02 Intr - 125081 124989 93 0 0 72 66 81 0.767 3.32 4.01 Init - 136342 136267 76 1 1 85 76 32 0.248 3.00 4.00 Prom - 137245 137206 40 -4.75 5.03 PlyA - 137286 137281 6 1.05 5.02 Term - 138104 137874 231 2 0 61 44 144 0.901 2.79 5.01 Init - 139236 139126 111 0 0 71 50 125 0.928 7.26 5.00 Prom - 139807 139768 40 -9.15 6.07 PlyA - 140287 140282 6 1.05 6.06 Term - 141499 141336 164 2 2 94 41 101 0.555 3.12 6.05 Intr - 149094 148862 233 2 2 105 22 150 0.187 6.59 6.04 Intr - 154919 154725 195 1 0 49 81 112 0.356 4.21 6.03 Intr - 158586 158226 361 1 1 -4 98 238 0.372 8.95 6.02 Intr - 159298 159196 103 1 1 48 82 148 0.057 9.03 6.01 Init - 163001 162945 57 2 0 60 81 55 0.212 1.37 6.00 Prom - 165797 165758 40 -7.55 7.08 PlyA - 166615 166610 6 1.05 7.07 Term - 167192 166918 275 0 2 29 46 252 0.982 9.75 7.06 Intr - 167951 167816 136 2 1 115 36 39 0.549 0.62 7.05 Intr - 168400 168339 62 2 2 81 75 47 0.318 0.23 7.04 Intr - 176494 176295 200 0 2 64 89 118 0.411 7.57 7.03 Intr - 179096 179006 91 0 1 68 81 51 0.121 0.53 7.02 Intr - 182115 181932 184 0 1 79 37 108 0.276 3.24 7.01 Intr - 202505 202390 116 0 2 38 100 86 0.008 4.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:94781635_94994121|GENSCAN_predicted_peptide_1|1048_aa MIQSGGVLYVLPGDRHMGKDSEGAGGEVDLEEAGLRQAPVAAVFAGSPQLMGAIKLCTGW MVTSLPLYNDDDLLKRNENIYQIYSKRSAEDIYKILTSYKANYLIVEDAICNEVGPMRGC RVKDLLDIANGHEASGKLVFSAGRREHLRRGPRCEEISEMGTGKLKAQEINFAWVTTRTL RFAFTKILRPVATLDEVVSFSRWRTVREHGKPTRDPASCGAVSYGNREEAWFEGTGPHPS VRLEPGSGPGGSGGGGRMSAEAADREAATSSRPCTPPQTCWFEFLLEESLLEKHLRKPCP DPAPVQLIVQFLEQASKPSVNEQNQVQPPPDNKRNRILKLLALKVAAHLKWDLDILEKRA IRTIVQSSFPVKQAKPGPPQLSVMNQMQQEKELTENILKVLKEQAADSILVLEAALKLNK DLYVHTMRTLDLLAMEPGMVNGETESSTAGLKVKTEEMQCQVCYDLGAAYFQQGSTNSAV YENAREKFFRTKELIAEIGSLSLHCTIDEKRLAGYCQACDVLVPSSDSTSQQLTPYSQVH ICLRSGNYQEVIQIFIEDNLTLSLPVQFRQSVLRELFKKAQQGNVCLGLEDLQYVFMISS HELFITLLKDEERKLLVDQMRKRSPRVNLCIKPVTSFYDIPASASVNIGQLEHQLILSVD PWRIRQILIELHGMTSERQFWTVSNKWEVPSVYSGVILGIKDNLTRDLVYILMAKGLHCS TVKDFSHAKQLFAACLELVTEFSPKLRQVMLNEMLLLDIHTHEAGTGQAGERPPSDLISR VRGYLEMRLPDIPLRQVIAEECVAFMLNWRENEYLTLQVPAFLLQSNPYVKLGQLLAATC KELPGPKESRRTAKDLWEVVVQICSVSSQHKRGNDGRVSLIKQRESTLGIMYRYVLEPLV LTIILSLFVKLHNVREDIVNDITAEHISIWPSSIPNLQSVDFEAVAITVKELVRYTLSIN PNNHSWLIIQADIYFATNQYSAALHYYLQAGAVCSDFFNKAVPPDVYTDQIKAIGQTELN ASNPEEVLQLAAQRRKKKFLQAMAKLYF >gi568815590r:94781635_94994121|GENSCAN_predicted_CDS_1|3147_bp atgattcagagtggaggagtcctatatgttctgcctggagacagacacatgggtaaagac agtgaaggagcaggaggagaggtggatttggaagaagctgggttaaggcaagctccagtt gcagctgtgtttgcagggagtccacagttaatgggtgcgattaaattatgcactggatgg atggtgacaagtttgcctctttacaatgatgatgatcttctcaagagaaatgaaaatatc taccaaatctattcaaagcgatctgctgaggatatttataaaatactgacatcttacaaa gctaattacctaattgtagaggatgctatctgcaatgaggtgggacccatgagaggctgt agggttaaagatttattagacattgcaaatggccacgaagcgagtgggaagctggtgttc tcggcaggacggcgggagcatctgaggagaggacccaggtgcgaggaaatctcagaaatg ggaacggggaaattgaaagcacaggagataaactttgcttgggtcacaactagaaccctt cgcttcgcctttacgaaaatattgaggcccgttgcaaccctagatgaggttgtctcattt tctcgctggagaactgtcagggaacacggaaaacccacccgcgacccggcttcatgcggc gccgtctcctatggcaaccgggaggaggcgtggttcgaaggcaccggcccgcatccaagt gtcaggttggagccgggaagcggccctggtggtagcggcggcgggggcaggatgagcgcg gaggcggcggaccgggaggcggccacctccagccggccctgcaccccgccgcagacctgc tggtttgagtttctgctggaggagtcactgttggagaaacatctgcgcaagccctgcccg gatcctgcaccagttcaacttatagttcagtttttggaacaggcttccaaaccttcagtt aatgaacaaaaccaagttcaacctccgcctgataacaagagaaatcgtattttaaaacta cttgctcttaaagttgctgcacatttgaagtgggacttagacatattagagaaaagggca attaggacaattgttcaaagtagttttccagtcaaacaggcaaaacccggaccccctcag ttaagtgttatgaatcaaatgcaacaggaaaaagagctaacagaaaacattttgaaagtg ctcaaagaacaagctgctgattctattttggtactagaagcagccctaaaattaaacaag gatctttatgtccatacgatgagaactctagatttattggccatggaaccaggcatggta aatggagaaactgagagttctactgctggattgaaagtcaaaaccgaagaaatgcagtgc caggtgtgctatgatttgggcgcagcatacttccagcaaggctccacaaattcagctgtc tatgaaaatgccagggaaaaattttttagaaccaaagaactaattgcagagataggttca ttatctcttcattgtaccatagatgagaagcggttagctggctattgtcaagcatgtgat gttcttgtaccttcttctgatagtacatctcaacagttgactccatatagtcaagtccat atttgtttgagatctggcaactatcaggaggtaatacagattttcattgaagacaactta accttgagtttacctgtccagttccgacagtcagtcctaagagaactctttaagaaagct caacaggggaatgtgtgtctggggttggaagatctgcagtatgttttcatgatttcttca catgagcttttcattacattgttgaaagatgaagaacgaaagctacttgttgatcagatg aggaagagatcccctagagtaaatctgtgcattaaacctgtaacttcattttatgatatc ccagcttcagcaagtgtcaacattggtcagttagagcatcaacttatattgtcagtggat ccttggaggattagacaaattttaattgaattacatggtatgacttcagagcgccagttc tggacagtgtctaataagtgggaagtaccttctgtctatagtggtgttatcctgggaatt aaagacaatttaacaagagatttggtttatattcttatggccaaaggtttgcactgcagt actgttaaggacttttcccatgctaaacagctctttgctgcttgtttggagttggtaaca gagttctcaccgaagcttcgtcaggtcatgctgaatgagatgttgcttttggatattcat acacacgaagctgggacagggcaggcaggagagagaccgccatccgaccttataagtaga gtacgaggctatctggaaatgaggcttcctgatattcctcttcgtcaagttatagctgag gaatgtgttgcctttatgttaaactggagagaaaatgaataccttacactccaagttcct gcatttttgcttcagagtaatccatatgtaaagcttggacagcttttagcagctacatgc aaagaacttccaggccctaaagaaagtagacggactgccaaagacctttgggaagttgtt gttcaaatctgtagtgtgtccagtcagcacaaacgaggaaatgatggcagagttagttta ataaaacagagggaatctacgttaggtatcatgtatcggtatgtattggaaccactcgtt ttgactattattttatcactctttgtgaaacttcacaatgttcgggaggacattgtgaat gatattacagctgaacacatttctatttggccatcttccattcccaacctccagtctgtg gactttgaagctgtggcaatcacagtgaaagagctagttcgatatacactcagtataaat ccaaataaccattcttggttaattatccaggcagatatttactttgcaacgaatcagtat tcagcagctcttcactattacctccaggcaggagctgtgtgttctgacttctttaacaag gctgtgccccctgatgtttatacagaccagatcaaagccatcggccagacagagttgaat gcaagcaatccagaagaagtgttacagctggcagcgcagagaaggaaaaaaaagtttctc caagcaatggcaaaactttacttttaa >gi568815590r:94781635_94994121|GENSCAN_predicted_peptide_2|129_aa MKLLDLCILAIDSLEFQYRILTAAALCHFTSIEVVKKASGLEWDSISECVDWMVPFVNVV KSTSPVKLKTFKKIPMEDRHNIQTHTNYLAMLEEVNYINTFRKGGQLSPVCNGGIMTPPK STEKPPGKH >gi568815590r:94781635_94994121|GENSCAN_predicted_CDS_2|390_bp atgaagcttttagatctgtgtattctagccattgattcattagagttccagtacagaata ctgactgctgctgccttgtgccattttacctccattgaagtggttaagaaagcctcaggt ttggagtgggacagtatttcagaatgtgtagattggatggtaccttttgtcaatgtagta aaaagtactagtccagtgaagctgaagacttttaagaagattcctatggaagacagacat aatatccagacacatacaaactatttggctatgctggaggaagtaaattacataaacacc ttcagaaaagggggacagttgtcaccagtgtgcaatggaggcattatgacaccaccgaag agcactgaaaaaccaccaggaaaacactaa >gi568815590r:94781635_94994121|GENSCAN_predicted_peptide_3|91_aa MLKNAESNAELKSLDVDSLVIEHLQVNKAPTMCQRTYRAHGRINSHVISPCHIEMIFSEK EQIVPKPEEAVAQKKKISQKKLKKQKLMAQE >gi568815590r:94781635_94994121|GENSCAN_predicted_CDS_3|276_bp atgcttaaaaatgcagagagtaatgctgaacttaagagtttagatgtagattctctggtc attgagcatctccaagtgaacaaagcacctacgatgtgccaacggacctacagagctcat ggtcggattaactcacacgtgatctctccctgccacatcgaaatgatctttagtgaaaag gaacaaattgttcctaaaccagaagaggcggttgcccagaagaaaaagatatcccagaag aaactgaagaaacaaaaacttatggcacaggagtaa >gi568815590r:94781635_94994121|GENSCAN_predicted_peptide_4|329_aa MCCRISIHSPQRDHEHKEASTNILARTTKVDPRLKPGHSLCSVSACPLVLQMGGPPGAIG NTRNCTAVQLVVDPSHPQGNSTPRGEDSPWMRTTMSRTKKEEGKNTRRGRRKSPECLEGT QVALGSAASKRQRSTLVGRGRMPPNQWLGKDQGSSGLVETHPRNTNVTDIAEAYYKLPPN PRCHCPEIDGYFGEARFYAITAAWADAARPSYRGFRDAFALLLHPGDLCSRSCSPPRCSR RKTCLRNTRRKSDLTMCSFGEGGGTWGRCPSPTPGVRGRGWRTQKNPFEEDVPPCAFKVS GVLGKFLDTRVPAVHSVEEMQACIQNELP >gi568815590r:94781635_94994121|GENSCAN_predicted_CDS_4|990_bp atgtgctgcagaatatcaattcacagcccccagagggaccatgagcataaagaggcttcc actaatattcttgcacgaaccaccaaggtggatccacgacttaaaccaggacactccctc tgctctgtgtcagcatgtcctttagtgctgcagatgggaggtcccccaggtgccatagga aacaccagaaactgcacagcagtccagctggtggtggatcccagccatccccagggaaat tcaaccccaagaggagaagactcaccctggatgagaaccactatgagcaggaccaagaag gaagagggcaagaacactaggagaggaagaaggaaatcaccagagtgcctagagggcacc caggtagctttgggaagtgcagcttccaagagacagaggagtaccttggtaggaaggggc aggatgcctccgaatcagtggctggggaaggaccagggaagctctgggcttgttgaaacc cacccaagaaatacaaatgttacagatatagctgaagcctattataaactccccccgaac ccaagatgccactgtcctgaaattgatggctactttggtgaagcacgcttttatgctatt actgcggcctgggcagacgcggcccgcccgagctaccgcgggttccgagacgccttcgca ctgctcctccacccgggggatctttgttcccggagctgttccccgcctcgctgctcccgc cgcaaaacctgtttgcggaatacccgccgcaagtctgacttgacgatgtgcagttttggg gagggaggcgggacgtgggggcgctgcccaagccccactcccggtgtacgggggagaggg tggaggacccagaaaaacccgtttgaggaagatgtgcctccctgcgcgttcaaggtgtca ggagtgcttggtaaatttttggacactcgcgttccagctgtacattcagttgaggaaatg caggcttgtatccaaaatgaattaccatga >gi568815590r:94781635_94994121|GENSCAN_predicted_peptide_5|113_aa MGIKGFHFFGGSAAAECRTLNERQLRDAHREWLATQPMQDIDLFPGDRIVMFSATRGFQC GLGGVAITLEGLEHPLCPWDGTTVAKTSGQIQRYVFLVALKMTEQIMGTAQNE >gi568815590r:94781635_94994121|GENSCAN_predicted_CDS_5|342_bp atgggaattaaaggctttcatttcttcggaggctcagctgcagctgaatgcagaacccta aatgagaggcagctgcgggatgctcatagagaatggctggcaacacagcctatgcaggat attgacctgtttccaggagacaggatagtaatgttcagtgccaccagggggttccagtgt ggcctgggaggagtggctatcactctagagggcttggagcatcctctctgtccctgggat ggaacaacagtagccaaaacaagtgggcagattcaacgctatgtattcctggtagctctg aaaatgacagagcaaatcatgggaactgcacaaaatgaatga >gi568815590r:94781635_94994121|GENSCAN_predicted_peptide_6|370_aa MKDELLPHLAGLLSTLSPKRLNKMFVGEVSSSSNQEPEFNEKEDDEWILVDFIDTCTGFS AEEEEEEEDISEESPTEHPSVFSCLPASLECLADTSDSCFLQFESCPMEESWFITPPPCF TAGGLTTIKVETSPMENLLIEHPSMSVYAVHNSCPGLSEATRGTDELHSPSSPRKTACSS AHRADLFLPSLIKVASFQTRCCLFTLELPSTTKQLFVSQLRAIGSVESSSSHSSRVSPRV EAQNEMGQHIHCYVAALAAHTTFLEQPKSFRPSQWIKEHSERQPLNRNSLRRQNLTRDCH PRQVKHNGWVVHQPCPHSHLHDFGHSHGRMSTFNRFSFRHACSANLQLQVNLNARLASTF ELPKSNWKKK >gi568815590r:94781635_94994121|GENSCAN_predicted_CDS_6|1113_bp atgaaggatgagctcctccctcacttggctgggcttctgagcactctgtcgcccaagagg ctgaataaaatgtttgtgggtgaagtcagttcttcctccaaccaagaaccagaattcaat gagaaagaagatgatgaatggattcttgttgacttcatagatacttgcactggtttctca gcagaagaagaagaagaagaggaggacatcagtgaagagtcacctactgagcacccttca gtcttttcctgtttaccggcatctcttgagtgcttggctgatacaagtgattcctgcttt ctccagtttgagtcatgtccaatggaggagagctggtttatcaccccacccccatgtttt actgcaggtggattaaccactatcaaggtggaaacaagtcctatggaaaaccttctcatt gaacatcccagcatgtctgtctatgctgtgcataactcctgccctggtctcagtgaggcc acccgtgggactgatgaattacatagcccaagtagtcccaggaaaacagcgtgcagttca gcccacagagccgatttgtttttaccttctttgatcaaagtagcatccttccaaacaaga tgctgtctgttcacgttggaactgccatccacaaccaagcagctctttgtcagtcagttg agagctattggcagtgtggaatccagctcctcacacagttccagagtcagtccaagagtg gaagctcaaaatgaaatggggcagcatattcattgttatgttgcagctcttgctgctcat acaacttttctggaacaacccaagagctttcgcccttcccagtggataaaagaacacagt gaaagacagcctcttaacagaaatagccttcgtcgccaaaatcttaccagggattgccac cctcggcaagtcaagcacaatggctgggttgttcatcagccctgcccgcactcccatctg cacgattttggccattctcatggtcgcatgtcaactttcaatcgtttctctttccgacat gcctgttctgctaacctccaactacaagtcaatctgaatgcaaggcttgcttctacattt gagctgcctaagagcaactggaaaaaaaaatga >gi568815590r:94781635_94994121|GENSCAN_predicted_peptide_7|354_aa XPLVSASARKELGSPFYLNSNYKAGQAEKSTTLLAPVRKTCCHGYLPTFTISHLVLWTFE FPEVGSAGSLGFGILYYLVRCLATALVPTRLQLSQPLALKGGFHERKEDINIHGLKTNTL EDTEMEHLAESQDSMATNHSSRTACAEVTTDHHLLCWTPADSSRAGTMSERPKKRPGDNK RDIGFVGWIYKQGWSNGGRRNPERAASPAGSRWRFLQQAFTLSSRFLLPGLVSELSAPPA GRQVTCVGDNLSQGGPGAPAPPIAWLVFLAAAGPPPLTPPHSPPLLLLRLLSAVGWGRPF GTRRRGSRGHPRDASPKTAALGLVARRSVPALRSRPLRLPSEFRRPCGYGEPAP >gi568815590r:94781635_94994121|GENSCAN_predicted_CDS_7|1065_bp ngacctctggtttctgcttctgcacgtaaggagcttggaagtccattctatcttaacagc aactacaaagctggacaggctgaaaaatcaacaactcttcttgcacctgtaagaaagacc tgctgtcatgggtatttgccaaccttcacaatttcccacctggttctttggacttttgag tttcctgaagttggctctgcaggcagcttgggttttggaatcctctattacctagttaga tgtctggctacggcccttgttccaacccgtctccagctgagccagcctctggccctcaag ggtggtttccatgagaggaaagaggacataaacatacatggtctcaaaacaaacactttg gaggatactgaaatggagcatttagcagaaagtcaggactccatggccaccaaccactcc tcacgaactgcctgtgccgaagtcaccactgaccaccatctgttgtgctggactcctgct gactccagtagggctggcaccatgtcagaaaggccgaagaagagacctggagacaacaaa cgagacatagggtttgttggatggatttacaaacagggatggtccaatggcggcaggcgg aatccagagcgtgcagcaagcccggccggctctcggtggcggtttctacagcaggccttc acgctcagctcccggtttttgttgcccgggcttgtttcggagctgagcgcgccgccggcc gggcgccaggtcacgtgcgttggtgacaacctctcgcagggcggccccggggcccccgca ccgccgattgcgtggcttgttttcttggccgcggcgggacctcctcctctcacccctcct cactcccctccactcctcctcctccgcctgctctcggccgttggatggggccgccccttc gggactcggcgtcggggctcccgcggccacccccgggacgcatctccgaagacagcggcg cttgggcttgtggcccggcgctctgtccccgccctgcgatcccgtcccctgcgcctgccc tccgagttccggaggccctgcggctatggggaacctgctccgtga