GENSCAN 1.0 Date run: 3-Nov-116 Time: 15:35:58 Sequence gi568815594r:163371857_163573734 : 201878 bp : 37.26% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1645 1811 167 1 2 90 58 72 0.053 3.08 1.02 Term + 32904 33172 269 1 2 56 44 218 0.451 8.87 1.03 PlyA + 34067 34072 6 1.05 2.00 Prom + 39760 39799 40 -5.55 2.01 Sngl + 42910 43434 525 0 0 33 43 230 0.632 8.83 2.02 PlyA + 44475 44480 6 1.05 3.00 Prom + 44944 44983 40 -7.15 3.01 Init + 54001 54239 239 0 2 55 73 146 0.615 7.34 3.02 Intr + 54554 54624 71 2 2 53 69 65 0.732 -0.99 3.03 Intr + 55096 55182 87 2 0 48 92 100 0.923 5.42 3.04 Intr + 56722 56856 135 2 0 58 52 110 0.083 4.02 3.05 Intr + 57639 57881 243 1 0 62 88 66 0.040 0.55 3.06 Term + 80131 80324 194 2 2 46 47 153 0.066 3.60 3.07 PlyA + 80416 80421 6 1.05 4.00 Prom + 89289 89328 40 -3.65 4.01 Sngl + 91313 91501 189 1 0 42 39 180 0.527 3.36 4.02 PlyA + 92234 92239 6 1.05 5.02 PlyA - 93293 93288 6 1.05 5.01 Sngl - 101875 99998 1878 1 0 80 37 1927 0.605 180.49 5.00 Prom - 110981 110942 40 -6.55 6.10 PlyA - 111463 111458 6 1.05 6.09 Term - 122880 122729 152 1 2 26 43 178 0.989 4.19 6.08 Intr - 123040 122900 141 2 0 112 89 197 0.999 21.60 6.07 Intr - 126393 126281 113 2 2 46 97 70 0.444 2.80 6.06 Intr - 150233 150125 109 0 1 57 94 80 0.122 3.92 6.05 Intr - 151324 151100 225 2 0 81 87 89 0.030 5.13 6.04 Intr - 157190 156948 243 0 0 84 56 129 0.044 5.85 6.03 Intr - 165251 165180 72 0 0 60 98 79 0.478 4.56 6.02 Intr - 172727 172580 148 2 1 76 50 175 0.873 11.39 6.01 Init - 186641 186507 135 2 0 73 60 84 0.302 4.29 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 143457 143605 149 0 2 104 50 132 0.958 9.96 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:163371857_163573734|GENSCAN_predicted_peptide_1|145_aa XYIQPVNFSGKAIDFEPAQAWISISPESFKRCITLAQRTHSSEPQYLHLLNRDKSNGSSG ATIDFCLSFPVCGILLQQQPYQKEYAFPTPQYCQHSVALVLFPAISNKLSLFLMIAHASY IGDASIQQEPAGFATLLKLSRRTFE >gi568815594r:163371857_163573734|GENSCAN_predicted_CDS_1|438_bp nngtacattcagccagtgaattttagtggaaaggccatcgactttgaaccagcacaggcc tggattagtatttcaccagagtcattcaaaagatgtataaccttagcccaacggactcat tcttccgaacctcagtatcttcacctattaaacagggataaaagcaatggaagttccggt gcaacaatcgatttctgtttaagcttcccagtttgtggtattttgctacagcagcagccc tatcagaaagaatacgccttccccactccacaatactgccagcattctgtggcattggtg ctgtttcctgcaataagcaataaactcagccttttcttaatgatagctcatgctagttat attggagatgccagcattcagcaagagcctgctggatttgccactcttttaaagctgtca agaagaacttttgaataa >gi568815594r:163371857_163573734|GENSCAN_predicted_peptide_2|174_aa MPSQKFPAGAGHLWRAFARAVQKGNIGLEAPHRVPTEAPPIGAVRRGPLSSRPQNGRSTN SLPRLPGKATDTQRQPMKAAREETVPCKATGAELPKTIGTHLLHQHDLDVRPGVKGDHFG ALKFDYPPGFWTCMGTVTTSFCPISPIWNGCIYPMPVLPLYLGSNQFAFNFTGS >gi568815594r:163371857_163573734|GENSCAN_predicted_CDS_2|525_bp atgcccagtcagaagttccctgcaggggcagggcacttatggagagcctttgctagggca gtgcagaagggaaatatagggttggaggccccacacagagtccctactgaggcaccacct attggagctgtgagaagagggccactgtcctccagaccccagaatggtagatccactaac agcttgccccgtttgcctggaaaagccacagacactcaacgccagcccatgaaagcagcc agggaggagactgtaccctgcaaagccacaggggcagagctgcccaagaccataggaacc cacctcttgcatcagcatgatctggacgtgagacctggagtcaaaggagatcattttgga gctttaaaatttgactaccctcctggattttggacttgcatgggaactgtaaccacttcg ttttgtccaatttctcccatttggaatggctgtatttacccaatgcctgtactcccattg tatttaggaagtaaccagtttgctttcaattttacaggctcatag >gi568815594r:163371857_163573734|GENSCAN_predicted_peptide_3|322_aa MRQSLEGAFPQRTKFLWTRSPQPDDPTTGLRVPGQAPAHVITLTQPRVHLTIEGQEIDFL LDTGVAFSVLVSCPGQLSSRETGIALGVLTQTRGTTPQPVAYLRISAQLAELVILTRALT LGKGRRINVYTDIALRAATFTARVCSFIPEVSKPTNPLGGTNNSGHANFKSCNIHCEDIP DDIDGCFKAYKLHTFRTLVLVCFCIATKKYTRLGNLQRKRFNWLMVLQAIQEAWCQHLLS FWEGLMELLLMAEGEAGAGNLEEMDKFLEIYNPPRLNQEEIETLNRPITSSKIEMVILKV QDQMDSQLNSMRHSEKNWYQSY >gi568815594r:163371857_163573734|GENSCAN_predicted_CDS_3|969_bp atgaggcaatcactggaaggagcatttccccagaggacaaagtttctctggaccagaagc ccccaaccagatgatccaacaacaggactgagggtgccagggcaagcgccagctcatgtc atcaccctcactcagccccgggtacatttaaccattgagggccaggaaattgacttcctt ctggacactggcgtggctttctcagtgttagtctcctgtcctggacagctgtcctcaaga gagacaggaatagctcttggggtccttactcagactcgtgggacaaccccacaaccagtg gcatatctaagaatcagtgcccagttagcagaactagtgatacttacccgagccttaaca ctgggaaagggaagaagaataaatgtgtatacagatatagctttacgagctgcaacattc actgcaagggtctgcagcttcattcctgaagtcagcaagcccacgaacccactgggagga acaaacaactccggacatgccaactttaagagctgtaacattcactgcgaagacattcct gatgacattgatggttgctttaaggcttacaaattgcacacttttaggactcttgtgtta gtctgcttttgcattgctacaaagaaatacacaagactgggtaatttacaaagaaagagg tttaattggctcatggttctgcaggctatacaggaagcatggtgccagcatctgctcagc ttctgggaaggcctcatggagcttttactcatggcagaaggcgaagcaggagcaggaaac ctagaggaaatggataaattcctggaaatatacaaccctcctagattaaaccaggaagaa atagaaactctgaacagaccaataacaagcagcaagattgaaatggtaattttaaaagtc caggaccagatggattcacagctgaattctatgagacattcagagaagaattggtaccaa tcctattga >gi568815594r:163371857_163573734|GENSCAN_predicted_peptide_4|62_aa MKDNIGNDDRGIPKQISFRNKNMDFRTWKQTHLKSSMPTYQDPSKVGLPQFEDCELHSLE AL >gi568815594r:163371857_163573734|GENSCAN_predicted_CDS_4|189_bp atgaaagacaatattggaaatgatgacaggggtattcccaaacagatcagttttagaaac aaaaacatggacttcaggacctggaagcagacgcacttaaaatcttccatgccgacatac caagatccttccaaagtaggcctacctcagtttgaagactgtgaacttcactcgctagaa gcactttaa >gi568815594r:163371857_163573734|GENSCAN_predicted_peptide_5|625_aa MANDAKPDVKTVQVLRDTANRLRIHSIRATCASGSGQLTSCCSAAEVVSVLFFHTMKYKQ TDPEHPDNDRFILSRGHAAPILYAAWVEVGDISESDLLNLRKLHSDLERHPTPRLPFVDV ATGSLGQGLGTACGMAYTGKYLDKASYRVFCLMGDGESSEGSVWEAFAFASHYNLDNLVA VFDVNRLGQSGPAPLEHGADIYQNCCEAFGWNTYLVDGHDVEALCQAFWQASQVKNKPTA IVAKTFKGRGIPNIEDAENWHGKPVPKERADAIVKLIESQIQTNENLIPKSPVEDSPQIS ITDIKMTSPPAYKVGDKIATQKTYGLALAKLGRANERVIVLSGDTMNSTFSEIFRKEHPE RFIECIIAEQNMVSVALGCATRGRTIAFAGAFAAFFTRAFDQLRMGAISQANINLIGSHC GVSTGEDGVSQMALEDLAMFRSIPNCTVFYPSDAISTEHAIYLAANTKGMCFIRTSQPET AVIYTPQENFEIGQAKVVRHGVNDKVTVIGAGVTLHEALEAADHLSQQGISVRVIDPFTI KPLDAATIISSAKATGGRVITVEDHYREGGIGEAVCAAVSREPDILVHQLAVSGVPQRGK TSELLDMFGISTRHIIAAVTLTLMK >gi568815594r:163371857_163573734|GENSCAN_predicted_CDS_5|1878_bp atggccaacgacgccaagcccgacgtgaagaccgtgcaggtgctgcgggacacagccaac cgcctgcggatccattccatcagggccacgtgtgcctctggttctggccagctcacgtcg tgctgcagtgcagcggaggtcgtgtctgtcctcttcttccacacgatgaagtataaacag acagacccagaacacccggacaacgaccggttcatcctctccaggggacatgctgctcct atcctctatgctgcttgggtggaggtgggtgacatcagtgaatctgacttgctgaacctg aggaaacttcacagcgacttggagagacaccctaccccccgattgccgtttgttgacgtg gcaacagggtccctaggtcagggattaggtactgcatgtggaatggcttatactggcaag taccttgacaaggccagctaccgggtgttctgccttatgggagatggcgaatcctcagaa ggctctgtgtgggaggcttttgcttttgcctcccactacaacttggacaatctcgtggcg gtcttcgacgtgaaccgcttgggacaaagtggccctgcaccccttgagcatggcgcagac atctaccagaattgctgtgaagcctttggatggaatacttacttagtggatggccatgat gtggaggccttgtgccaagcattttggcaagcaagtcaagtgaagaacaagcctactgct atagttgccaagaccttcaaaggtcggggtattccaaatattgaggatgcagaaaattgg catggaaagccagtgccaaaagaaagagcagatgcaattgtcaaattaattgagagtcag atacagaccaatgagaatctcataccaaaatcgcctgtggaagactcacctcaaataagc atcacagatataaaaatgacctccccacctgcttacaaagttggtgacaagatagctact cagaaaacatatggtttggctctggctaaactgggccgtgcaaatgaaagagttattgtt ctgagtggtgacacgatgaactccaccttttctgagatattcaggaaagaacaccctgag cgtttcatagagtgtattattgctgaacaaaacatggtaagtgtggcactaggctgtgct acacgtggtcgaaccattgcttttgctggtgcttttgctgccttttttactagagcattc gatcagctccgaatgggagccatttctcaagccaatatcaaccttattggttcccactgt ggggtatccactggagaagatggagtctcccagatggccctggaggatctagccatgttc cgaagcattcccaattgtactgttttctatccaagtgatgccatctcgacagagcatgct atttatctagccgccaataccaagggaatgtgcttcattcgaaccagccaaccagaaact gcagttatttataccccacaagaaaattttgagattggccaggccaaggtggtccgccac ggtgtcaatgataaagtcacagtaattggagctggagttactctccatgaagccttagaa gctgctgaccatctttctcaacaaggtatttctgtccgtgtcatcgacccatttaccatt aaacccctggatgccgccaccatcatctccagtgcaaaagccacaggcggccgagttatc acagtggaggatcactacagggaaggtggcattggagaagctgtttgtgcagctgtctcc agggagcctgatatccttgttcatcaactggcagtgtcaggagtgcctcaacgtgggaaa actagtgaattgctggatatgtttggaatcagtaccagacacattatagcagccgtaaca cttactttaatgaagtaa >gi568815594r:163371857_163573734|GENSCAN_predicted_peptide_6|445_aa MTKLSSRRPILKPENLLSSPKHLLHLLKHLLRTTEKLFLLPHSEEVAAAHASVHRPEHFK KPKGAAGIKGRPGPTIGIVDGVSVRQRGDGDSVPEKNRVHLFEGKKKSWLSVKDGHLDSV LEWPFWTKLVVVAIGFTGGLVFMYVQCKVYVQLWRRLKAYNRVIFVQNCPDTAKKLEKNF SCNVNTDIKDAVVVPVPQTEDQEANDILHIPMLWEDYRTVFQGSVRSSAAKPCPAVDQVD THKAIRVLATGARTVARRPVKKLLHIGQARHGDFATICYPQGITKMIAAGEKKTRPRAKV FYEWAEGSSGRHIYTCTKGGINNQCSSIAMSVVAKECKQFKGLSQIGLSSRNRTLPLCLL NKSLPGNQPSGSGEQEGAHHGDVLVATEQLRTLDPGAAAHKSTRAESHVAPEKPAPTQPE QERRVRFSKASPLTETASFSPVLGF >gi568815594r:163371857_163573734|GENSCAN_predicted_CDS_6|1338_bp atgaccaagttgagtagcagaagacccatcctgaagccagagaaccttctatcttctccg aagcatctcctccacttgctcaagcatctcctcagaacaacagaaaaactcttcctactg cctcactcagaagaggtggcagcagcacacgccagcgtgcaccgccctgagcatttcaaa aagccgaaaggggctgcggggataaagggaaggccaggacccacgataggaattgtggat ggggtttctgtgcgtcaaagaggggatggtgacagcgttccagaaaaaaatcgtgtccac ttatttgaaggcaagaaaaagagctggctgtctgttaaagatggtcatttggacagtgtc cttgaatggccattttggacaaaactggttgtggtagccattggcttcacaggaggtctt gtcttcatgtacgtacagtgtaaagtctatgttcagttgtggcgcaggctgaaggcctac aaccgtgtgatctttgtacaaaattgcccagacactgccaaaaaactggagaagaacttc tcatgtaatgtaaacacagacatcaaagatgctgtggtagtgcctgtaccacaaacagaa gatcaggaggcaaatgatatcctccatatccccatgctctgggaagattataggacagtc ttccaaggctcagtcaggtcatccgcagcaaaaccctgcccagctgtggatcaagtggac acacacaaggccatcagggtgctggcaacaggagcaagaacagtggcaagaaggccagtt aagaagctactgcacataggacaggcaagacatggcgactttgccactatttgttatcca caaggaatcactaaaatgattgctgcaggggagaagaaaactaggccacgagctaaagta ttctatgaatgggcagaaggttcttctgggagacatatttacacgtgtaccaaagggggg ataaataatcaatgcagcagtattgcaatgtctgtggtagcaaaagaatgtaaacaattt aaaggtctatcacagatcgggttaagttcccgaaaccgcaccctccctctctgcctgctc aacaagagccttccggggaaccaaccgagcggttcgggggagcaggagggggcccaccat ggtgacgtcctcgtggccacggagcagctccgcactctagacccaggagcagcagctcac aagagcacccgggccgaatcccacgtggcgcctgagaagcccgcgccgacccaaccggaa caggagaggcgagtgcgtttctctaaagcgtccccgttaactgagaccgcgtctttttcg cccgtgttgggcttttaa