GENSCAN 1.0	Date run: 8-Nov-116	Time: 03:35:40

Sequence gi568815582f:69009396_69218459 : 209064 bp : 45.79% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  24417  24569  153  2  0   81   -9   189 0.723   8.48
 1.02 Term +  24600  25193  594  2  0   -3   44   360 0.368  17.03
 1.03 PlyA +  25984  25989    6                               1.05

 2.00 Prom +  27821  27860   40                              -8.96
 2.01 Init +  30056  30058    3  1  0   93   53     0 0.839  -3.00
 2.02 Intr +  30913  31026  114  0  0  115   78    97 0.981  12.04
 2.03 Intr +  43239  43355  117  2  0   44   36    95 0.033   0.56
 2.04 Intr +  60381  60495  115  2  1   87   32    88 0.026   3.12
 2.05 Term +  73977  74266  290  1  2   58   41   367 0.028  24.54
 2.06 PlyA +  74797  74802    6                               1.05

 3.00 Prom +  79532  79571   40                              -7.06
 3.01 Init +  80115  80304  190  2  1   80  100    96 0.459   7.18
 3.02 Intr +  96270  96392  123  1  0   68   96    61 0.051   5.46
 3.03 Intr +  97975  98171  197  2  2   69   56   118 0.062   5.83
 3.04 Intr + 100001 100636  636  1  0  126  110  1142 0.129 112.49
 3.05 Intr + 104046 104147  102  2  0   92  110   129 0.998  15.87
 3.06 Term + 104948 105871  924  1  0   71   44  1287 0.985 114.78
 3.07 PlyA + 108239 108244    6                               1.05

 4.05 PlyA - 108310 108305    6                               1.05
 4.04 Term - 111254 111030  225  2  0   59   48   237 0.946  13.58
 4.03 Intr - 111775 111658  118  0  1   60   78    87 0.885   5.37
 4.02 Intr - 112098 112041   58  1  1  126  111   -19 0.795   2.14
 4.01 Init - 112750 112714   37  1  1   35   98    20 0.503  -2.13
 4.00 Prom - 115897 115858   40                              -5.86

 5.00 Prom + 117990 118029   40                              -7.66
 5.01 Init + 121478 121525   48  1  0   86   86     3 0.916   0.85
 5.02 Intr + 123060 123294  235  2  1   93  105   107 0.996  10.16
 5.03 Intr + 124063 124223  161  2  2  123  106   138 0.998  18.81
 5.04 Intr + 127301 127492  192  1  0   42   45   145 0.848   5.29
 5.05 Intr + 128406 128490   85  2  1   82   80    89 0.893   6.89
 5.06 Intr + 133783 133994  212  2  2   65   86   196 0.980  15.73
 5.07 Intr + 141142 141313  172  0  1   55   90   145 0.986  10.92
 5.08 Intr + 141418 141509   92  2  2  100  113   103 0.998  13.71
 5.09 Intr + 146476 146598  123  0  0   45  110    26 0.685   1.28
 5.10 Intr + 147728 147845  118  1  1   94   75    30 0.836   2.34
 5.11 Intr + 150961 151067  107  2  2   66  105    64 0.930   5.83
 5.12 Intr + 153688 153783   96  0  0   70  110   118 0.989  12.41
 5.13 Intr + 155946 156131  186  2  0  111   97   113 0.999  14.39
 5.14 Intr + 157680 157790  111  2  0   84   93    34 0.892   4.08
 5.15 Term + 159426 159542  117  2  0   43   41   127 0.931   2.04
 5.16 PlyA + 159606 159611    6                               1.05

 6.00 Prom + 174577 174616   40                              -4.26
 6.01 Init + 177772 178351  580  0  1   47   84  1126 0.784 101.42
 6.02 Term + 191677 191759   83  2  2   94   44    29 0.052  -3.04
 6.03 PlyA + 191763 191768    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815582f:69009396_69218459|GENSCAN_predicted_peptide_1|248_aa
MTEDKEWIPITKLVPLVKDMKIKSLETYLFSLPIEECDIISFFQGASLMGETHTGHRTRF
KAFVTVRDYDGHISLSVKCSKEVTTAIQGTITLAKLSIVPMRRGYWGNKIGKSYTTPCMV
TGCCSSGLVHLIPAPRNNGIISTPVPKMLLLMAGIDNCYTSARGCTATLDNFTKATFNAI
SRTYSCLTPSLWKETIHQASLWNALTITSRPTPEPPCRGPRLQLWLQHKLSYKNNSELHL
LKIISRRK
>gi568815582f:69009396_69218459|GENSCAN_predicted_CDS_1|747_bp
atgactgaggacaaggagtggatacccatcaccaagctggtccctctggtcaaggatatg
aagatcaagtccctggagacctatctcttctccctgcccatcgaggagtgtgatatcatt
agcttttttcaaggggcatctctcatgggtgagacccacactggccaccgcaccagattc
aaggcatttgttactgttagggactatgatggccacatcagtttaagtgttaagtgctcc
aaggaggttactactgccatccaagggaccatcaccctggccaagctttccattgtccct
atgcggagaggttactgggggaacaagatcggcaagtcctacaccaccccctgtatggtt
acaggctgctgtagctctgggctagtacaccttatccctgcccccaggaacaatggcatc
atctcgactcctgtgcccaagatgctgctgctgatggccggtattgacaactgctacacc
tcagccaggggctgcactgccaccctggacaacttcaccaaggccaccttcaatgccatc
tccaggacctatagctgcctgacccccagcctctggaaagagactattcaccaagcctcc
ctgtggaatgcactgaccatcacatcaagacccacaccagagcctccatgcagaggaccc
aggctccagctgtggttacagcataagctttcatacaagaataatagtgaattacacctg
ttaaaaataataagcagaagaaaatga
>gi568815582f:69009396_69218459|GENSCAN_predicted_peptide_2|212_aa
MVTACLIAVAKTDGEVQVRRAAIHVVVLLLRGLSQKATEVEKLAYVGGMKKQSSGFGFKY
GLWSWMSKLKSQLCNLLANIEFLNTTVTRKGVSIPDPKKGILDLAQERIQGESTLQRSLW
MSSQISSGRRRGQFSRTEKCRHSRQAVMAVSLMQVLSAVLKDLYHLLKHVVCLEPDDVAK
LHAQLALEELDDIMKNFLFPPQKLEKKIMVLP
>gi568815582f:69009396_69218459|GENSCAN_predicted_CDS_2|639_bp
atggtaacagcttgcctgattgctgtggccaaaacagatggtgaagttcaagtacgcaga
gctgccatacatgtggttgtgctgctgcttcggggactcagccagaaagctactgaggtg
gaaaaactggcctatgtgggaggtatgaagaaacagtccagtggctttggcttcaagtat
ggactctggagctggatgtccaagctgaaatcccagctttgcaacttgctggcaaatatt
gaatttctaaatacaactgttacaaggaaaggggtctccattccagaccccaagaaaggg
atcttggatctcgcgcaagaaagaattcagggcgagtccacattgcaaagaagtctgtgg
atgtcctcacagatctcctcaggcaggaggagagggcagttcagtcgtactgagaaatgc
cggcacagtagacaggcggtcatggctgtctctctcatgcaggtgctgagcgccgtcctc
aaggatctctaccacctgctgaagcacgtagtgtgtctggagcccgatgacgtggccaag
ctccatgcccagttggccctagaagagctggatgacatcatgaaaaacttcctgttccct
ccacagaagctggagaagaagatcatggtcctgccgtag
>gi568815582f:69009396_69218459|GENSCAN_predicted_peptide_3|723_aa
MLQSHAAATLQCAAPLPAMLHSVGTESPLGGWEPVLLLTIPGAASFLQAELRAVLSALCT
GWQAPLSCQQEGFAALAFGSPLDSELPLRVQKSGKSALSSPSLQGTLCQGSWGTRKSRAW
SDWDPLGFPFLPGWVVGKRHVGVAGAAFAKRRGSSGDEKRGECVRGCFDLMPVQLTTALR
VVGTSLFALAVLGGILAAYVTGYQFIHTEKHYLSFGLYGAILGLHLLIQSLFAFLEHRRM
RRAGQALKLPSPRRGSVALCIAAYQEDPDYLRKCLRSAQRISFPDLKVVMVVDGNRQEDA
YMLDIFHEVLGGTEQAGFFVWRSNFHEAGEGETEASLQEGMDRVRDVVRASTFSCIMQKW
GGKREVMYTAFKALGDSVDYIQVCDSDTVLDPACTIEMLRVLEEDPQVGGVGGDVQILNK
YDSWISFLSSVRYWMAFNVERACQSYFGCVQCISGPLGMYRNSLLQQFLEDWYHQKFLGS
KCSFGDDRHLTNRVLSLGYRTKYTARSKCLTETPTKYLRWLNQQTRWSKSYFREWLYNSL
WFHKHHLWMTYESVVTGFFPFFLIATVIQLFYRGRIWNILLFLLTVQLVGIIKATYACFL
RGNAEMIFMSLYSLLYMSSLLPAKIFAIATINKSGWGTSGRKTIVVNFIGLIPVSIWVAV
LLGGLAYTAYCQDLFSETELAFLVSGAILYGCYWVALLMLYLAIIARRCGKKPEQYSLAF
AEV
>gi568815582f:69009396_69218459|GENSCAN_predicted_CDS_3|2172_bp
atgctgcagtcacatgcagcagccacactgcagtgcgcggcacctctgcctgccatgctc
cactctgttggaactgaatcaccgcttgggggttgggaacctgtgttgctccttacaata
cccggtgctgcctctttcctgcaggcagagctcagggctgtgctcagcgccctctgcaca
ggctggcaagcgcctctttcctgccagcaggaaggttttgctgccttggctttcgggagc
cccctagacagcgaactgccccttcgcgtgcagaagtcggggaagagtgctctcagctcg
ccttccttgcagggcacactttgccaaggtagctggggtaccaggaagagcagggcctgg
tccgactgggatcccttgggttttccgttccttcccggttgggtcgtgggaaagcggcac
gtgggtgtggccggcgccgcctttgccaagaggcgaggctcgagcggggatgagaagcgt
ggcgagtgcgttcgcggctgctttgacctgatgccggtgcagctgacgacagccctgcgt
gtggtgggcaccagcctgtttgccctggcagtgctgggtggcatcctggcagcctatgtg
acgggctaccagttcatccacacggaaaagcactacctgtccttcggcctgtacggcgcc
atcctgggcctgcacctgctcattcagagcctttttgccttcctggagcaccggcgcatg
cgacgtgccggccaggccctgaagctgccctccccgcggcggggctcggtggcactgtgc
attgccgcataccaggaggaccctgactacttgcgcaagtgcctgcgctcggcccagcgc
atctccttccctgacctcaaggtggtcatggtggtggatggcaaccgccaggaggacgcc
tacatgctggacatcttccacgaggtgctgggcggcaccgagcaggccggcttctttgtg
tggcgcagcaacttccatgaggcaggcgagggtgagacggaggccagcctgcaggagggc
atggaccgtgtgcgggatgtggtgcgggccagcaccttctcgtgcatcatgcagaagtgg
ggaggcaagcgcgaggtcatgtacacggccttcaaggccctcggcgattcggtggactac
atccaggtgtgcgactctgacactgtgctggatccagcctgcaccatcgagatgcttcga
gtcctggaggaggatccccaagtagggggagtcgggggagatgtccagatcctcaacaag
tacgactcatggatttccttcctgagcagcgtgcggtactggatggccttcaacgtggag
cgggcctgccagtcctactttggctgtgtgcagtgtattagtgggcccttgggcatgtac
cgcaacagcctcctccagcagttcctggaggactggtaccatcagaagttcctaggcagc
aagtgcagcttcggggatgaccggcacctcaccaaccgagtcctgagccttggctaccga
actaagtataccgcgcgctccaagtgcctcacagagacccccactaagtacctccggtgg
ctcaaccagcaaacccgctggagcaagtcttacttccgggagtggctctacaactctctg
tggttccataagcaccacctctggatgacctacgagtcagtggtcacgggtttcttcccc
ttcttcctcattgccacggttatacagcttttctaccggggccgcatctggaacattctc
ctcttcctgctgacggtgcagctggtgggcattatcaaggccacctacgcctgcttcctt
cggggcaatgcagagatgatcttcatgtccctctactccctcctctatatgtccagcctt
ctgccggccaagatctttgccattgctaccatcaacaaatctggctggggcacctctggc
cgaaaaaccattgtggtgaacttcattggcctcattcctgtgtccatctgggtggcagtt
ctcctgggagggctggcctacacagcttattgccaggacctgttcagtgagacagagcta
gccttccttgtctctggggctatactgtatggctgctactgggtggccctcctcatgcta
tatctggccatcatcgcccggcgatgtgggaagaagccggagcagtacagcttggctttt
gctgaggtgtga
>gi568815582f:69009396_69218459|GENSCAN_predicted_peptide_4|145_aa
MCLKDLEKYLAHTCFHLLFKDRKLMVQIVISSARAGGLAEWVLMELQGEIEARYSTGLAG
NLLGDLHYTTEGIPVLIVGHHILYGKIIHLEKPFAVLVKHTPGDQDCDELGRETGTRYLV
TALIKDKILFKTRPKPIITSVPKKV
>gi568815582f:69009396_69218459|GENSCAN_predicted_CDS_4|438_bp
atgtgtctaaaggacttagaaaagtacctagcacacacttgttttcacttgctttttaaa
gacagaaagctcatggtgcaaattgttatttccagtgcgagggctggaggcctggcagaa
tgggtgctgatggagctacagggggagatcgaggctcgctacagcactggattagctgga
aacctcctgggagacctacattacaccactgagggaatccctgtgctgatcgtggggcat
catatcctgtatgggaaaatcatccacctggagaaaccttttgcagtccttgtcaaacac
actcctggggatcaggactgtgatgagcttggccgcgagactggcacccggtacctggtg
acagcactcatcaaagacaagatccttttcaaaacccgccccaagcccattatcaccagc
gtccccaagaaagtatga
>gi568815582f:69009396_69218459|GENSCAN_predicted_peptide_5|684_aa
MAVVVHQTASGIKFWKPPVPPARGLTCNGGRRARRRAANGRQPNLPPPSPPPRALPAHFR
LCARSTSGAEPESARARGAGAERGEHREGERGAAGMGEFKVHRVRFFNYVPSGIRCVAYN
NQSNRLAVSRTDGTVEIYNLSANYFQEKFFPGHESRATEALCWAEGQRLFSAGLNGEIME
YDLQALNIKYAMDAFGGPIWSMAASPSGSQLLVGCEDGSVKLFQITPDKIQFERNFDRQK
SSAVHKMIVDRQYMGVSKRKCIVWGVAFLSDGTIISVDSAGKVQFWDSATGTLVKSHLIA
NADVQSIAVADQEDSFVVGTAEGTVFHFQLVPVTSNSSEKQWVRTKPFQHHTHDVRTVAH
SPTALISGGTDTHLVFRPLMEKVEVKNYDAALRKITFPHGPENIICSCISPCGSWIAYST
VSRFFLYRLNYEHDNISLKRILFSEDSTKLFVASNQGALHIVQLSGGSFKHLHAFQPQSG
TVEAMCLLAVSPDGNWLAASGTSAGVHVYNVKQLKLHCTVPAYNFPVTAMAIAPNTNNLV
IAHSDQQVFEYSIPDKQYTDWSRTVQKQGFHHLWLQRDTPITHISFHPKRPMHILLHDAY
MFCIIDKSLPLPNDKTLLYNPFPPTNESDVIRRRTAHAFKISKIYKPLLFMDLLDERTLV
AVERPLDDIIAQLPPPIKKKKFGT
>gi568815582f:69009396_69218459|GENSCAN_predicted_CDS_5|2055_bp
atggcagttgtggttcaccaaactgcttcagggattaagttctggaagcctcccgtgccc
cccgcccgcggcctgacctgcaatggcggccgccgagcgcggcgccgcgcggccaacggg
cgacaaccgaacctcccgccgccgtcgccgccgccgcgagcactgcctgcgcacttccga
ctatgcgccaggagcacttccggggcagagcctgagagcgcgcgcgcacgtggggccggg
gcggagagaggcgagcaccgggaaggggagcgtggggccgctggaatgggtgaatttaag
gtccatcgagtacgtttctttaattatgttccatcaggaatccgctgtgtggcttacaat
aaccagtcaaacagattggctgtttcacgaacagatggcactgtggaaatttataacttg
tcagcaaactactttcaggagaaatttttcccaggtcatgagtctcgggctacagaagct
ttgtgctgggcagaaggacagcgactctttagtgctgggctcaatggcgagattatggag
tatgatttacaggcgttaaacatcaagtatgctatggatgcctttggaggacctatttgg
agcatggctgccagccccagtggctctcaacttttggttggttgtgaagatggatctgtg
aaactatttcaaattaccccagacaaaatccagtttgaaagaaattttgatcggcagaaa
agcagcgctgttcataagatgattgtggacaggcagtatatgggcgtgtctaagcggaag
tgcatcgtgtggggtgtcgccttcttgtccgatggcactatcataagtgtggactctgct
gggaaggtgcagttctgggactcagccactgggacgcttgtgaagagccatctcatcgct
aatgctgacgtgcagtccattgctgtagctgaccaagaagacagtttcgtggtgggcaca
gccgagggaacagtcttccattttcagctggtccctgtgacatctaacagcagtgagaag
cagtgggtgcggacaaaaccgttccagcatcacactcatgacgtgcgcactgtggcccac
agcccaacagcgctgatatctggaggcactgacacccacttagtctttcgtcctctcatg
gagaaggtggaagtaaagaattacgatgccgctctccgaaaaatcacctttccccacggt
cctgagaacattatctgtagctgtatctccccatgtggaagttggatagcctattctaca
gtttctcggttttttctctatcggctgaattatgaacatgacaacataagcctcaaaagg
attttgttttctgaagattcaacaaagctctttgtagcatcaaatcaaggagctctgcat
attgttcagctgtcaggaggaagcttcaagcacctgcatgctttccagcctcagtcagga
acagtggaggccatgtgtcttttggcagtcagtccagatgggaattggctagctgcatca
ggtaccagtgctggagtccatgtctacaacgtaaaacagctaaagcttcactgcacggtg
cctgcttacaatttcccagtgactgctatggctattgcccccaataccaacaaccttgtc
atcgctcattcggaccagcaggtatttgagtacagcatcccagacaaacagtatacagat
tggagccggactgtccagaagcagggctttcaccacctttggctccaaagggatactcct
atcacacacatcagttttcatcccaagagaccgatgcacatccttctccatgatgcctac
atgttctgcatcattgacaagtcattgccccttccaaatgacaaaaccttactctacaat
ccatttcctcccacgaatgaatcagatgtcatccggaggcgcacagctcatgcttttaaa
atttctaagatatataagcctctactcttcatggatcttttggatgaaagaacactcgtg
gcagtagaacggcctctggatgacatcattgctcagctcccaccacccattaaaaagaag
aaatttggaacctaa
>gi568815582f:69009396_69218459|GENSCAN_predicted_peptide_6|220_aa
MRVAAATAAAGAGPAMAVWTRATKAGLVELLLRERWVRVVAELSGESLSLTGDAAAAELE
PALGPAAAAFNGLPNGGGAGDSLPGSPSRGLGPPSPPAPPRGPAGEAGASPPVRRVRVVK
QEAGGLGISIKGGRENRMPILISKIFPGLAADQSRALRLGDAILSVNGTDLRQATHDQAV
QALKRAGKEVLLEGPPGAWFESRTFRKAIGKQHPGLKVTT
>gi568815582f:69009396_69218459|GENSCAN_predicted_CDS_6|663_bp
atgagggtagctgcggcgactgcggcggctggagcggggccggccatggcggtgtggacg
cgggccaccaaagcggggctggtggagctgctcctgagggagcgctgggtccgagtggtg
gccgagctgagcggggagagcctgagcctgacgggcgacgccgccgcggccgagctggag
cccgctctgggacccgcggccgccgccttcaacggcctcccaaacggcggcggcgcgggc
gactcgctgcccgggagcccaagccgcggcctggggcccccgagcccgccggcgccgcct
cggggccccgcgggtgaggcgggcgcgtcgccgcccgtgcgccgggtgcgggtggtgaag
caagaggcgggcggcctgggcatcagcatcaagggcggccgcgagaaccggatgccgatc
ctcatctccaagatcttccccgggctggctgccgaccagagccgggcgctgcggctgggc
gacgccatcctgtcggtgaacggcaccgacctgcgccaggccacccacgaccaggccgtg
caggcgctgaagcgcgcgggcaaggaggtgctgctggaggggcctccaggggcctggttt
gaaagcagaacattcaggaaagccataggaaaacagcatcctggcttaaaagttaccacc
tga