GENSCAN 1.0 Date run: 5-Nov-116 Time: 17:57:26 Sequence gi568815582r:69020428_69221173 : 200746 bp : 46.00% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 13385 13537 153 1 0 81 -9 189 0.722 8.48 1.02 Term + 13568 14161 594 1 0 -3 44 360 0.369 17.03 1.03 PlyA + 14952 14957 6 1.05 2.00 Prom + 16789 16828 40 -8.96 2.01 Init + 19024 19026 3 0 0 93 53 0 0.839 -3.00 2.02 Intr + 19881 19994 114 2 0 115 78 97 0.981 12.04 2.03 Intr + 32207 32323 117 1 0 44 36 95 0.033 0.56 2.04 Intr + 49349 49463 115 1 1 87 32 88 0.026 3.12 2.05 Term + 62945 63234 290 0 2 58 41 367 0.028 24.54 2.06 PlyA + 63765 63770 6 1.05 3.00 Prom + 68500 68539 40 -7.06 3.01 Init + 69083 69272 190 1 1 80 100 96 0.459 7.18 3.02 Intr + 85238 85360 123 0 0 68 96 61 0.051 5.46 3.03 Intr + 86943 87139 197 1 2 69 56 118 0.062 5.83 3.04 Intr + 88969 89604 636 0 0 126 110 1142 0.129 112.49 3.05 Intr + 93014 93115 102 1 0 92 110 129 0.998 15.87 3.06 Term + 93916 94839 924 0 0 71 44 1287 0.985 114.78 3.07 PlyA + 97207 97212 6 1.05 4.05 PlyA - 97278 97273 6 1.05 4.04 Term - 100222 99998 225 1 0 59 48 237 0.946 13.58 4.03 Intr - 100743 100626 118 2 1 60 78 87 0.885 5.37 4.02 Intr - 101066 101009 58 0 1 126 111 -19 0.795 2.14 4.01 Init - 101718 101682 37 0 1 35 98 20 0.503 -2.13 4.00 Prom - 104865 104826 40 -5.86 5.00 Prom + 106958 106997 40 -7.66 5.01 Init + 110446 110493 48 0 0 86 86 3 0.916 0.85 5.02 Intr + 112028 112262 235 1 1 93 105 107 0.996 10.16 5.03 Intr + 113031 113191 161 1 2 123 106 138 0.998 18.81 5.04 Intr + 116269 116460 192 0 0 42 45 145 0.848 5.29 5.05 Intr + 117374 117458 85 1 1 82 80 89 0.893 6.89 5.06 Intr + 122751 122962 212 1 2 65 86 196 0.980 15.73 5.07 Intr + 130110 130281 172 2 1 55 90 145 0.986 10.92 5.08 Intr + 130386 130477 92 1 2 100 113 103 0.998 13.71 5.09 Intr + 135444 135566 123 2 0 45 110 26 0.685 1.28 5.10 Intr + 136696 136813 118 0 1 94 75 30 0.836 2.34 5.11 Intr + 139929 140035 107 1 2 66 105 64 0.930 5.83 5.12 Intr + 142656 142751 96 2 0 70 110 118 0.989 12.41 5.13 Intr + 144914 145099 186 1 0 111 97 113 0.999 14.39 5.14 Intr + 146648 146758 111 1 0 84 93 34 0.892 4.08 5.15 Term + 148394 148510 117 1 0 43 41 127 0.931 2.04 5.16 PlyA + 148574 148579 6 1.05 6.00 Prom + 163545 163584 40 -4.26 6.01 Init + 166740 167319 580 2 1 47 84 1126 0.785 101.42 6.02 Term + 180645 180727 83 1 2 94 44 29 0.054 -3.04 6.03 PlyA + 180731 180736 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582r:69020428_69221173|GENSCAN_predicted_peptide_1|248_aa MTEDKEWIPITKLVPLVKDMKIKSLETYLFSLPIEECDIISFFQGASLMGETHTGHRTRF KAFVTVRDYDGHISLSVKCSKEVTTAIQGTITLAKLSIVPMRRGYWGNKIGKSYTTPCMV TGCCSSGLVHLIPAPRNNGIISTPVPKMLLLMAGIDNCYTSARGCTATLDNFTKATFNAI SRTYSCLTPSLWKETIHQASLWNALTITSRPTPEPPCRGPRLQLWLQHKLSYKNNSELHL LKIISRRK >gi568815582r:69020428_69221173|GENSCAN_predicted_CDS_1|747_bp atgactgaggacaaggagtggatacccatcaccaagctggtccctctggtcaaggatatg aagatcaagtccctggagacctatctcttctccctgcccatcgaggagtgtgatatcatt agcttttttcaaggggcatctctcatgggtgagacccacactggccaccgcaccagattc aaggcatttgttactgttagggactatgatggccacatcagtttaagtgttaagtgctcc aaggaggttactactgccatccaagggaccatcaccctggccaagctttccattgtccct atgcggagaggttactgggggaacaagatcggcaagtcctacaccaccccctgtatggtt acaggctgctgtagctctgggctagtacaccttatccctgcccccaggaacaatggcatc atctcgactcctgtgcccaagatgctgctgctgatggccggtattgacaactgctacacc tcagccaggggctgcactgccaccctggacaacttcaccaaggccaccttcaatgccatc tccaggacctatagctgcctgacccccagcctctggaaagagactattcaccaagcctcc ctgtggaatgcactgaccatcacatcaagacccacaccagagcctccatgcagaggaccc aggctccagctgtggttacagcataagctttcatacaagaataatagtgaattacacctg ttaaaaataataagcagaagaaaatga >gi568815582r:69020428_69221173|GENSCAN_predicted_peptide_2|212_aa MVTACLIAVAKTDGEVQVRRAAIHVVVLLLRGLSQKATEVEKLAYVGGMKKQSSGFGFKY GLWSWMSKLKSQLCNLLANIEFLNTTVTRKGVSIPDPKKGILDLAQERIQGESTLQRSLW MSSQISSGRRRGQFSRTEKCRHSRQAVMAVSLMQVLSAVLKDLYHLLKHVVCLEPDDVAK LHAQLALEELDDIMKNFLFPPQKLEKKIMVLP >gi568815582r:69020428_69221173|GENSCAN_predicted_CDS_2|639_bp atggtaacagcttgcctgattgctgtggccaaaacagatggtgaagttcaagtacgcaga gctgccatacatgtggttgtgctgctgcttcggggactcagccagaaagctactgaggtg gaaaaactggcctatgtgggaggtatgaagaaacagtccagtggctttggcttcaagtat ggactctggagctggatgtccaagctgaaatcccagctttgcaacttgctggcaaatatt gaatttctaaatacaactgttacaaggaaaggggtctccattccagaccccaagaaaggg atcttggatctcgcgcaagaaagaattcagggcgagtccacattgcaaagaagtctgtgg atgtcctcacagatctcctcaggcaggaggagagggcagttcagtcgtactgagaaatgc cggcacagtagacaggcggtcatggctgtctctctcatgcaggtgctgagcgccgtcctc aaggatctctaccacctgctgaagcacgtagtgtgtctggagcccgatgacgtggccaag ctccatgcccagttggccctagaagagctggatgacatcatgaaaaacttcctgttccct ccacagaagctggagaagaagatcatggtcctgccgtag >gi568815582r:69020428_69221173|GENSCAN_predicted_peptide_3|723_aa MLQSHAAATLQCAAPLPAMLHSVGTESPLGGWEPVLLLTIPGAASFLQAELRAVLSALCT GWQAPLSCQQEGFAALAFGSPLDSELPLRVQKSGKSALSSPSLQGTLCQGSWGTRKSRAW SDWDPLGFPFLPGWVVGKRHVGVAGAAFAKRRGSSGDEKRGECVRGCFDLMPVQLTTALR VVGTSLFALAVLGGILAAYVTGYQFIHTEKHYLSFGLYGAILGLHLLIQSLFAFLEHRRM RRAGQALKLPSPRRGSVALCIAAYQEDPDYLRKCLRSAQRISFPDLKVVMVVDGNRQEDA YMLDIFHEVLGGTEQAGFFVWRSNFHEAGEGETEASLQEGMDRVRDVVRASTFSCIMQKW GGKREVMYTAFKALGDSVDYIQVCDSDTVLDPACTIEMLRVLEEDPQVGGVGGDVQILNK YDSWISFLSSVRYWMAFNVERACQSYFGCVQCISGPLGMYRNSLLQQFLEDWYHQKFLGS KCSFGDDRHLTNRVLSLGYRTKYTARSKCLTETPTKYLRWLNQQTRWSKSYFREWLYNSL WFHKHHLWMTYESVVTGFFPFFLIATVIQLFYRGRIWNILLFLLTVQLVGIIKATYACFL RGNAEMIFMSLYSLLYMSSLLPAKIFAIATINKSGWGTSGRKTIVVNFIGLIPVSIWVAV LLGGLAYTAYCQDLFSETELAFLVSGAILYGCYWVALLMLYLAIIARRCGKKPEQYSLAF AEV >gi568815582r:69020428_69221173|GENSCAN_predicted_CDS_3|2172_bp atgctgcagtcacatgcagcagccacactgcagtgcgcggcacctctgcctgccatgctc cactctgttggaactgaatcaccgcttgggggttgggaacctgtgttgctccttacaata cccggtgctgcctctttcctgcaggcagagctcagggctgtgctcagcgccctctgcaca ggctggcaagcgcctctttcctgccagcaggaaggttttgctgccttggctttcgggagc cccctagacagcgaactgccccttcgcgtgcagaagtcggggaagagtgctctcagctcg ccttccttgcagggcacactttgccaaggtagctggggtaccaggaagagcagggcctgg tccgactgggatcccttgggttttccgttccttcccggttgggtcgtgggaaagcggcac gtgggtgtggccggcgccgcctttgccaagaggcgaggctcgagcggggatgagaagcgt ggcgagtgcgttcgcggctgctttgacctgatgccggtgcagctgacgacagccctgcgt gtggtgggcaccagcctgtttgccctggcagtgctgggtggcatcctggcagcctatgtg acgggctaccagttcatccacacggaaaagcactacctgtccttcggcctgtacggcgcc atcctgggcctgcacctgctcattcagagcctttttgccttcctggagcaccggcgcatg cgacgtgccggccaggccctgaagctgccctccccgcggcggggctcggtggcactgtgc attgccgcataccaggaggaccctgactacttgcgcaagtgcctgcgctcggcccagcgc atctccttccctgacctcaaggtggtcatggtggtggatggcaaccgccaggaggacgcc tacatgctggacatcttccacgaggtgctgggcggcaccgagcaggccggcttctttgtg tggcgcagcaacttccatgaggcaggcgagggtgagacggaggccagcctgcaggagggc atggaccgtgtgcgggatgtggtgcgggccagcaccttctcgtgcatcatgcagaagtgg ggaggcaagcgcgaggtcatgtacacggccttcaaggccctcggcgattcggtggactac atccaggtgtgcgactctgacactgtgctggatccagcctgcaccatcgagatgcttcga gtcctggaggaggatccccaagtagggggagtcgggggagatgtccagatcctcaacaag tacgactcatggatttccttcctgagcagcgtgcggtactggatggccttcaacgtggag cgggcctgccagtcctactttggctgtgtgcagtgtattagtgggcccttgggcatgtac cgcaacagcctcctccagcagttcctggaggactggtaccatcagaagttcctaggcagc aagtgcagcttcggggatgaccggcacctcaccaaccgagtcctgagccttggctaccga actaagtataccgcgcgctccaagtgcctcacagagacccccactaagtacctccggtgg ctcaaccagcaaacccgctggagcaagtcttacttccgggagtggctctacaactctctg tggttccataagcaccacctctggatgacctacgagtcagtggtcacgggtttcttcccc ttcttcctcattgccacggttatacagcttttctaccggggccgcatctggaacattctc ctcttcctgctgacggtgcagctggtgggcattatcaaggccacctacgcctgcttcctt cggggcaatgcagagatgatcttcatgtccctctactccctcctctatatgtccagcctt ctgccggccaagatctttgccattgctaccatcaacaaatctggctggggcacctctggc cgaaaaaccattgtggtgaacttcattggcctcattcctgtgtccatctgggtggcagtt ctcctgggagggctggcctacacagcttattgccaggacctgttcagtgagacagagcta gccttccttgtctctggggctatactgtatggctgctactgggtggccctcctcatgcta tatctggccatcatcgcccggcgatgtgggaagaagccggagcagtacagcttggctttt gctgaggtgtga >gi568815582r:69020428_69221173|GENSCAN_predicted_peptide_4|145_aa MCLKDLEKYLAHTCFHLLFKDRKLMVQIVISSARAGGLAEWVLMELQGEIEARYSTGLAG NLLGDLHYTTEGIPVLIVGHHILYGKIIHLEKPFAVLVKHTPGDQDCDELGRETGTRYLV TALIKDKILFKTRPKPIITSVPKKV >gi568815582r:69020428_69221173|GENSCAN_predicted_CDS_4|438_bp atgtgtctaaaggacttagaaaagtacctagcacacacttgttttcacttgctttttaaa gacagaaagctcatggtgcaaattgttatttccagtgcgagggctggaggcctggcagaa tgggtgctgatggagctacagggggagatcgaggctcgctacagcactggattagctgga aacctcctgggagacctacattacaccactgagggaatccctgtgctgatcgtggggcat catatcctgtatgggaaaatcatccacctggagaaaccttttgcagtccttgtcaaacac actcctggggatcaggactgtgatgagcttggccgcgagactggcacccggtacctggtg acagcactcatcaaagacaagatccttttcaaaacccgccccaagcccattatcaccagc gtccccaagaaagtatga >gi568815582r:69020428_69221173|GENSCAN_predicted_peptide_5|684_aa MAVVVHQTASGIKFWKPPVPPARGLTCNGGRRARRRAANGRQPNLPPPSPPPRALPAHFR LCARSTSGAEPESARARGAGAERGEHREGERGAAGMGEFKVHRVRFFNYVPSGIRCVAYN NQSNRLAVSRTDGTVEIYNLSANYFQEKFFPGHESRATEALCWAEGQRLFSAGLNGEIME YDLQALNIKYAMDAFGGPIWSMAASPSGSQLLVGCEDGSVKLFQITPDKIQFERNFDRQK SSAVHKMIVDRQYMGVSKRKCIVWGVAFLSDGTIISVDSAGKVQFWDSATGTLVKSHLIA NADVQSIAVADQEDSFVVGTAEGTVFHFQLVPVTSNSSEKQWVRTKPFQHHTHDVRTVAH SPTALISGGTDTHLVFRPLMEKVEVKNYDAALRKITFPHGPENIICSCISPCGSWIAYST VSRFFLYRLNYEHDNISLKRILFSEDSTKLFVASNQGALHIVQLSGGSFKHLHAFQPQSG TVEAMCLLAVSPDGNWLAASGTSAGVHVYNVKQLKLHCTVPAYNFPVTAMAIAPNTNNLV IAHSDQQVFEYSIPDKQYTDWSRTVQKQGFHHLWLQRDTPITHISFHPKRPMHILLHDAY MFCIIDKSLPLPNDKTLLYNPFPPTNESDVIRRRTAHAFKISKIYKPLLFMDLLDERTLV AVERPLDDIIAQLPPPIKKKKFGT >gi568815582r:69020428_69221173|GENSCAN_predicted_CDS_5|2055_bp atggcagttgtggttcaccaaactgcttcagggattaagttctggaagcctcccgtgccc cccgcccgcggcctgacctgcaatggcggccgccgagcgcggcgccgcgcggccaacggg cgacaaccgaacctcccgccgccgtcgccgccgccgcgagcactgcctgcgcacttccga ctatgcgccaggagcacttccggggcagagcctgagagcgcgcgcgcacgtggggccggg gcggagagaggcgagcaccgggaaggggagcgtggggccgctggaatgggtgaatttaag gtccatcgagtacgtttctttaattatgttccatcaggaatccgctgtgtggcttacaat aaccagtcaaacagattggctgtttcacgaacagatggcactgtggaaatttataacttg tcagcaaactactttcaggagaaatttttcccaggtcatgagtctcgggctacagaagct ttgtgctgggcagaaggacagcgactctttagtgctgggctcaatggcgagattatggag tatgatttacaggcgttaaacatcaagtatgctatggatgcctttggaggacctatttgg agcatggctgccagccccagtggctctcaacttttggttggttgtgaagatggatctgtg aaactatttcaaattaccccagacaaaatccagtttgaaagaaattttgatcggcagaaa agcagcgctgttcataagatgattgtggacaggcagtatatgggcgtgtctaagcggaag tgcatcgtgtggggtgtcgccttcttgtccgatggcactatcataagtgtggactctgct gggaaggtgcagttctgggactcagccactgggacgcttgtgaagagccatctcatcgct aatgctgacgtgcagtccattgctgtagctgaccaagaagacagtttcgtggtgggcaca gccgagggaacagtcttccattttcagctggtccctgtgacatctaacagcagtgagaag cagtgggtgcggacaaaaccgttccagcatcacactcatgacgtgcgcactgtggcccac agcccaacagcgctgatatctggaggcactgacacccacttagtctttcgtcctctcatg gagaaggtggaagtaaagaattacgatgccgctctccgaaaaatcacctttccccacggt cctgagaacattatctgtagctgtatctccccatgtggaagttggatagcctattctaca gtttctcggttttttctctatcggctgaattatgaacatgacaacataagcctcaaaagg attttgttttctgaagattcaacaaagctctttgtagcatcaaatcaaggagctctgcat attgttcagctgtcaggaggaagcttcaagcacctgcatgctttccagcctcagtcagga acagtggaggccatgtgtcttttggcagtcagtccagatgggaattggctagctgcatca ggtaccagtgctggagtccatgtctacaacgtaaaacagctaaagcttcactgcacggtg cctgcttacaatttcccagtgactgctatggctattgcccccaataccaacaaccttgtc atcgctcattcggaccagcaggtatttgagtacagcatcccagacaaacagtatacagat tggagccggactgtccagaagcagggctttcaccacctttggctccaaagggatactcct atcacacacatcagttttcatcccaagagaccgatgcacatccttctccatgatgcctac atgttctgcatcattgacaagtcattgccccttccaaatgacaaaaccttactctacaat ccatttcctcccacgaatgaatcagatgtcatccggaggcgcacagctcatgcttttaaa atttctaagatatataagcctctactcttcatggatcttttggatgaaagaacactcgtg gcagtagaacggcctctggatgacatcattgctcagctcccaccacccattaaaaagaag aaatttggaacctaa >gi568815582r:69020428_69221173|GENSCAN_predicted_peptide_6|220_aa MRVAAATAAAGAGPAMAVWTRATKAGLVELLLRERWVRVVAELSGESLSLTGDAAAAELE PALGPAAAAFNGLPNGGGAGDSLPGSPSRGLGPPSPPAPPRGPAGEAGASPPVRRVRVVK QEAGGLGISIKGGRENRMPILISKIFPGLAADQSRALRLGDAILSVNGTDLRQATHDQAV QALKRAGKEVLLEGPPGAWFESRTFRKAIGKQHPGLKVTT >gi568815582r:69020428_69221173|GENSCAN_predicted_CDS_6|663_bp atgagggtagctgcggcgactgcggcggctggagcggggccggccatggcggtgtggacg cgggccaccaaagcggggctggtggagctgctcctgagggagcgctgggtccgagtggtg gccgagctgagcggggagagcctgagcctgacgggcgacgccgccgcggccgagctggag cccgctctgggacccgcggccgccgccttcaacggcctcccaaacggcggcggcgcgggc gactcgctgcccgggagcccaagccgcggcctggggcccccgagcccgccggcgccgcct cggggccccgcgggtgaggcgggcgcgtcgccgcccgtgcgccgggtgcgggtggtgaag caagaggcgggcggcctgggcatcagcatcaagggcggccgcgagaaccggatgccgatc ctcatctccaagatcttccccgggctggctgccgaccagagccgggcgctgcggctgggc gacgccatcctgtcggtgaacggcaccgacctgcgccaggccacccacgaccaggccgtg caggcgctgaagcgcgcgggcaaggaggtgctgctggaggggcctccaggggcctggttt gaaagcagaacattcaggaaagccataggaaaacagcatcctggcttaaaagttaccacc tga