GENSCAN 1.0	Date run:  4-Nov-116	Time: 21:21:34

Sequence gi568815575r:1365482_1566558 : 201077 bp : 50.85% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  13178  13283  106  0  1  107  105   136 0.992  16.47
 1.02 Intr +  15542  15623   82  2  1   49   84    90 0.859   4.34
 1.03 Term +  16910  16984   75  1  0  164   53    50 0.945   6.94
 1.04 PlyA +  19169  19174    6                               1.05

 2.05 PlyA -  20693  20688    6                               1.05
 2.04 Term -  21278  21121  158  0  2  116   49   322 0.851  29.20
 2.03 Intr -  21938  21798  141  0  0   48   63   298 0.994  23.62
 2.02 Intr -  24246  23760  487  0  1  121   83  1068 0.970 102.19
 2.01 Init -  26528  26418  111  2  0  120   75   295 0.993  31.61
 2.00 Prom -  28114  28075   40                              -4.06

 3.21 PlyA -  28314  28309    6                               1.05
 3.20 Term -  38008  37788  221  2  2  106   48   361 0.921  31.10
 3.19 Intr -  47439  47251  189  1  0   56   94   175 0.500  14.46
 3.18 Intr -  52635  52492  144  1  0   79   99   209 0.999  21.35
 3.17 Intr -  52956  52831  126  1  0   43   62    86 0.811   2.15
 3.16 Intr -  53633  53501  133  2  1   82   90   215 0.975  21.32
 3.15 Intr -  56361  56177  185  1  2   84   69   223 0.770  19.51
 3.14 Intr -  60206  60044  163  2  1   99   56   223 0.998  19.75
 3.13 Intr -  62640  62253  388  2  1  109   77   528 0.999  48.59
 3.12 Intr -  66896  66788  109  0  1  110   84   239 0.999  25.04
 3.11 Intr -  69602  69541   62  1  2  105   86    34 0.915   3.28
 3.10 Intr -  70277  70213   65  2  2  118  119    98 0.838  13.42
 3.09 Intr -  73663  73616   48  1  0   72   89    92 0.294   6.58
 3.08 Intr -  76836  76726  111  0  0  134   22   201 0.269  18.68
 3.07 Intr -  77274  77075  200  1  2   79   65    68 0.766   2.67
 3.06 Intr -  77724  77529  196  0  1   52   66    92 0.017   2.39
 3.05 Intr -  87425  87267  159  2  0   88   64   187 0.873  16.48
 3.04 Intr -  99462  99347  116  1  2   86    8    70 0.797  -1.03
 3.03 Intr - 101101 100066 1036  1  1  136   78  1715 0.942 165.40
 3.02 Intr - 103189 103021  169  0  1   94   76    50 0.642   4.35
 3.01 Init - 116580 116471  110  0  2   38   26   108 0.250  -0.71
 3.00 Prom - 118602 118563   40                              -0.76

 4.08 PlyA - 120034 120029    6                               1.05
 4.07 Term - 121794 121695  100  2  1  112   55    48 0.008   1.50
 4.06 Intr - 128330 128245   86  2  2   74   44    97 0.001   2.52
 4.05 Intr - 152073 151958  116  1  2   50   53    78 0.074   0.67
 4.04 Intr - 164070 163937  134  2  2   79   76    34 0.201   1.59
 4.03 Intr - 171667 171440  228  0  0    8   99   199 0.169   9.98
 4.02 Intr - 177124 176999  126  0  0   63   12   110 0.021   0.69
 4.01 Init - 181802 181411  392  2  2  106   -8   168 0.084   3.26

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575r:1365482_1566558|GENSCAN_predicted_peptide_1|87_aa
XCDQEEGANTRAWRTSLLIALGTLLALVCVFVICRRYLVMQRLFPRIPHMKDPIGDSFQN
DKLVVWEAGKAGLEECLVTEVQVVQKT

>gi568815575r:1365482_1566558|GENSCAN_predicted_CDS_1|264_bp
nagtgcgaccaggaggagggcgcaaacacacgtgcctggcggacgtcgctgctgatcgcg
ctggggacgctgctggccctggtctgtgtcttcgtgatctgcagaaggtatctggtgatg
cagagactctttccccgcatccctcacatgaaagaccccatcggtgacagcttccaaaac
gacaagctggtggtctgggaggcgggcaaagccggcctggaggagtgtctggtgactgaa
gtacaggtcgtgcagaaaacttga

>gi568815575r:1365482_1566558|GENSCAN_predicted_peptide_2|298_aa
MTEQAISFAKDFLAGGIAAAISKTAVAPIERVKLLLQVQHASKQIAADKQYKGIVDCIVR
IPKEQGVLSFWRGNLANVIRYFPTQALNFAFKDKYKQIFLGGVDKHTQFWRYFAGNLASG
GAAGATSLCFVYPLDFARTRLAADVGKSGTEREFRGLGDCLVKITKSDGIRGLYQGFSVS
VQGIIIYRAAYFGVYDTAKGMLPDPKNTHIVVSWMIAQTVTAVAGVVSYPFDTVRRRMMM
QSGRKGADIMYTGTVDCWRKIFRDEGGKAFFKGAWSNVLRGMGGAFVLVLYDELKKVI

>gi568815575r:1365482_1566558|GENSCAN_predicted_CDS_2|897_bp
atgacggaacaggccatctccttcgccaaagacttcttggccggaggcatcgccgccgcc
atctccaagacggccgtggctccgatcgagcgggtcaagctgctgctgcaggtccagcac
gccagcaagcagatcgccgccgacaagcagtacaagggcatcgtggactgcattgtccgc
atccccaaggagcagggcgtgctgtccttctggaggggcaaccttgccaacgtcattcgc
tacttccccactcaagccctcaacttcgccttcaaggataagtacaagcagatcttcctg
gggggcgtggacaagcacacgcagttctggaggtactttgcgggcaacctggcctccggc
ggtgcggccggcgcgacctccctctgcttcgtgtacccgctggatttcgccagaacccgc
ctggcagcggacgtgggaaagtcaggcacagagcgcgagttccgaggcctgggagactgc
ctggtgaagatcaccaagtccgacggcatccggggcctgtaccagggcttcagtgtctcc
gtgcagggcatcatcatctaccgggcggcctacttcggcgtgtacgatacggccaagggc
atgctccccgaccccaagaacacgcacatcgtggtgagctggatgatcgcgcagaccgtg
acggccgtggccggcgtggtgtcctaccccttcgacacggtgcggcggcgcatgatgatg
cagtccgggcgcaaaggagctgacatcatgtacacgggcaccgtcgactgttggaggaag
atcttcagagatgaggggggcaaggccttcttcaagggtgcgtggtccaacgtcctgcgg
ggcatggggggcgccttcgtgctggtcctgtacgacgagctcaagaaggtgatctaa

>gi568815575r:1365482_1566558|GENSCAN_predicted_peptide_3|1309_aa
MRYRNQRNAAAAFAEDADQNLHGASAEDSMQISLGKRQFSAQKFLDSHLCRPDTAKNNRC
ADFLKFPSCPLSPGGAQTASLCLLCQGSWHQLESPGEDPSRMQVPNSTGPDNATLQMLRN
PAIAVALPVVYSLVAAVSIPGNLFSLWVLCRRMGPRSPSVIFMINLSVTDLMLASVLPFQ
IYYHCNRHHWVFGVLLCNVVTVAFYANMYSSILTMTCISVERFLGVLYPLSSKRWRRRRY
AVAACAGTWLLLLTALSPLARTDLTYPVHALGIITCFDVLKWTMLPSVAMWAVFLFTIFI
LLFLIPFVITVACYTATILKLLRTEEAHGREQRRRAVGLAAVVLLAFVTCFAPNNFVLLA
HIVSRLFYGKSYYHVYKLTLCLSCLNNCLDPFVYYFASREFQLRLREYLGCRRVPRDTLD
TRRESLFSARTTSVRSEAGVLVSVARCTPKFLFISLRGTVVAVVGILLSEERLGLLQPRC
AHRGLRAQKCGRPAPGVDAMVLCPVIGKLLHKRVVLASASPRRQEILSNAMAVCVHDGGV
CPRWRCVSKMAACVHDGGVCPRRRCVSQMAVCVHDDGVCPRWWYASKMVVSLQDGAIIAG
VFPEFPVGRLGFVLCPNLQVSGNPRTLLARGFKARVNTSLLASLFLASSFANLFPREPRR
PLGLRFEVVPSKFKEKLDKASFATPYGYAMETAKQKALEKDLRAPDVVIGADTIVTVGGL
ILEKPVDKQDAYRMLSRLSGREHSVFTGVAIVHCSSKDHQLDTRVSEFYEETKVKFSELS
EELLWEYVHSGEPMDKAGGYGIQALGGMLVESVHGDFLNVVGFPLNHFCKQLVKLYYPPR
PEDLRRSVKHDSIPAADTFEDLSDVEGGGSEPTQRDAGSRDEKAEAGEAGQATAEAECHR
TRETLPPFPTRLLELIEGFMLSKGLLTACKLKVFDLLKDEAPQKAADIASKVDASACGME
RLLDICAAMGLLEKTEQGYSNTETANVYLASDGEYSLHGFIMHNNDLTWNLFTYLEFAIR
EGTNQHHRALGKKAEDLFQDAYYQSPETRLRFMRAMHGMTKLTACQVATAFNLSRFSSAC
DVGGLCPLHVAQSGCCSTGHCVYIPTAGRRREYSPPPLGWKDFPAGCTGALARELAREYP
RMQVTVFDLPDIIELAAHFQPPGPQAVQIHFAAGSVRGGLLLPYLTDVSRFVFKPGDFFR
DPLPSAELYVLCRILHDWPDDKVHKLLSRVAESCKPGAGLLLVETLLDEEKRVAQRALMQ
SLNMLVQTEGKERSLGEYQCLLELHGFHQVQVVHLGGVLDAILATKVAP

>gi568815575r:1365482_1566558|GENSCAN_predicted_CDS_3|3930_bp
atgagatacagaaatcagagaaacgcggccgctgcctttgcagaagacgcagatcaaaat
ctccacggggcgtctgcggaggacagcatgcaaatctcgctgggcaaacgacaattcagt
gcccagaaatttctggacagccacctctgcaggccagacaccgccaagaacaaccgttgt
gctgattttctaaaatttccatcctgcccgctgagccccggcggggcccaaaccgcaagc
ctgtgcctcctgtgtcagggcagctggcatcagttagagagcccgggcgaggacccctcc
aggatgcaggtcccgaacagcaccggcccggacaacgcgacgctgcagatgctgcggaac
ccggcgatcgcggtggccctgcccgtggtgtactcgctggtggcggcggtcagcatcccg
ggcaacctcttctctctgtgggtgctgtgccggcgcatggggcccagatccccgtcggtc
atcttcatgatcaacctgagcgtcacggacctgatgctggccagcgtgttgcctttccaa
atctactaccattgcaaccgccaccactgggtattcggggtgctgctttgcaacgtggtg
accgtggccttttacgcaaacatgtattccagcatcctcaccatgacctgtatcagcgtg
gagcgcttcctgggggtcctgtacccgctcagctccaagcgctggcgccgccgtcgttac
gcggtggccgcgtgtgcagggacctggctgctgctcctgaccgccctgtccccgctggcg
cgcaccgatctcacctacccggtgcacgccctgggcatcatcacctgcttcgacgtcctc
aagtggacgatgctccccagcgtggccatgtgggccgtgttcctcttcaccatcttcatc
ctgctgttcctcatcccgttcgtgatcaccgtggcttgttacacggccaccatcctcaag
ctgttgcgcacggaggaggcgcacggccgggagcagcggaggcgcgcggtgggcctggcc
gcggtggtcttgctggcctttgtcacctgcttcgcccccaacaacttcgtgctcctggcg
cacatcgtgagccgcctgttctacggcaagagctactaccacgtgtacaagctcacgctg
tgtctcagctgcctcaacaactgtctggacccgtttgtttattactttgcgtcccgggaa
ttccagctgcgcctgcgggaatatttgggctgccgccgggtgcccagagacaccctggac
acgcgccgcgagagcctcttctccgccaggaccacgtccgtgcgctccgaggccggggtt
ctggtctccgtagcccggtgcacgccgaaatttctgtttatttcactcaggggcactgtg
gttgctgtggttggaattcttctttcagaggagcgcctggggctcctgcaaccgcgctgc
gcgcaccgcgggctccgggctcagaagtgcggacgcccggctcccggcgtggacgccatg
gtgctgtgcccggtgattgggaagctgctgcacaagcgcgtggtgctggccagcgcctcc
ccacgccgtcaggagatcctcagcaacgcgatggcagtgtgtgtccacgatggcggtgtg
tgtccacgatggcggtgtgtgtccaagatggcggcgtgtgtccacgacggcggtgtgtgt
ccacgacgacggtgtgtgtcccagatggcagtgtgtgtccacgacgacggtgtgtgtcca
cgatggtggtatgcgtccaagatggtggtgtctctccaagatggcgccatcattgctggc
gtttttcctgaatttcctgtgggcaggttgggttttgtactctgccccaaccttcaggtc
tctgggaatcctcggacactgttggctcgtgggttcaaggcccgagtcaatacctccctg
ctggcctctttgttcctggcttcttcctttgccaatctcttccctcgagagccaagaagg
cccctgggtctcaggtttgaggtggtcccctccaagtttaaagagaagctggacaaagcc
tccttcgctactccgtatgggtacgccatggagaccgccaagcagaaggccctggagaaa
gacctgcgggcccccgacgtggtcattggagcggacacgatcgtgacagtcggggggctg
attctggagaagccggtggacaagcaggacgcctacaggatgctgtcccggttgagtggg
agagaacacagcgtgttcacaggtgtcgcgatcgtccactgctccagcaaagaccatcag
ctggacaccagggtctcggaattctacgaggaaacgaaggtgaagttctcggagctgtcc
gaggagctgctctgggaatacgtccacagcggggagcccatggacaaagctggcggctac
gggatccaggccctgggcggcatgctggtggagtccgtacacggggactttctgaacgtg
gtgggattcccgctgaaccacttctgcaagcagctggtgaagctctactacccgccccgt
ccggaggacctgcggcggagtgtcaagcacgactccatcccggccgcggacaccttcgaa
gacctcagtgacgtggaggggggcggctcggagcccactcagagggacgcgggcagccgc
gatgagaaggccgaggcgggagaggcgggacaggccacggcagaggctgagtgtcacagg
actcgggagaccctgcctccgttcccgacacgcctcctggagctgattgagggctttatg
ctatccaagggcctgctcaccgcttgcaaactgaaggtgttcgatttgttaaaagatgaa
gcaccccagaaggctgcggatattgccagcaaagtggacgcctctgcgtgtggaatggag
aggcttctggacatctgtgctgccatggggctcctggagaagacagagcaaggttacagt
aacacagagacagcgaacgtctacctggcatcggatggcgaatactctctgcacggcttc
atcatgcacaataatgacctcacatggaacctctttacatacctggagtttgccatccga
gagggaacaaaccagcaccacagggcgttggggaagaaggcggaagatctgttccaggat
gcgtactaccagagcccggagacgcggctgaggttcatgcgggccatgcacggcatgacg
aagctgactgcgtgccaggtggccacggccttcaatctgtcccgcttctcctccgcctgc
gacgtgggaggtctgtgccccctacacgtggcccagagtggctgctgtagcaccggccat
tgtgtctacatccccacggcaggaaggagaagggagtacagccctccacccctgggctgg
aaggatttccctgcaggctgcacgggtgcactggcccgagagctggcccgtgagtaccct
cgtatgcaggtgactgtgtttgacctcccagacattatcgagctggccgcccacttccaa
ccccccggaccgcaggcagtgcagatccacttcgcagcaggatctgtccggggaggactg
cttcttccatacctgacggacgtatctcgttttgtctttaaaccaggtgactttttcagg
gaccccctccccagcgctgagctgtacgtcctgtgccggatcctgcatgactggccagac
gacaaagtccacaagttactcagcagggtcgccgagagctgcaagccaggggccggcctg
ctgctggtggagacgctcctggatgaggagaagagggtggcgcagcgcgccctgatgcag
tcactgaacatgctggtgcagactgaaggcaaggagcggagcctgggcgagtatcagtgc
ttgctggagctgcacggcttccaccaggtgcaggtggtgcacttggggggtgtcctggat
gccatcttggccaccaaagtggccccctga

>gi568815575r:1365482_1566558|GENSCAN_predicted_peptide_4|393_aa
MAWPPPQPLAALSLCPLGVHIGRAAPNSQELTGQQQALQDPLEAGSLVSQGRRGGLARAA
ISVPLFLQALSVFRSSLNIRPSALPGHEHILFLVSFRETAPNNLTAVTKPTGVQLASSDQ
LRSAQTPSNTIKLKKVSKYSYNLHLFEQHLGREKITAFAVKSPFSKDSLPNATEAAAFAR
LLDRWNWKGSRLPPTNTFSSTLSDHGLQAGGWPPESRALRHGRRLKRCILLPGDPLSSHL
LLLPLLLHRALSIFPGSTHHAPGVHPPVVTPQTAQDIDKHPHWNRISRNSEPQEPSDDFL
NTFQFILRLWRVEQQSKAMYWGLPTRHTEADKTLPQEKSKIVLSVLTEIPSLQANCSFDE
GQLGRKLCVAGKDKEEIETMLDLRTWTPFTVAP

>gi568815575r:1365482_1566558|GENSCAN_predicted_CDS_4|1182_bp
atggcttggcccccaccgcagcccctggccgccttgtctctctgtcccttgggtgtgcac
attgggagggcagctcctaacagccaggaacttacgggtcaacaacaggctttacaggat
cccctggaagcaggcagcctcgtttcccagggcagacgtgggggcctggcccgggctgct
atttctgtccctctcttcctccaggccctgtcggttttccgttcctctctcaacatcagg
ccctcagctctcccgggacatgaacacatcctgttcctggtttccttcagagaaacagcc
ccaaacaaccttacggcggtcactaaacccactggggttcagctggcatcctcggatcag
ctcagatcggctcagacaccttcaaacaccatcaagctgaagaaggtgtccaaatattcc
tacaacctccacttatttgagcaacacttgggcagagagaagatcacagctttcgccgtg
aagagcccattttcaaaggactctcttccaaatgccaccgaagcggccgcctttgcaagg
ttgctggacagatggaactggaagggcagccgtctgccgcccacgaacaccttctcaagc
actttgagtgaccacggcttgcaagctggtggctggccccccgagtcccgggctctgagg
cacggccgtcgacttaagcgttgcatcctgttacctggagaccctctgagctctcacctg
ctacttctgccgctgcttctgcacagggcactgagcatcttcccaggctccacccaccac
gccccaggagtacaccctccagttgtgacaccccaaactgcccaagatatagacaagcat
ccccactggaacagaatctcccgcaattctgaaccacaggagccctcagatgacttcctt
aataccttccagttcatcctgagactctggagggtggaacagcagtctaaggccatgtac
tggggactgcccacaaggcacacagaggcggacaagacgttgccgcaagaaaagagcaaa
atcgtcttgtccgttttaacggagattcccagtttgcaagccaattgttcctttgacgag
gggcagctaggaaggaagctctgtgtggccgggaaagacaaggaggaaattgaaacaatg
ctggacttgagaacttggacgcctttcactgtggctccctga
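
The following is a minimal sketch, not part of the GENSCAN output itself, of how the records above could be sanity-checked. It assumes whitespace-separated exon rows in the column order shown in the table header (Gn.Ex, Type, S, Begin, End, Len, ...) and relies on two relationships that hold for the predictions listed here: Len equals |End - Begin| + 1 on either strand, and each CDS length equals 3 x (peptide length + 1), since every listed CDS ends in a stop codon. The function names are hypothetical helpers chosen for illustration.

```python
def parse_exon_row(row: str) -> dict:
    """Parse one exon row of the GENSCAN table into a dict.

    Assumes whitespace-separated fields in the order
    Gn.Ex, Type, S, Begin, End, Len, ... as in the table header.
    Only the first six fields are read, so PlyA/Prom rows also parse.
    """
    fields = row.split()
    gene, exon = fields[0].split(".")
    return {
        "gene": int(gene),
        "exon": int(exon),
        "type": fields[1],
        "strand": fields[2],
        "begin": int(fields[3]),
        "end": int(fields[4]),
        "length": int(fields[5]),
    }


def span_length(begin: int, end: int) -> int:
    """Inclusive feature length; Begin > End on the minus strand."""
    return abs(end - begin) + 1


def expected_cds_length(peptide_length: int) -> int:
    """CDS length in bp for a peptide whose CDS includes a stop codon."""
    return 3 * (peptide_length + 1)


if __name__ == "__main__":
    row = parse_exon_row(
        "2.04 Term - 21278 21121 158 0 2 116 49 322 0.851 29.20"
    )
    print(row["type"], span_length(row["begin"], row["end"]) == row["length"])
    # Peptide/CDS lengths taken from the FASTA headers above.
    for aa, bp in [(87, 264), (298, 897), (1309, 3930), (393, 1182)]:
        print(aa, bp, expected_cds_length(aa) == bp)
```

A check like this can flag rows or records that were damaged in transit before the predictions are used downstream.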