GENSCAN 1.0 Date run: 8-Nov-116 Time: 15:29:37 Sequence gi568815583r:78822133_79044981 : 222849 bp : 47.69% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 8130 8223 94 2 1 79 81 27 0.299 0.64 1.02 Intr + 8893 9061 169 2 1 76 12 140 0.419 4.20 1.03 Intr + 9073 9612 540 1 0 62 50 717 0.293 57.32 1.04 Intr + 9884 10013 130 2 1 76 55 113 0.262 7.50 1.05 Intr + 18420 18543 124 2 1 45 98 62 0.036 3.06 1.06 Term + 40542 40795 254 1 2 53 38 134 0.082 0.70 1.07 PlyA + 40814 40819 6 1.05 2.00 Prom + 45636 45675 40 -4.36 2.01 Init + 51275 51597 323 1 2 51 52 249 0.803 12.41 2.02 Intr + 64009 64095 87 1 0 101 89 107 0.995 11.19 2.03 Intr + 68857 68882 26 1 2 130 69 25 0.800 2.47 2.04 Intr + 69352 69440 89 2 2 54 82 61 0.991 1.79 2.05 Intr + 70080 70181 102 2 0 88 93 60 0.982 6.87 2.06 Intr + 71407 71495 89 0 2 65 60 76 0.903 1.27 2.07 Term + 74851 74935 85 1 1 84 54 102 0.822 3.53 2.08 PlyA + 75078 75083 6 1.05 3.15 PlyA - 75192 75187 6 1.05 3.14 Term - 95977 95843 135 1 0 59 42 128 0.710 3.32 3.13 Intr - 100073 100004 70 1 1 76 50 83 0.709 2.48 3.12 Intr - 100986 100861 126 2 0 92 102 28 0.951 4.39 3.11 Intr - 103308 103202 107 0 2 87 109 163 0.989 17.31 3.10 Intr - 105649 105581 69 1 0 84 78 139 0.935 11.88 3.09 Intr - 107361 107280 82 2 1 101 88 122 0.997 13.14 3.08 Intr - 109374 109253 122 0 2 92 28 163 0.851 9.99 3.07 Intr - 109846 109737 110 2 2 35 75 77 0.705 1.10 3.06 Intr - 110326 110240 87 2 0 114 91 116 0.723 14.44 3.05 Intr - 112950 112767 184 0 1 60 31 115 0.321 2.36 3.04 Intr - 113618 113548 71 0 2 78 96 40 0.949 2.70 3.03 Intr - 115291 115186 106 1 1 93 68 149 0.908 13.19 3.02 Intr - 118970 118939 32 0 2 120 48 38 0.329 0.85 3.01 Init - 122849 122759 91 2 1 64 73 200 0.495 14.75 3.00 Prom - 133906 133867 40 -0.96 4.24 PlyA - 134973 134968 6 1.05 4.23 Term - 140104 140012 93 1 0 125 48 50 0.361 2.53 4.22 Intr - 149802 149734 69 0 0 105 121 74 0.999 11.78 4.21 Intr - 151288 151171 118 0 1 39 74 213 0.553 15.47 4.20 Intr - 158567 158488 80 2 2 94 106 41 0.966 4.85 4.19 Intr - 163072 162875 198 1 0 32 80 328 0.973 25.95 4.18 Intr - 168141 168057 85 2 1 111 86 76 0.989 9.52 4.17 Intr - 169752 169559 194 0 2 18 80 176 0.315 8.09 4.16 Intr - 173668 173608 61 0 1 91 89 92 0.912 8.34 4.15 Intr - 176076 175964 113 0 2 62 103 248 0.995 22.88 4.14 Intr - 176693 176587 107 0 2 51 94 102 0.576 7.03 4.13 Intr - 177781 177611 171 2 0 100 91 188 0.999 20.21 4.12 Intr - 179655 179530 126 1 0 80 87 114 0.985 11.15 4.11 Intr - 182043 181670 374 2 2 133 70 714 0.999 68.81 4.10 Intr - 184302 184054 249 2 0 141 39 649 0.844 61.85 4.09 Intr - 193277 193195 83 2 2 74 108 136 0.922 12.64 4.08 Intr - 194468 194368 101 0 2 77 81 63 0.783 4.33 4.07 Intr - 197972 197909 64 2 1 86 113 52 0.942 5.79 4.06 Intr - 203342 203182 161 0 2 103 116 56 0.977 9.61 4.05 Intr - 205727 205609 119 1 2 117 86 292 0.967 32.01 4.04 Intr - 209377 209268 110 1 2 118 94 186 0.999 21.28 4.03 Intr - 210184 209991 194 2 2 105 43 481 0.749 44.51 4.02 Intr - 213078 212999 80 2 2 99 92 84 0.763 9.09 4.01 Init - 218452 218391 62 1 2 58 95 21 0.336 0.52 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:78822133_79044981|GENSCAN_predicted_peptide_1|436_aa ISCFSVFSPVVSSAADLGKKIRDTCRADHSEAGRKRRRGLLPNSSLGNGSENTSPARSPR SLHGLDGVAGAVRAPALRALIAGAYWALPGGQRLVLVRVRSQQWRRHWLLNCFLLNLAAT DLQFVLTLPFWAVDTARDFSWPFGGAICKVMLTLTVLNMYASIFLLSAMSVARYCIVTGA LPPSHRGASRASCVCCLLWAMAVLATAPTALFATAARVGGKHSCLLRFPAGGPKWQVLYH LQKIAVAFVLPLATLGTCSLLLRFLRLWAFESCVAEPSGRCPSEQAPTAAPQRLHSSNEG RLSLRQYQGHGQKRMIPFPHPQVTDALGRYSNYILSSTVESPSLRSMNFEKQGGNLEKDT QTQTPTSGNLGESHVQTKAETGVMLLQAKEHQMLLQAKERQRLPANHQKLGERNGTDSSS QSTEGTKPTNTFIWDV >gi568815583r:78822133_79044981|GENSCAN_predicted_CDS_1|1311_bp attagttgcttctccgtcttcagcccagttgtaagcagtgcggctgaccttgggaagaag atcagggatacttgcagagcagaccacagtgaagctggccggaagaggcgtcgggggctg ctccctaacagctccctgggcaacggatcagagaacactagcccggcccggagtccccgc agcctgcatggcctggacggggtggcgggggccgtccgggcgccggcgctgcgggctctg attgcgggcgcctactgggccctgcctggtgggcaacggctggtgctagtccgggtgagg tcccagcagtggcgccgccactggctgctcaattgcttcctcctcaatctggcagccact gacctgcagtttgtgctaacgctgcccttttgggccgtggacacggcgcgcgactttagc tggcccttcgggggtgccatctgcaaggtgatgctgacgctcaccgtgctcaacatgtat gccagcatcttcctcctcagtgccatgagcgtggcacgctattgcattgtgactggcgcg ctgcctccgagccatcggggcgcatcacgggccagctgtgtgtgctgcctgctctgggct atggccgtcctggctacggcgcccaccgccctgttcgccacggcagctagggtgggggga aagcactcgtgcctgctgcgcttccccgccggcggccccaaatggcaggtgctctaccac ctgcagaagatcgcagtagccttcgtgctgccgctggccacgctgggcacctgttcgctg ctgctgcgcttcctgcgactgtgggccttcgagagctgtgtagctgagcccagcggccgg tgcccctccgagcaagctcccacggcagcgccccagcggctgcactcttccaatgagggc cgcttgtcgctgcgtcagtatcaagggcatgggcaaaaaagaatgatccccttcccccat cctcaagtgactgatgctcttggaagatactccaactatatcctgagtagcactgtggag agccccagtctgagaagcatgaactttgagaagcaagggggaaatttggaaaaagacaca caaacccagacaccaacatctggaaacctgggagaaagccatgtgcagacgaaggcagaa actggggtaatgcttctacaagctaaggaacaccaaatgctcctgcaagctaaggaacgc caaagattgccagcaaaccaccagaagctaggggagaggaatggcacagattcttcctca cagtccacagaaggaaccaaacctaccaacacctttatatgggatgtctag >gi568815583r:78822133_79044981|GENSCAN_predicted_peptide_2|266_aa MWRRGGRGGGCFGRSCGAPGRLGSYDGFFGASWSPGVVAFGGNSAADGGGFGATGREKCV VKWPLQDGGAIMAAGVSVPYGGTAYGQMQRPLPRRPEGCRGPPHTTECWDEWVPESRVLK YVDTNLQKQRELQKANQKTKKNKQKTPGNGDGGSTSETPQPPRKKRARVDPTVENEETFM NRVEVKVKIPEELKPWLVDDWDLITRQKQLFYLPAKKNVDSILEDYANYKKSRGNTDNKY LAKNSATLFSASDYEVAPPEYHRKAV >gi568815583r:78822133_79044981|GENSCAN_predicted_CDS_2|801_bp atgtggcggcggggggggaggggcggtggctgtttcgggcggtcctgcggcgcgcccggc cgcttgggcagctacgacgggtttttcggtgcttcctggagccccggggtggttgcgttc ggtggcaactcagctgcggatgggggtgggtttggcgccacggggcgggagaagtgcgtt gtaaaatggccgttgcaagatggcggcgccatcatggccgccggcgtctcggtcccgtat ggaggcacagcttacgggcagatgcagcgcccccttccacgacgaccagaaggttgccgg ggccctccgcataccacagagtgttgggatgaatgggttccggagagcagagtactcaaa tacgtggacaccaatttgcagaaacagcgagaacttcaaaaagccaatcagaaaacgaaa aagaacaaacagaaaacacctggaaatggagatggtggcagtaccagtgagacccctcag cctcctcggaagaaaagggcccgggtagatcctactgttgaaaatgaggaaacattcatg aacagagttgaagttaaagtaaagattcctgaagagctaaaaccgtggcttgttgatgac tgggacttaattaccaggcaaaaacagctcttttatcttcctgccaagaagaatgtggat tccattcttgaggattatgcaaattacaagaaatctcgtggaaacacagataataagtac ctggcaaagaattctgcaactttgttcagtgccagcgattatgaagtggctcctcctgag taccatcggaaagctgtgtga >gi568815583r:78822133_79044981|GENSCAN_predicted_peptide_3|463_aa MWATLPLLCAGAWLLGVPVCGAAELCVNSLEKNVKVIKHKNHRKTYSTEEYHHRLQTFAS NWRKINAHNNGNHTFKMALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPYPPSV DWRKKGNFVSPVKNQVCLAISILSLQNGHTPRQYTRKNNSKGCLRQLLDFLHHWGPGVCD RHRNRKDAVLETDDAPEGTILAPASVEKGGTVHSRPGDAMRTLGCGQAEQQLVDCAQDFN NHGCQGYVSDQKGPKTSSFPTDRFDRIQGLPSQAFEYILYNKGIMGEDTYPYQGKDGYCK FQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMMYRTGIYSSTSCHKT PDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYPIPLA PGSTQDVYNIQCWQDCCETSSMACALVLFGFKEIGRHGYIQLR >gi568815583r:78822133_79044981|GENSCAN_predicted_CDS_3|1392_bp atgtgggccacgctgccgctgctctgcgccggggcctggctcctgggagtccccgtctgc ggtgccgccgaactgtgcgtgaactccttagaaaagaatgtgaaggtgatcaagcataaa aatcaccgtaagacctacagtacggaggagtaccaccacaggctgcagacgtttgccagc aactggaggaagataaacgcccacaacaatgggaaccacacatttaaaatggcactgaac caattttcagacatgagctttgctgaaataaaacacaagtatctctggtcagagcctcag aattgctcagccaccaaaagtaactaccttcgaggtactggtccctacccaccttccgtg gactggcggaaaaaaggaaattttgtctcacctgtgaaaaatcaggtatgcctggcaatc tccatcctctccctgcaaaacggccacactcccagacaatacaccaggaagaacaactca aaagggtgcctgcggcagttgctggactttctccaccactggggccctggagtctgcgat cgccatcgcaaccggaaagatgctgtccttgagacggatgatgcccctgaaggaactatc ctagcccctgcctctgtggagaaaggagggacagtgcacagccggcctggtgatgccatg aggaccctgggttgtggacaggcggaacagcagctggtggactgcgcccaggacttcaat aatcacggctgccaagggtacgtctctgaccagaaggggcccaaaacttcctcatttccc actgaccgcttcgacaggatccagggtctccccagccaggctttcgagtatatcctgtac aacaaggggatcatgggtgaagacacctacccctaccagggcaaggatggttattgcaag ttccaacctggaaaggccatcggctttgtcaaggatgtagccaacatcacaatctatgac gaggaagcgatggtggaggctgtggccctctacaaccctgtgagctttgcctttgaggtg actcaggacttcatgatgtatagaaccggcatctactccagtacttcctgccataaaact ccagataaagtaaaccatgcagtactggctgttgggtatggagaaaaaaatgggatccct tactggatcgtgaaaaactcttggggtccccagtggggaatgaacgggtacttcctcatc gagcgcggaaagaacatgtgtggcctggctgcctgcgcctcctaccccatccctctggca ccagggagcacacaggatgtttataacatccagtgttggcaggactgttgtgaaacaagc agcatggcctgtgccttggtgctgtttggatttaaagagatcggcagacatggctacatc caactgagataa >gi568815583r:78822133_79044981|GENSCAN_predicted_peptide_4|1003_aa MACAKALRQKKANPGNRRPECETIMFLHQIFYQGLKARISSWPTLVLADLFDILLPMLNI YQEFVRNHQYSLQILAHCKQNRDFDKLLKHYEAKPDCEERTLETFLTYPMFQIPRYILTL HELLAHTPHEHVERNSLDYAKSKLEELSRIMHDEVSETENIRKNLAIERMIIEGCEILLD TSQTFVRQGSLIQVPMSEKGKITRGRLGSLSLKKEGERQCFLFSKHLIICTRGSGGKLHL TKNGVISLIDCTLLEEPESTEEEVSRSRQRPGAACPEGAHQASALLAAVVDASEGVKCVD NIRCNGLMMNAFEENSKVTVPQMIKSDASLYCDDVDIRFSKTMNSCKVLQIRYASVERLL ERLTDLRFLSIDFLNTFLHSYRVFTTAIVVLDKLITIYKKPISAIPARSLELLFASGQNN KLLYGEPPKSPRATRKFSSPPPLSITKTSSPSRRRKLSLNIPIITGGKALDLAALSCNSN GYTSMYSAMSPFSKATLDTSKLYVSSSFTNKIPDEGDTTPEKPEDPSALSKQSSEVSMRE ESDIDQNQSDDGDTETSPTKSPTTPKSVKNKNSSEFPLFSYNNGVVMTSCRELDNNRSAL SAASAFAIATAGANEGTPNKEKYRRMSLASAGFPPDQRNGDKEFVIRRAATNRVLNVLRH WVSKHSQDFETNDELKCKVIGFLEEVMHDPELLTQERKAAANIIRTLTQEDPGDNQITLE EITQMGEIPPQRQLGIQEVSVMGLTQNVAPPTQNQAEGVKAEPFENHSALEIAEQLTLLD HLVFKKIPYEEFFGQGWMKLEKNERTPYIMKTTKHFNDISNLIASEIIRNEDINARVSAI EKWVAVADICRCLHNYNAVLEITSSMNRSAIFRLKKTWLKVSKQTKALIDKLQKLVSSEG RFKNLREALKNCDPPCVPYLGMYLTDLAFIEEGTPNYTEDGLVNFSKMRMISHIIREIRQ FQQTAYKIEHQAKVTQYLLDQSFVMDEESLYESSLRIEPKLPT >gi568815583r:78822133_79044981|GENSCAN_predicted_CDS_4|3012_bp atggcctgtgcaaaggccctaaggcagaagaaagcaaatccagggaacagaaggccagaa tgcgaaaccatcatgtttttacatcagatcttttaccaaggcctgaaggcccgcatctcc agctggcccacgctggtcctggctgacctatttgacatcctgctgcccatgctcaacatc taccaagagttcgtccgcaaccaccagtacagcctgcagatcctggcccactgcaagcag aaccgtgacttcgacaagctgctgaagcactacgaggccaagcctgactgcgaggagagg acgctggagaccttcctcacctaccccatgttccagatccccaggtacatcctgaccctc catgagctcctggcccacacgcctcatgagcacgttgagcgcaacagcctggactacgcc aagtccaaactggaggagctgtccagaataatgcacgatgaagtaagtgagacggagaac atccggaaaaacctggccatcgagcgcatgatcatcgaaggctgtgagatcctcctggac accagccagacctttgtgagacaaggttccctcattcaggtgcccatgtctgaaaagggc aagatcaccagggggcgcctggggtctctctccctaaagaaagagggcgagcgacagtgc ttcctgttttctaagcatctgattatctgtaccagaggctctggagggaagcttcacttg accaagaatggagtcatatccctcattgactgcactttattggaggagccagaaagcacg gaggaggaagtgagccgcagccgtcaaaggcctggggctgcttgcccagagggtgcccac caggcctctgccctcctggcagccgttgtggatgccagtgaaggggtcaagtgtgtggat aacatccgatgcaatgggctcatgatgaacgcatttgaagaaaattccaaggtcactgtg ccgcagatgatcaagtccgacgcctccttatattgtgatgatgttgacattcgcttcagc aaaaccatgaactcctgcaaagtgctgcagatccgctacgccagtgtggagcggctgctg gagaggctgacggacctgcgcttcctgagcatcgacttcctcaacaccttcctgcactcc taccgcgtcttcaccaccgccatcgtggtcctggacaagctcattaccatctacaagaag cctatcagtgccattcctgccaggtcgctggagctcctgtttgccagtggccagaacaat aagctcctgtacggtgaaccccccaagtccccgcgcgccacccgcaagttctcctcgccg ccacctctgtccatcaccaagacatcgtcaccgagccgccggcggaagctctccctgaac atccccatcatcactggcggcaaggccctggacctggccgccctcagctgcaactccaat ggctacaccagcatgtactcggccatgtcacccttcagcaaggccacgctggacaccagc aagctctatgtgtccagcagcttcaccaacaagattccagatgagggcgatacgacccct gagaagcccgaagacccttcagcgctcagcaagcagagctcagaagtctccatgagagag gagtcagatattgatcaaaaccagagtgatgatggtgatactgaaacatcaccaactaaa tctccaacaacacccaaatcagtcaaaaacaaaaattcttcagagttcccactcttttcc tataacaatggagtcgtcatgacctcctgtcgtgaactggacaataaccgcagtgccttg tcggccgcctctgcctttgccatagcaaccgccggggccaacgagggcaccccaaacaag gagaagtaccggaggatgtccttagccagtgcagggtttcccccagaccagaggaatgga gacaaggagtttgtgatccgcagagcagccaccaatcgtgtcttgaacgtgctccgccac tgggtgtccaagcactctcaggactttgagaccaacgatgagctcaaatgcaaggtgatc ggcttcctggaagaagtcatgcacgacccggagctcctgacccaggagcggaaggctgca gccaacatcatcaggactctgacccaggaggacccaggtgacaaccagatcacgctggag gagatcacgcagatgggcgagatacccccacagaggcagctgggaatccaggaggtgtcc gtcatgggcctaactcaaaatgttgctcccccaactcaaaaccaggctgaaggcgtgaag gctgagccctttgaaaaccactcagccctggagatcgcggagcagctgaccctgctagat cacctcgtcttcaagaagattccttatgaggagttcttcggacaaggatggatgaaactg gaaaagaatgaaaggaccccttatatcatgaaaaccactaagcacttcaatgacatcagt aacttgattgcttcagaaatcatccgcaatgaggacatcaacgccagggtgagcgccatc gagaagtgggtggccgtagctgacatatgccgctgcctccacaactacaatgccgtactg gagatcacctcgtccatgaaccgcagtgcaatcttccggctcaaaaagacgtggctcaaa gtctctaagcagactaaagctttgattgataagctccaaaagcttgtgtcatctgagggc agatttaagaatctcagagaagctctgaaaaattgtgacccaccctgtgtcccttacctg gggatgtacctcaccgacctggccttcatcgaggaggggacgcccaattacacggaagac ggcctggtcaacttctccaagatgaggatgatatcccatattatccgagagattcgccag tttcaacaaactgcctacaaaatagagcaccaagcaaaggtaacgcaatatttactggac caatcttttgtaatggatgaagaaagcctctacgagtcttctctccgaatagaaccaaaa ctccccacctga