GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:37:54

Sequence gi568815591r:95485644_95696293 : 210650 bp : 37.90% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +    329    515  187  1  1   82   93   154 0.997  14.57
 1.02 Intr +  10115  10414  300  0  0  129   99   112 0.968  12.18
 1.03 Intr +  15702  15755   54  1  0   75   91    61 0.873   3.13
 1.04 Intr +  18695  18748   54  0  0   75   91    61 0.802   3.13
 1.05 Term +  19265  19341   77  0  2   35   47    93 0.381  -3.08
 1.06 PlyA +  19918  19923    6                               1.05

 2.00 Prom +  19935  19974   40                              -5.65
 2.01 Init +  27491  27530   40  1  1   87  116    64 0.890   9.50
 2.02 Intr +  42170  42660  491  0  2  107  100   729 0.124  67.80
 2.03 Intr +  50794  50907  114  0  0   97   84    97 0.807   9.92
 2.04 Term +  65020  65133  114  0  0   76   33    64 0.045  -2.71
 2.05 PlyA +  65356  65361    6                               1.05

 3.02 PlyA -  65977  65972    6                               1.05
 3.01 Sngl -  66934  66566  369  1  0   97   48   268 0.719  19.08
 3.00 Prom -  72890  72851   40                              -8.35

 4.00 Prom +  75083  75122   40                              -3.35
 4.01 Sngl +  81802  82197  396  0  0   69   35   209 0.735   9.70
 4.02 PlyA +  82307  82312    6                               1.05

 5.12 PlyA -  83867  83862    6                               1.05
 5.11 Term - 100138  99998  141  1  0  113   49   122 0.950   7.75
 5.10 Intr - 101480 101367  114  2  0  101   63   109 0.998   9.42
 5.09 Intr - 101885 101775  111  2  0   95   91    98 0.999  10.46
 5.08 Intr - 102182 102084   99  2  0   77   86   111 0.989   9.19
 5.07 Intr - 103629 103540   90  0  0  102   24    58 0.472   0.07
 5.06 Intr - 104073 103997   77  1  2   78   86    10 0.975  -1.88
 5.05 Intr - 106954 106868   87  2  0   79   95    72 0.983   6.02
 5.04 Intr - 107301 107117  185  2  2   15   98   118 0.729   4.01
 5.03 Intr - 108127 108056   72  0  0  110   63    75 0.710   4.80
 5.02 Intr - 109521 109380  142  1  1   78  119    83 0.992   8.89
 5.01 Init - 110650 110521  130  1  1   90   77   266 0.996  24.06
 5.00 Prom - 112507 112468   40                              -2.25

 6.00 Prom + 118705 118744   40                              -8.05
 6.01 Init + 119697 119887  191  2  2   58   45   173 0.364   6.53
 6.02 Intr + 120288 120441  154  0  1   13   26   113 0.081  -3.25
 6.03 Intr + 127408 127589  182  0  2   36  119   120 0.445   7.64
 6.04 Intr + 152478 152585  108  0  0   79   83    87 0.307   5.78
 6.05 Intr + 155468 155588  121  2  1   97   99    43 0.290   5.88
 6.06 Intr + 185988 186076   89  2  2   76   87    68 0.244   3.35
 6.07 Term + 189786 189864   79  0  1   48   48   108 0.301  -0.94
 6.08 PlyA + 190107 190112    6                               1.05

 7.00 Prom + 192021 192060   40                              -1.25
 7.01 Init + 203959 204057   99  0  0   35   50   119 0.094   3.11
 7.02 Term + 205374 205517  144  2  0   84   48    66 0.092  -0.87
 7.03 PlyA + 207004 207009    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:95485644_95696293|GENSCAN_predicted_peptide_1|223_aa
MDGTTAPVTKSGAAKLVKRNFLEALKSNDFGKLKAILIQRQIDVDTVFEVEDENMVLASY
KQGYWLPSYKLKSSWATGLHLSVLFGHVECLLVLLDHNATINCRPNGKTPLHVACEMANV
DCVKILCDRGAKLNCYSLSGHTALHFCTTPSSILCAKQLVWRVTQVNHMLGNSLVNEVEH
VTQVNHMLGNSLVNEVEHVSQQLLQSRKACSTKNHTTLQETQP

>gi568815591r:95485644_95696293|GENSCAN_predicted_CDS_1|672_bp
atggacggcaccactgcccctgtcactaaatctggagctgccaagttagttaagagaaat
ttccttgaggcgctaaagtccaatgacttcggaaaattgaaggctattttgatccaaagg
caaatagatgtggacactgtttttgaagtcgaagatgagaatatggttttggcatcttat
aaacaaggttactggttgcctagctataaattgaagtcttcctgggccacaggcctccat
ctctctgtcttgtttggccatgtggaatgtcttctggtgctactggaccacaatgctaca
atcaactgtagacccaatgggaaaacccctcttcacgtggcttgtgaaatggccaatgtg
gattgtgttaagatcctctgtgatcgtggggcaaagctcaattgctactccttaagtgga
cacacagctttgcacttttgtacaactccaagttccattctctgtgccaagcaattggtt
tggagagtgacacaagtcaaccacatgttaggaaattccctggtcaatgaagtggaacat
gtgacacaagtcaaccacatgttaggaaattccctggtcaatgaagtggaacatgtcagt
cagcagttactccagtcccgtaaagcatgttccacgaaaaatcataccacacttcaggaa
acacaaccttaa

>gi568815591r:95485644_95696293|GENSCAN_predicted_peptide_2|252_aa
MVTKCGKPLEAGKGANVNMKTNNQDEETPLHTAAHFGLSELVAFYVEHGAIVDSVNAHME
TPLAIAAYWALRFKEQEYSTEHHLVCRMLLDYKAEVNARDDDFKSPLHKAAWNCDHVLMH
MMLEAGAEANLMDINGCAAIQYVLKVTSVRPAAQPEICYQLLLNHGAARIYPPQFHKVIQ
ACHSCPKAIEVVVNAYEHIRWNTKWRRAIPDDDLERLRPWIFEVEPKENSQQLCPFAQEE
SELLLFLISPLL

>gi568815591r:95485644_95696293|GENSCAN_predicted_CDS_2|759_bp
atggtcacgaaatgtgggaagcctctagaagctggaaaaggggcgaatgtgaacatgaag
accaacaaccaagatgaggagacgcccttgcacacggctgcccacttcggcctttcggag
ctggtggccttctacgtggaacacggggccatagtggacagcgtgaatgcccacatggag
acccccctggccatcgccgcctactgggccctccgctttaaggagcaggagtacagcacg
gagcaccacctggtctgccgcatgctgcttgactacaaagccgaagtcaatgcccgagat
gacgactttaaatctcccctccacaaggcagcctggaactgtgaccacgtgctcatgcac
atgatgctggaagctggcgccgaagccaatctcatggatatcaacggctgtgctgccatc
cagtacgtgctgaaggtcacctccgtgcgccctgctgcccagcctgagatctgctaccag
ctcctgttgaaccatggggctgcccgaatataccctccacagttccataaggtgatacag
gcctgccattcttgtcctaaagcaattgaagttgtagtcaatgcctatgaacacatcaga
tggaacacaaagtggagaagagctatccccgatgatgacttggagagactcaggccgtgg
atctttgaggtagaacccaaggaaaatagtcaacaactttgtccatttgcccaggaagag
tctgagttgttgctgttcctcattagtcctctgctttag

>gi568815591r:95485644_95696293|GENSCAN_predicted_peptide_3|122_aa
MPEPLPSMGSCVAGASPTSAVPCSTAPSPIDHPRAEESGHTMQDWQAAPPAAPVWDPLGE
ASWVPDSGGDLQNLCVDTVYLANLVGTWRTVVSSSGIVNAPISALSKQTTWLYQSAGWGG
AR

>gi568815591r:95485644_95696293|GENSCAN_predicted_CDS_3|369_bp
atgcctgagcctctcccttccatgggctcctgtgtggcaggagcctccccgacaagcgcc
gtcccctgctccaccgcgcccagtcccatcgaccacccaagggctgaggagagcgggcac
acgatgcaggactggcaggcagctccacctgcagccccggtgtgggatccactgggtgaa
gccagctgggtacctgactctggtggggacttgcagaacctttgtgtggacactgtttat
ctagctaatctagtggggacgtggagaaccgttgtgtctagctcagggattgtaaacgca
ccaatcagcgccctgtcaaaacagaccacttggctctaccaatcagcaggatggggtggg
gccagataa

>gi568815591r:95485644_95696293|GENSCAN_predicted_peptide_4|131_aa
MLFCKTNKEKKREESNRCNKNDKRDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLD
TYTLPRLNQEELESLNRPITGSEIVAIINSLPTKRVQDQMDSQPNSTRDTRRNWYHSSET
IPINRKRGNPP

>gi568815591r:95485644_95696293|GENSCAN_predicted_CDS_4|396_bp
atgctgttttgcaagactaataaagaaaaaaagagagaagaatcaaatagatgcaataaa
aatgataaaagggatatcaccactgatcccacagaaatacaaactaccatcagagaatac
tataaacacctctacgcaaataaactagaaaatctagaagaaatggataaattcctcgac
acatacaccctcccaagactaaaccaggaagaacttgaatctctgaatagaccaataaca
ggttctgaaattgtggcaataatcaatagcttaccaacaaaaagagtccaggaccagatg
gattcacagccgaattctaccagagatacaaggaggaactggtaccattcctctgaaact
attccaatcaatagaaaaagagggaatcctccctaa

>gi568815591r:95485644_95696293|GENSCAN_predicted_peptide_5|415_aa
MKAARFVLRSAGSLNGAGLVPREVEHFSRYSPSPLSMKQLLDFGSENACERTSFAFLRQE
LPVRLANILKEIDILPTQLVNTSSVQLVKSWYIQSLMDLVEFHEKSPDDQKALSDFVDTL
IKVRNRHHNVVPTMAQGIIEYKDACTVDPVTNQNLQYFLDRFYMNRISTRMLMNQHILIF
SDSQTGNPSHIGSIDPNCDVVAVVQGKFPDQPIHIVYVPSHLHHMLFELFKAGPTPNSAV
QVQNSCRWYGSIQSPECAKSTNAMRATVEHQENQPSLTPIEVIVVLGKEDLTIKISDRGG
GVPLRIIDRLFSYTYSTAPTPVMDNSRNAPLAGFGYGLPISRLYAKYFQGDLNLYSLSGY
GTDAIIYLKALSSESIEKLPVFNKSAFKHYQMSSEADDWCIPSREPKNLAKEVAM

>gi568815591r:95485644_95696293|GENSCAN_predicted_CDS_5|1248_bp
atgaaggcggcccgcttcgtgctgcgcagcgctggctcgctcaacggcgccggcctggtg
ccccgagaggtggagcatttctcgcgctacagcccgtccccgctgtccatgaagcagcta
ctggactttggttcagaaaatgcatgtgaaagaacttcttttgcatttttgcgacaagaa
ttgcctgtgagactcgccaacattctgaaggaaattgatatcctcccgacccaattagta
aatacctcttcagtgcaattggttaaaagctggtatatacagagcctgatggatttggtg
gaattccatgagaaaagcccagatgaccagaaagcattatcagactttgtagatacactc
atcaaagttcgaaatagacaccataatgtagtccctacaatggcacaaggaatcatagag
tataaagatgcctgtacagttgacccagtcaccaatcaaaatcttcaatatttcttggat
cgattttacatgaaccgtatttctactcggatgctgatgaaccagcacattcttatattt
agtgactcacagacaggaaacccaagccacattggaagcattgatcctaactgtgatgtg
gtagcagtggtccaaggaaaatttccagaccaaccaattcacatcgtgtatgttccttct
cacctccatcatatgctctttgaactatttaaggctggtcctactcctaactcagcagtt
caggtccagaattcctgtaggtggtatggctccattcagagtccagaatgtgctaagagc
acaaatgcaatgcgggcaacagttgaacaccaggaaaatcagccttcccttacaccaata
gaggttattgttgtcttgggaaaagaagaccttaccattaagatttcagacagaggaggt
ggtgttcccctgagaattattgaccgcctctttagttatacatactccactgcaccaacg
cctgtgatggataattcccggaatgctcctttggctggttttggttacggcttgccaatt
tctcgtctgtatgcaaagtactttcaaggagatctgaatctctactctttatcaggatat
ggaacagatgctatcatctacttaaaggctttgtcttctgagtctatagaaaaacttcca
gtttttaacaagtcagccttcaaacattatcagatgagctctgaggctgatgactggtgt
atcccaagcagggaaccaaagaacctggcaaaagaagtggccatgtga

>gi568815591r:95485644_95696293|GENSCAN_predicted_peptide_6|307_aa
MGAPLWAGRGRSQLPLLAGRCGGRGTGGNRGCTQLSWASMSSEWTWARWAPHSEQPARAA
SPGQFVNAPISVLCLANLVGTWRTSVSSSEIVNAPISTLSKRTNQLSVKWINQQDALLDE
ISKKACQEPSLLLMGQEKALTCAHQHPITEPTPPSGRQSPAGLPPIPLAALPLLLRLHKC
GSVAGSQGVLQGGSRKTTKVRKREKSGEAHMSPRLSNAPSPYQQVFLRHCTPGRLRAPER
FFLTGENGYLSLERNGKSEDVFRSHKPEVKAESERLLFPAISNVCFDQTLTDRPTLETAF
CENGNDH

>gi568815591r:95485644_95696293|GENSCAN_predicted_CDS_6|924_bp
atgggagcccctctctgggctggccgaggtcggagccagctccctctgcttgcagggagg
tgtggagggagaggcacaggcgggaaccggggctgcacgcaactctcatgggccagcatg
agttctgagtggacgtgggcacggtgggccccgcactcggagcagccggctcgtgctgcc
agccccgggcagtttgtaaatgcaccaatcagtgttctgtgtctagctaatctagtgggg
acttggagaacttctgtgtctagctcagagattgtaaatgcaccaatcagcaccctgtca
aaacggaccaatcagctctctgtaaaatggatcaatcagcaggatgcactccttgatgaa
ataagtaagaaagcttgtcaagagccatcactcttgctcatgggacaagagaaggcactg
acctgtgcccaccagcaccccatcactgagccaacaccaccttcaggaagacagtctcca
gcagggcttcccccaattcccctagctgctttgcctctgctactccggttacacaagtgt
ggaagtgtggcagggtctcaaggggtccttcaaggtggtagcagaaaaactaccaaagtc
agaaagcgagagaaaagtggtgaagctcacatgagcccaaggttaagcaatgctccatcc
ccttaccagcaggtgttcctaagacattgtaccccaggtcggctaagagctcctgaacgt
ttctttctcacaggagaaaatggttacctcagcctggaaaggaatggaaaatcagaagac
gtgtttagatcacacaaaccagaggtgaaagctgaatctgagaggcttctgtttccagca
ataagcaatgtctgctttgatcagactctcactgaccgtccaacactggaaactgcattt
tgtgaaaatggtaatgatcattaa

>gi568815591r:95485644_95696293|GENSCAN_predicted_peptide_7|80_aa
MAARSTSWYKPCEGQFENARKHDKYMYPQMHDKLCGFRQKPHNMTELRGYTYKMRAIKLT
LKDCCLLTFEGSFLARETLT

>gi568815591r:95485644_95696293|GENSCAN_predicted_CDS_7|243_bp
atggctgccaggagcacaagctggtacaagccctgtgaagggcaatttgaaaatgcccgt
aaacatgacaaatacatgtatcctcagatgcacgacaaactttgtggatttaggcaaaaa
ccacataacatgactgagcttagaggctacacctataaaatgagggcaataaaactcacc
ttaaaggactgttgtctcttaacttttgaagggagtttcttagccagagaaaccctaacc
tga
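
The exon table and the CDS records in this report can be cross-checked against each other: for each gene, the Len values of the Init, Intr, Term and Sngl rows should add up to the length stated in the matching CDS header (for example 672 bp for gene 1), while Prom and PlyA rows are not part of the coding sequence. The following is a minimal Python sketch of that check. The function name `coding_length_per_gene`, the regular expression and the `sample` string are illustrative assumptions and are not part of GENSCAN; the sample rows are copied from genes 1 and 3 of the table in this report.

```python
import re
from collections import defaultdict

# Row types that contribute bases to the predicted coding sequence.
# Prom (promoter) and PlyA (poly-A signal) rows are skipped.
CODING_TYPES = {"Init", "Intr", "Term", "Sngl"}

# Leading fields of a table row: Gn.Ex, Type, strand, Begin, End, Len.
ROW = re.compile(
    r"(\d+)\.(\d+)\s+(Init|Intr|Term|Sngl|Prom|PlyA)\s+([+-])\s+(\d+)\s+(\d+)\s+(\d+)"
)

def coding_length_per_gene(text):
    """Return {gene number: total coding length in bp} for the rows in text."""
    totals = defaultdict(int)
    for gene, exon, kind, _strand, begin, end, length in ROW.findall(text):
        begin, end, length = int(begin), int(end), int(length)
        # Minus-strand rows list Begin > End, so compare the absolute span.
        if abs(end - begin) + 1 != length:
            raise ValueError(f"row {gene}.{exon}: Len does not match Begin/End")
        if kind in CODING_TYPES:
            totals[int(gene)] += length
    return dict(totals)

# Rows copied from genes 1 and 3 of the table in this report.
sample = """
1.01 Init + 329 515 187 1 1 82 93 154 0.997 14.57
1.02 Intr + 10115 10414 300 0 0 129 99 112 0.968 12.18
1.03 Intr + 15702 15755 54 1 0 75 91 61 0.873 3.13
1.04 Intr + 18695 18748 54 0 0 75 91 61 0.802 3.13
1.05 Term + 19265 19341 77 0 2 35 47 93 0.381 -3.08
1.06 PlyA + 19918 19923 6 1.05
3.02 PlyA - 65977 65972 6 1.05
3.01 Sngl - 66934 66566 369 1 0 97 48 268 0.719 19.08
3.00 Prom - 72890 72851 40 -8.35
"""

lengths = coding_length_per_gene(sample)
print(lengths)  # {1: 672, 3: 369}
```

The two totals agree with the `672_bp` and `369_bp` suffixes in the CDS_1 and CDS_3 headers. Because the pattern only relies on whitespace between fields, it should work on both the aligned table and a whitespace-collapsed copy of it.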