GENSCAN 1.0 Date run: 5-Nov-116 Time: 06:11:11 Sequence gi568815584r:100234878_100469185 : 234308 bp : 49.09% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3552 3831 280 1 1 112 -4 181 0.823 7.84 1.02 Intr + 4322 5046 725 2 2 87 99 1694 0.994 161.48 1.03 Intr + 5689 5853 165 2 0 82 24 138 0.026 6.73 1.04 Intr + 27427 27589 163 2 1 50 97 154 0.946 11.53 1.05 Intr + 38936 39002 67 2 1 56 96 23 0.322 -1.19 1.06 Intr + 41613 41771 159 2 0 90 80 131 0.971 12.68 1.07 Term + 42541 42723 183 0 0 80 48 174 0.999 10.14 1.08 PlyA + 42782 42787 6 1.05 2.07 PlyA - 43714 43709 6 1.05 2.06 Term - 58155 57406 750 0 0 136 48 1461 0.999 140.35 2.05 Intr - 58500 58417 84 0 0 126 75 137 0.998 16.22 2.04 Intr - 64008 63965 44 1 2 111 113 42 0.250 6.96 2.03 Intr - 72871 72832 40 1 1 74 92 9 0.025 -2.30 2.02 Intr - 73387 73283 105 1 0 43 115 20 0.086 0.61 2.01 Init - 78353 78111 243 2 0 89 70 139 0.172 10.02 2.00 Prom - 86329 86290 40 -8.56 3.00 Prom + 87115 87154 40 -0.26 3.01 Init + 88953 88999 47 2 2 89 103 77 0.808 7.47 3.02 Intr + 90374 90399 26 2 2 88 58 26 0.892 -2.73 3.03 Intr + 90911 90954 44 0 2 138 86 95 0.992 12.26 3.04 Intr + 91280 91351 72 1 0 94 46 62 0.820 2.20 3.05 Intr + 92311 92493 183 0 0 78 77 350 0.999 32.88 3.06 Intr + 93849 94167 319 2 1 125 99 353 0.999 35.63 3.07 Term + 94488 94768 281 1 2 71 42 182 0.960 7.51 3.08 PlyA + 95477 95482 6 1.05 4.11 PlyA - 98933 98928 6 1.05 4.10 Term - 100159 99998 162 1 0 103 42 279 0.999 22.74 4.09 Intr - 102325 102185 141 1 0 84 105 255 0.998 27.35 4.08 Intr - 107694 107521 174 0 0 110 105 197 0.999 23.74 4.07 Intr - 108510 108398 113 1 2 97 76 92 0.995 9.00 4.06 Intr - 111969 111869 101 2 2 117 91 144 0.963 17.35 4.05 Intr - 118992 118810 183 2 0 111 74 368 0.987 36.60 4.04 Intr - 119689 119570 120 0 0 112 106 62 0.996 10.01 4.03 Intr - 125785 125677 109 2 1 80 71 64 0.998 3.24 4.02 Intr - 127044 126831 214 0 1 101 84 171 0.871 16.29 4.01 Init - 134308 134210 99 1 0 102 83 110 0.755 12.16 4.00 Prom - 134481 134442 40 -11.72 5.00 Prom + 134733 134772 40 -0.76 5.01 Init + 140179 140287 109 0 1 92 22 90 0.027 3.19 5.02 Intr + 146088 146869 782 1 2 22 116 277 0.628 15.10 5.03 Intr + 147082 147132 51 0 0 89 78 18 0.472 0.00 5.04 Term + 154950 155090 141 2 0 79 53 164 0.981 9.93 5.05 PlyA + 157245 157250 6 1.05 6.04 PlyA - 159238 159233 6 1.05 6.03 Term - 159554 159378 177 2 0 102 44 98 0.287 4.49 6.02 Intr - 193594 193544 51 1 0 81 105 16 0.134 1.70 6.01 Init - 195868 195722 147 1 0 93 102 22 0.291 2.47 6.00 Prom - 207054 207015 40 -3.06 7.00 Prom + 208443 208482 40 -4.86 7.01 Init + 218645 218745 101 1 2 29 50 137 0.408 3.73 7.02 Intr + 231746 231818 73 2 1 78 75 53 0.353 2.41 7.03 Intr + 233144 233291 148 1 1 120 96 193 0.757 23.11 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 5689 5861 173 2 2 82 40 147 0.973 7.29 S.002 Init + 24919 24940 22 0 1 55 116 25 0.879 1.85 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:100234878_100469185|GENSCAN_predicted_peptide_1|580_aa XSRPRGARGRADRSPAPKAAAVAATPRSRVRPRPEAAGFVAVAPRRAAAARHREAGGGGG GGGALTSRAAGQPGRVRAAPPPVPSAPIREEPGDRGAEAAAAVAAEPSAMASGDTLYIAT DGSEMPAEIVELHEIEVETIPVETIETTVVGEEEEEDDDDEDGGGGDHGGGGGHGHAGHH HHHHHHHHHPPMIALQPLVTDDPTQVHHHQEVILVQTREEVVGGDDSDGLRAEDGFEDQI LIPVPAPAGGDDDYIEQTLVTVAAAGKSGGGGSSSSGGGRVKKGGGKKSGKKSYLSGGAG AAGGGGADPGNKKWEQKQVQIKTLEGEFSVTMWSSAPGWTLPRRSRSPKSSFAAGRDLGL HVGLACAGSERQSEPVTAVASEFLAFSPSCDEKKDIDHETVVEEQIIGENSPPDYSEYMT GKKLPPGGIPGIDLSDPKQLAEFARSIGNIWRESFDKSPKLDSVEKQGCTKMFRDNSAMR KHLHTHGPRVHVCAECGKAFVESSKLKRHQLVHTGEKPFQCTFEGCGKRFSLDFNLRTHV RIHTGDRPYVCPFDGCNKKFAQSTNLKSHILTHAKAKNNQ >gi568815584r:100234878_100469185|GENSCAN_predicted_CDS_1|1743_bp ngctcccgcccccgtggtgcccggggccgcgcggaccgctcaccggctcccaaggcagcg gctgtagcggcgacgccccgttcccgagtgcggccccggcccgaggcggcgggttttgtg gctgttgcaccgcgaagggcggcagccgcgcgacaccgggaagcgggaggcggtggcggc ggcggcggcgcgctgacgtcacgcgccgcgggccagccagggcgcgtgcgagccgccccg cccccggtcccatcggccccaatccgggaggagcccggcgaccgaggagccgaggccgcc gcggccgtggcggcggagccctcagccatggcctcgggcgacaccctctacatcgccacg gacggctcggagatgccggccgagatcgtggagctgcacgagatcgaggtggagaccatc ccggtggagaccatcgagaccacagtggtgggcgaggaggaggaggaggacgacgacgac gaggacggcggcggtggcgaccacggcggcgggggcggccacgggcacgccggccaccac caccaccaccatcaccaccaccaccacccgcccatgatcgctctgcagccgctggtcacc gacgacccgacccaggtgcaccaccaccaggaggtgatcctggtgcagacgcgcgaggag gtggtgggcggcgacgactcggacgggctgcgcgccgaggacggcttcgaggatcagatt ctcatcccggtgcccgcgccggccggcggcgacgacgactacattgaacaaacgctggtc accgtggcggcggccggcaagagcggcggcggcggctcgtcgtcgtcgggaggcggccgc gtcaagaagggcggcggcaagaagagcggcaagaagagttacctcagcggcggggccggc gcggcgggcggcggcggcgccgacccgggcaacaagaagtgggagcagaagcaggtgcag atcaagaccctggagggcgagttctcggtcaccatgtggtcctcagcgccgggctggacc ctgccccggcggtcacgctcgcccaagtcgtcgtttgctgcggggcgggacttggggctg cacgtagggctcgcgtgtgcgggctccgagcgtcagtcggagcctgtcaccgccgttgcc agcgaattcctggccttttcgccttcctgcgatgaaaaaaaagatattgaccatgagaca gtggttgaagaacagatcattggagagaactcacctcctgattattcagaatatatgaca ggaaagaaacttcctcctggaggaatacctggcattgacctctcagatcccaaacaactg gcagaatttgctagatctatagggaatatttggagagaaagttttgacaaaagtccaaag cttgattcagtggagaagcagggctgcacaaagatgttcagggataactcggccatgaga aaacatctgcacacccacggtcccagagtccacgtctgtgcagaatgtggcaaagctttt gttgagagttcaaaactaaaacgacaccaactggttcatactggagagaagccctttcag tgcacgttcgaaggctgtgggaaacgcttttcactggacttcaatttgcgcacacatgtg cgaatccataccggagacaggccctatgtgtgccccttcgatggttgtaataagaagttt gctcagtcaactaacctgaaatctcacatcttaacacatgctaaggccaaaaacaaccag tga >gi568815584r:100234878_100469185|GENSCAN_predicted_peptide_2|421_aa MTRQCQASNQQLGPQPLATTPPDLFLGFQQLFGTFASWGREPHCGGPDFPQPIQTVEDAL RKPSGLQLINYIHTYLPLPAKVLDFPPKPPQPGMDVQGFVLYLMSPPASQGHPVPKYQIH RTFRPAPPGGVAGVLVGHPFDTVKVRLQVQSVEKPQYRGTLHCFKSIIKQESVLGLYKGL GSPLMGLTFINALVFGVQGNTLRALGHDSPLNQFLAGAAAGAIQCVICCPMELAKTRLQL QDAGPARTYKGSLDCLAQIYGHEGLRGVNRGMVSTLLRETPSFGVYFLTYDALTRALGCE PGDRLLVPKLLLAGGTSGIVSWLSTYPVDVVKSRLQADGLRGAPRYRGILDCVHQSYRAE GWRVFTRGLASTLLRAFPVNAATFATVTVVLTYARGEEAGPEGEAVPAAPAGPALAQPSS L >gi568815584r:100234878_100469185|GENSCAN_predicted_CDS_2|1266_bp atgactcgccagtgccaggcctcaaaccaacagctcgggccccagcccctggccacaacg ccacctgacctgttcctgggttttcagcagctgtttgggacctttgctagctggggccgg gagccccactgcgggggccccgatttcccccaacccatccagaccgttgaagatgccctt aggaagccttcaggacttcagctgattaattacattcacacttatcttccacttcctgca aaggtactggatttccctcccaaacctcctcaaccaggaatggatgtgcagggatttgtg ctttacctcatgagccctccagcatcccaaggacatcctgttcctaagtaccaaatacac agaacctttcgacctgcacctccaggaggtgtggcaggcgtgcttgtgggacacccgttt gacacggtcaaggtacggcttcaggtccagagcgtggagaagcctcagtaccgcgggacg ttgcactgcttcaagtccatcatcaagcaagagagcgtgctgggcctgtacaagggcctg ggctcgccgctcatggggctcaccttcatcaacgcgctggtgttcggggtgcagggcaac accctccgggccctgggccacgactcgcccctcaaccagttcctggcaggtgcggcggcg ggcgccatccagtgcgtcatctgctgccccatggagctggccaagacgcggctgcagctg caggacgcgggcccagcgcgcacctacaagggctcgctggactgcctcgcgcagatctac gggcacgagggtctgcgtggcgtcaaccggggcatggtgtccacgttgctgcgtgagacg cccagcttcggcgtctacttcctcacctatgacgctctcacgcgggcgctgggctgcgag ccgggcgaccgcctgctggtgcccaagctgctgttggcgggcggtacgtcaggcatcgtg tcctggctctctacctatcctgtggacgtggtcaagtcgcggctgcaggcggacggactg cggggcgccccgcgctaccgcggcatcctggactgcgtgcaccagagctaccgcgccgag ggctggcgcgtcttcacacgggggctggcgtccacgctgctgcgcgccttccccgtcaac gctgccaccttcgccaccgtcacggtggtgctcacctacgcgcgcggcgaggaggccggg cccgagggcgaggctgtgcccgccgcccctgcggggcctgccctggcgcagccctccagc ctgtga >gi568815584r:100234878_100469185|GENSCAN_predicted_peptide_3|323_aa MGLVLQQGLEGAGRGRQSWLFTNPGVCGVAVGYPLDTVKVRIQTEPKYTGIWHCVRDTYH RERVWGFYRGLSLPVCTVSLVSSVSFGTYRHCLAHICRLRYGNPDAKPTKADITLSGCAS GLVRVFLTSPTEVAKVRLQTQTQAQKQQRRLSASGPLAVPPMCPVPPACPEPKYRGPLHC LATVAREEGLCGLYKGSSALVLRDGHSFATYFLSYAVLCEWLSPAGHSRPDVPGVLVAGG CAGVLAWAVATPMDVIKSRLQADGQGQRRYRGLLHCMVTSVREEGPRVLFKGLVLNCCRA FPVNMVVFVAYEAVLRLARGLLT >gi568815584r:100234878_100469185|GENSCAN_predicted_CDS_3|972_bp atgggcctggtcctgcagcagggactggaaggggcgggcagaggacggcagtcctggctc ttcacgaaccccggcgtctgcggtgttgctgtgggctaccccctggacacggtgaaggtc aggatccagacggagccaaagtacacaggcatctggcactgcgtccgggatacgtatcac cgagagcgcgtgtggggcttctaccggggcctctcgctgcccgtgtgcacggtgtccctg gtatcttccgtgtcttttggcacctaccgccactgcctggcgcacatctgccggctccgg tacggcaaccctgacgccaagcccaccaaggccgacatcacgctctcgggatgcgcctcc ggcctcgtccgcgtgttcctgacgtcgcccactgaggtggccaaagtccgcttgcagacg cagacacaggcgcagaagcagcagcggcggctttcggcctcggggccgttggctgtgccc cccatgtgtcctgtgcccccagcctgcccagagcccaagtaccgcgggccactgcactgc ctggccacggtagcccgtgaggaggggctgtgcggcctctacaagggcagctcggccctg gtcttacgggacggccactcctttgccacctacttcctttcctacgcggtcctctgcgag tggctcagccccgctggccacagccggccagatgtcccgggcgtgctggtggccgggggc tgtgcaggagtcctggcctgggctgtggccacccccatggacgtgatcaagtcgagactg caggcagacgggcagggccagaggcgctaccggggtctcctgcactgtatggtgaccagc gttcgagaggagggaccccgggtccttttcaaggggctggtactcaattgctgccgcgcc ttccctgtcaacatggtggtcttcgtcgcctatgaggcagtgctgaggctcgcccggggt ctgctcacatag >gi568815584r:100234878_100469185|GENSCAN_predicted_peptide_4|471_aa MPNSEPASLLELFNSIATQGELVRSLKAGNASKDEIDSAVKMLVSLKMSYKAAAGEDYKA DCPPGNPAPTSNHGPDATEAEEDFVDPWTVQTSSAKGIDYDKLIVRFGSSKIDKELINRI ERATGQRPHHFLRRGIFFSHRDMNQVLDAYENKKPFYLYTGRGPSSEAMHVGHLIPFIFT KWLQDVFNVPLVIQMTDDEKYLWKDLTLDQAYSYAVENAKDIIACGFDINKTFIFSDLDY MGMSSGFYKNVVKIQKHVTFNQVKGIFGFTDSDCIGKISFPAIQAAPSFSNSFPQIFRDR TDIQCLIPCAIDQDPYFRMTRDVAPRIGYPKPALLHSTFFPALQGAQTKMSASDPNSSIF LTDTAKQIKTKVNKHAFSGGRDTIEEHRQFGGNCDVDVSFMYLTFFLEDDDKLEQIRKDY TSGAMLTGELKKALIEVLQPLIAEHQARRKEVTDEIVKEFMTPRKLSFDFQ >gi568815584r:100234878_100469185|GENSCAN_predicted_CDS_4|1416_bp atgcccaacagtgagcccgcatctctgctggagctgttcaacagcatcgccacacaaggg gagctcgtaaggtccctcaaagcgggaaatgcgtcaaaggatgaaattgattctgcagta aagatgttggtgtcattaaaaatgagctacaaagctgccgcgggggaggattacaaggct gactgtcctccagggaacccagcacctaccagtaatcatggcccagatgccacagaagct gaagaggattttgtggacccatggacagtacagacaagcagtgcaaaaggcatagactac gataagctcattgttcggtttggaagtagtaaaattgacaaagagctaataaaccgaata gagagagccaccggccaaagaccacaccacttcctgcgcagaggcatcttcttctcacac agagatatgaatcaggttcttgatgcctatgaaaataagaagccattttatctgtacacg ggccggggcccctcttctgaagcaatgcatgtaggtcacctcattccatttattttcaca aagtggctccaggatgtatttaacgtgcccttggtcatccagatgacggatgacgagaag tatctgtggaaggacctgaccctggaccaggcctatagctatgctgtggagaatgccaag gacatcatcgcctgtggctttgacatcaacaagactttcatattctctgacctggactac atggggatgagctcaggtttctacaaaaatgtggtgaagattcaaaagcatgttaccttc aaccaagtgaaaggcattttcggcttcactgacagcgactgcattgggaagatcagtttt cctgccatccaggctgctccctccttcagcaactcattcccacagatcttccgagacagg acggatatccagtgccttatcccatgtgccattgaccaggatccttactttagaatgaca agggacgtcgcccccaggatcggctatcctaaaccagccctgctgcactccaccttcttc ccagccctgcagggcgcccagaccaaaatgagtgccagcgaccccaactcctccatcttc ctcaccgacacggccaagcagatcaaaaccaaggtcaataagcatgcgttttctggaggg agagacaccatcgaggagcacaggcagtttgggggcaactgtgatgtggacgtgtctttc atgtacctgaccttcttcctcgaggacgacgacaagctcgagcagatcaggaaggattac accagcggagccatgctcaccggtgagctcaagaaggcactcatagaggttctgcagccc ttgatcgcagagcaccaggcccggcgcaaggaggtcacggatgagatagtgaaagagttc atgactccccggaagctgtccttcgactttcagtag >gi568815584r:100234878_100469185|GENSCAN_predicted_peptide_5|360_aa MTVLRGKLRIQARDCSARISVPYGSSESPELWVNGRTYDDSDSEAETEHAGSFNATGQQK DTSGVARPPGQDFASGTLDVPKAGAQPTKHGSCEDPGGYRLPLAQLGRSDWGSCPSQRLQ WPGKEPQVTFPIKEPSCSSLWTSHVPASHMPLAAARFKQVKLSRNFPKSSFHAQSESETV GKNGSSFQKKKCEDCVVPYTPRRLRQRQALSTETGKGKDVEPQGPPAGRAPAPLYVGPGV SEFIQPYLNSHYKETTVPRKVLFHLRGHRGPVNTIQWCPVLSKSHMLLSTSMDKTFKVTR GYCFLSLSEAVREEDWGTGELGPPIERGPYRDERHPLGLLLLKVVDFATLIAWASWMSLD >gi568815584r:100234878_100469185|GENSCAN_predicted_CDS_5|1083_bp atgacggtgctgagaggcaagctacggatccaagcccgcgactgctcagccaggatctct gttccctatgggagcagtgaatcacctgaactttgggtcaatggaagaacgtatgatgat tcggactcggaggctgagacagagcatgcaggaagttttaatgctaccggccagcagaaa gacacttctggtgtggccagaccacctgggcaggattttgcatctggtacactggatgtg cccaaagcaggggcacagcccacaaagcatggctcctgtgaagacccagggggctatcgc cttccattggctcagcttgggagaagcgattggggatcttgccccagccagaggctacag tggcccgggaaggagcctcaagtcaccttccccatcaaagagccttcttgttcttctctg tggacgagccatgttccagccagccacatgcccctggcagctgcccgctttaagcaagta aaactctccaggaactttcccaagtcatctttccatgctcaaagtgagtctgaaaccgta ggtaaaaatggcagctcttttcagaagaaaaaatgtgaggactgtgtggtaccctatact cccagaagactaagacagcggcaggcattaagcacggagacaggcaagggtaaagacgtg gagccacaggggccccctgcagggcgtgccccagcccctctctacgtgggcccgggagtg tctgagtttattcagccatatttgaatagccattataaagaaaccacagttccccggaaa gtgcttttccacctgagaggccacaggggccctgtcaacaccattcagtggtgtccagtc ctttctaagagccacatgcttctctccacttctatggataaaactttcaaggttaccaga ggatactgtttcttatccctctcggaggccgtgagggaggaggattggggcactggggag ctcggcccccccatagaacgagggccatacagagatgagcggcaccctctggggctgctg ctgctgaaggtcgttgactttgcaactctgattgcttgggcatcctggatgtccctggat tga >gi568815584r:100234878_100469185|GENSCAN_predicted_peptide_6|124_aa MGFLHLHHPFPTAYRDAASLTGSPITAAGSHCASVSSSANRDHSPAWHKRRTLRPRKAKM AAHTRLQRQHKARPEEFTGHTAASPEAWLMETSQGPSAEPSPKSTTSTTASHVTLLDWKT RSHT >gi568815584r:100234878_100469185|GENSCAN_predicted_CDS_6|375_bp atgggcttcctgcatctccaccatcccttccccacagcctatcgagatgcagcctccctc actgggtcacctataacagccgctggttcccactgtgcctcagtgtcctcatctgcaaac agggaccacagtcctgcctggcataagaggaggacgctgaggcccagaaaggctaaaatg gctgcccacaccaggctgcagagacagcataaagcaaggcccgaggaattcacaggccac acagcagcaagtccagaagcctggctcatggagacgtcacaaggcccatcagctgagcca tcccccaagtccaccacctcaaccactgccagccatgtcacactgctggactggaagaca agaagccacacctag >gi568815584r:100234878_100469185|GENSCAN_predicted_peptide_7|108_aa MIIATWRRSQNNALDYDAFKCPGYEVPDYDAFKREERTEAQRGKATQPRPEACQQPYQVW NAVDSGHCLQTYSLHTEAVRAARWAPCGRRILSGGFDFALHLTDLETX >gi568815584r:100234878_100469185|GENSCAN_predicted_CDS_7|324_bp atgattattgctacctggaggcggtcccagaataatgccctggattatgatgctttcaaa tgtcccggatatgaagttccagattatgatgctttcaaaagggaggagaggactgaggcc cagagaggcaaagccactcagccaaggcccgaggcctgccagcagccgtatcaggtatgg aacgccgtggactccgggcactgcctgcagacctactccctgcacacagaggcagtgcgg gccgcccggtgggctccctgtggccggcgcatcctcagtggtggctttgacttcgcgctg cacctaacagaccttgaaacagnn