GENSCAN 1.0 Date run: 4-Nov-116 Time: 08:33:04 Sequence gi568815595r:129070456_129271757 : 201302 bp : 46.96% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 522 545 24 1 0 138 94 1 0.535 3.70 1.02 Term + 1910 2025 116 0 2 29 38 118 0.648 -0.37 1.03 PlyA + 3759 3764 6 1.05 2.18 PlyA - 4259 4254 6 1.05 2.17 Term - 6984 6932 53 1 2 95 47 85 0.022 2.69 2.16 Intr - 14908 14848 61 1 1 92 52 70 0.041 2.11 2.15 Intr - 20566 20377 190 0 1 -20 58 137 0.168 -0.51 2.14 Intr - 20891 20648 244 0 1 125 20 405 0.684 33.96 2.13 Intr - 24714 24531 184 0 1 54 109 415 0.838 39.56 2.12 Intr - 31253 31201 53 0 2 89 115 9 0.829 2.33 2.11 Intr - 36560 36322 239 1 2 83 59 125 0.303 6.36 2.10 Intr - 38064 38048 17 0 2 64 115 8 0.097 -4.56 2.09 Intr - 46483 46337 147 1 0 122 60 10 0.341 2.03 2.08 Intr - 51037 50831 207 1 0 70 81 383 0.596 35.07 2.07 Intr - 60181 60095 87 1 0 90 75 100 0.995 9.07 2.06 Intr - 63740 63619 122 0 2 74 57 187 0.999 14.41 2.05 Intr - 64483 64377 107 0 2 95 61 151 0.556 12.96 2.04 Intr - 70030 69909 122 1 2 45 97 118 0.539 7.69 2.03 Intr - 75418 75306 113 2 2 97 64 129 0.947 11.50 2.02 Intr - 88721 88699 23 1 2 145 103 -1 0.385 4.29 2.01 Init - 89344 89304 41 1 2 89 115 -9 0.605 1.56 2.00 Prom - 89787 89748 40 -10.05 3.00 Prom + 90121 90160 40 -7.86 3.01 Sngl + 90519 91166 648 2 0 97 48 379 0.995 28.98 3.02 PlyA + 91459 91464 6 1.05 4.05 PlyA - 93171 93166 6 1.05 4.04 Term - 100115 99998 118 1 1 103 32 154 0.999 9.31 4.03 Intr - 100822 100624 199 2 1 30 82 209 0.996 12.91 4.02 Intr - 101053 100991 63 2 0 89 115 17 0.518 3.19 4.01 Init - 101302 101179 124 1 1 97 53 110 0.941 8.73 4.00 Prom - 111439 111400 40 -3.16 5.02 PlyA - 112671 112666 6 1.05 5.01 Sngl - 113228 113037 192 2 0 97 49 210 0.150 11.13 5.00 Prom - 120104 120065 40 -4.16 6.00 Prom + 131448 131487 40 -2.06 6.01 Init + 132679 132719 41 0 2 76 101 34 0.290 3.32 6.02 Intr + 159138 159165 28 0 1 65 77 40 0.006 -1.28 6.03 Intr + 159258 159309 52 2 1 81 107 21 0.233 1.78 6.04 Term + 162900 163081 182 1 2 37 50 146 0.355 3.47 6.05 PlyA + 164603 164608 6 1.05 7.00 Prom + 169084 169123 40 -2.06 7.01 Init + 179826 179915 90 2 0 103 -10 109 0.356 3.09 7.02 Intr + 181826 181906 81 1 0 56 92 56 0.860 2.63 7.03 Intr + 182168 182239 72 1 0 58 95 104 0.724 7.70 7.04 Intr + 182421 182515 95 2 2 64 12 106 0.851 -0.64 7.05 Intr + 184213 184288 76 1 1 104 76 112 0.948 11.12 7.06 Intr + 184530 184622 93 2 0 116 62 155 0.953 15.86 7.07 Intr + 185613 185699 87 2 0 108 94 114 0.998 14.17 7.08 Intr + 187015 187172 158 0 2 81 97 196 0.999 18.61 7.09 Intr + 187272 187405 134 0 2 78 43 229 0.988 17.69 7.10 Intr + 189878 189945 68 0 2 88 116 52 0.999 6.62 7.11 Intr + 190164 190352 189 2 0 105 77 349 0.999 35.28 7.12 Intr + 193449 193544 96 2 0 84 94 143 0.996 14.71 7.13 Intr + 195094 195337 244 0 1 75 96 446 0.999 41.17 7.14 Intr + 196569 196644 76 1 1 61 105 58 0.998 3.37 7.15 Intr + 197482 197585 104 1 2 89 101 178 0.997 19.02 7.16 Intr + 198040 198165 126 2 0 88 92 168 0.999 17.85 7.17 Intr + 198477 198545 69 1 0 93 96 53 0.976 5.75 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:129070456_129271757|GENSCAN_predicted_peptide_1|46_aa XGETEAQKAFKAILGHMLDKLALDRRKREDRYNGGKIENPCGFPML >gi568815595r:129070456_129271757|GENSCAN_predicted_CDS_1|141_bp natggggaaactgaggcacagaaagcattcaaagccatcctggggcacatgttggacaag cttgccctagaccgaaggaaaagagaagaccgatacaacggtggcaaaatagaaaatccc tgtggatttcccatgttataa >gi568815595r:129070456_129271757|GENSCAN_predicted_peptide_2|669_aa MGKLSDRETTSHRWPEMQKRPSGLGEFRIRDLNDEINKLLREKGHWEVRIKELGGPDYGK VGPKMLDHEGKEVPGNRGYKYFGAAKDLPGVRELFEKERKKTRAELMKAIDFEYYGYLDE DDGVIVPLEQEYEKKLRAELVEKWKAEREARLARGEKEEEEEEEEEINIYAVTEEESDEE GSQEKGGDDSQQKFIAHVPVPSQQEAMAGPGPGPGDPDEQYDFLFKLVLVGDASVGKTCV VQRFKTGAFSERQGSTIGVDFTMKTLEIQGKRVKCPQEISAGSKSLKHSEAPLLTLIWDN TKACYVNSFLQITRHTGLKSSAEVPERERPAKRRGLLCFLGDWGGNLNCEKLTGPSLITC WWSTGTGTHILDFSSLCIATKPPKESCIVTRDKESHFSCWAFRHSFLRVSFFTAIWPHCP FHVSNTLQIWDTAGQERFRTITQSYYRSANGAILAYDITKRSSFLSVPHWIEDVRKYAGS NIVQLLIGNKSDLSELREVSLAEAQSLAEHYDILCAIETSAKDSSNVEEAFLRVATELIM RHGGPLFSEKSPDHIQLNSKDIGEGWGCGASPPGYRQLQQPGEALELCILWPGCGMEALL EEGEAGYPGGPPCQPAAGPPPSSHSKTGLNPPRATVRPTASNPVYSSGYGGRDCWSSTQV FDLPTFIIM >gi568815595r:129070456_129271757|GENSCAN_predicted_CDS_2|2010_bp atggggaaactgagtgatagagagactacttctcacagatggcccgaaatgcagaaaagg ccatctggtttaggtgaatttcgaattcgtgacctgaatgatgaaattaacaagctgcta agggagaaaggacactgggaggtccggataaaggagctgggaggtcctgattatggaaaa gttggccctaaaatgctggatcatgaaggaaaagaagtcccaggaaaccgaggttacaag tactttggagcagcaaaagatttgcctggtgttagagagctgtttgaaaaagaacgtaaa aagacacgtgctgagctcatgaaggcaatcgattttgagtactatggttacctagatgaa gatgatggtgttattgtgcctttggaacaggaatatgaaaagaaactcagagccgagtta gtggaaaagtggaaagcagagagagaggctcggctggcaagaggagaaaaggaagaggag gaggaagaggaggaagagatcaacatctatgcagtcaccgaggaggagtcggacgaggaa ggcagccaggagaaaggaggggacgacagccagcagaagttcattgctcacgtccctgtt ccctcgcagcaagaggccatggcagggccgggcccaggcccgggggacccggacgagcag tacgatttcctgttcaagctggtgctggtgggcgacgcaagcgtgggcaagacgtgcgtg gtgcagcgcttcaagaccggcgccttctcggagcgccagggaagcaccatcggcgtcgac ttcaccatgaagacgctggagatccagggcaagcgggtcaagtgtcctcaggaaatctca gctggctctaaatccttaaaacattctgaagctcctctactcactcttatctgggacaac acaaaggcctgttatgtaaactcgtttttgcagatcactaggcacaccggcctcaagtcc agcgcagaggtaccagagagagaaaggcctgcaaaacgtcgtgggttactgtgcttcctg ggtgactggggaggaaacttgaactgcgagaagctgacagggccatccctcatcacgtgc tggtggagtacaggaacaggaacgcacatcctggatttttcttcactctgcatcgccacc aagcctcccaaggagagctgcattgtaacaagggacaaagaaagccatttctcatgctgg gccttcaggcactcatttttaagagtttccttcttcacggccatctggccacactgccct ttccatgtctctaacacgctgcagatctgggacacggccggccaggagcggttccgcacc atcacccagagctactaccgcagtgccaatggggccatccttgcctacgacatcaccaag aggagctccttcctgtcggtgcctcactggattgaggatgtgaggaagtatgcgggctcc aacattgtgcagctgctgatcgggaacaagtcagacctcagcgagcttcgggaggtctcc ttggctgaggcacagagcctggctgagcactatgacatcctgtgtgccattgagacgtct gccaaggactcgagcaacgtggaggaggccttcctgagggtggccacggagctcatcatg cggcacgggggccccttgttcagcgagaagagccccgaccacatccagctgaacagcaag gacatcggagaaggctggggctgcggagccagccctcctgggtaccggcaactacagcag ccgggtgaagctctggagctctgcatcctgtggcctggctgcgggatggaggctctcctt gaggaaggggaagcaggataccctggcgggccaccctgccagccagcagctggccctcca ccatcttcacattccaagactggcctgaacccgccgcgggccacggtgcggcccactgca agcaaccctgtctacagctctggctacggaggccgtgactgctggtccagcacgcaggtt ttcgacttgccaaccttcatcatcatgtga >gi568815595r:129070456_129271757|GENSCAN_predicted_peptide_3|215_aa MVSLRAPSWSPAAPVQETPQAQKTPTLTGTEDSNYPQTLMFCRKRKRQGPELNGSTGSAQ TKRTLTETEDSNCPQTLFCRKRKRKGQEVNESTGSARTKPTLAGTEDSNCPQSLMFCRKR KRKGPEVKGRTGSEREIRPGIGRENSGAPSGGLPVAPARGVITRRRVMGLSQRDWRFSDS AQAAGLSSGNRSTKCLVFTIGTAGAGGAGRGYVEG >gi568815595r:129070456_129271757|GENSCAN_predicted_CDS_3|648_bp atggtgtcgctaagggcgccgtcctggagccccgcggcccctgtccaagaaactccacag gcccagaagacgccgacgctcacaggaactgaagattccaactacccacagacactgatg ttttgccggaaacgtaaaagacagggaccggaactgaatgggagtaccggaagtgcccag acgaagcggacgctcacagaaactgaagattctaactgcccacagacactgttttgtcgg aaacgtaaaagaaagggacaggaagtgaatgagagtactggaagtgcccggacgaagccg acgctcgcaggaactgaggattctaactgcccacagagcctgatgttttgtcggaaacgt aaaagaaagggaccggaagtgaaagggaggaccggaagtgaaagggagatccggccgggg attgggagagaaaattctggcgctccgtcaggcggtcttccggtagcgccggcacgcggc gtcatcacacgcagacgtgtgatggggctttcgcagagggactggcggttttcggactca gcccaggcggcgggtcttagttctggaaacaggagtactaaatgtcttgtgttcactatt ggtactgcgggagcaggcggcgctgggcgcggttatgtggaaggctga >gi568815595r:129070456_129271757|GENSCAN_predicted_peptide_4|167_aa MSSNECFKCGRSGHWARECPTGGGRGRGMRSRGRGGFTSDRDICYRCGESGHLAKDCDLQ EDACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDHADEQKCYSCGEFGHIQK DCTKVKCYRCGETGHVAINCSKTSEVNCYRCGESGHLARECTIEATA >gi568815595r:129070456_129271757|GENSCAN_predicted_CDS_4|504_bp atgagcagcaatgagtgcttcaagtgtggacgatctggccactgggcccgggaatgtcct actggtggaggccgtggtcgtggaatgagaagccgtggcagaggtggttttacctcggat agagacatttgttatcgctgtggtgagtctggtcatcttgccaaggattgtgatcttcag gaggatgcctgctataactgcggtagaggtggccacattgccaaggactgcaaggagccc aagagagagcgagagcaatgctgctacaactgtggcaaaccaggccatctggctcgtgac tgcgaccatgcagatgagcagaaatgctattcttgtggagaattcggacacattcaaaaa gactgcaccaaagtgaagtgctataggtgtggtgaaactggtcatgtagccatcaactgc agcaagacaagtgaagtcaactgttaccgctgtggcgagtcagggcaccttgcacgggaa tgcacaattgaggctacagcctaa >gi568815595r:129070456_129271757|GENSCAN_predicted_peptide_5|63_aa MAAALWPAPLLRQAGSSAAAAASDPGFWRTAVGRHFCPFFVLESWVAEHAGLSAPRPHPR PQA >gi568815595r:129070456_129271757|GENSCAN_predicted_CDS_5|192_bp atggcggccgcactttggcctgcgcctctgctgcgtcaggcgggaagctcggctgctgcc gccgcctcggacccgggtttctggcgcaccgctgtcggacgacacttctgtcctttcttc gtcctggaaagctgggtcgccgagcatgcgggtctttcggcgccacggccgcaccccagg ccgcaggcttag >gi568815595r:129070456_129271757|GENSCAN_predicted_peptide_6|100_aa MGQIPHGLVLSLQWVMLETKIKQATTHPYGRMKHGPYNSSGRILCIFAEETKKDVDELDV ENERKENVKDGSNFWLMQLSGAIERGELVCRDLEQVDESG >gi568815595r:129070456_129271757|GENSCAN_predicted_CDS_6|303_bp atggggcagatccctcatggcttggtgctgtccttgcaatgggtcatgctggagactaag atcaagcaggctaccacgcatccttacgggagaatgaagcacggtccctacaactcttcc ggaagaattctatgtatatttgctgaagagacaaagaaggatgtggatgaattggatgtg gagaatgagaggaaagaaaatgtcaaggatggctctaatttctggcttatgcaactgagt ggtgccattgagagaggtgagctggtctgcagggacttggaacaggttgatgagtctggt tag >gi568815595r:129070456_129271757|GENSCAN_predicted_peptide_7|620_aa MEEADINQIITRVIICKCVICYEGTSYNEMARVFNETPINPRKCAHILTKILYLINQGEH LGTTEATEAFFAMTKLFQSNDPTLRRMCYLTIKEMSCIAEDVIIVTSRQVMGFLTKDMTG KEDNYRGPAVRALCQITDSTMLQAIERYMKQAIVDKVPSVSSSALVSSLHLLKCSFDVVK RWVNEAQEAASSDNIMVQYHALGLLYHVRKNDRLAVNKMISKVTRHGLKSPFAYCMMIRV ASKQLEEEDGSRDSPLFDFIESCLRNKHEMVVYEAASAIVNLPGCSAKELAPAVSVLQLF CSSPKAALRYAAVRTLNKVAMKHPSAVTACNLDLENLVTDSNRSIATLAITTLLKTGSES SIDRLMKQISSFMSEISDEFKVVVVQAISALCQKYPRKHAVLMNFLFTMLREEGGFEYKR AIVDCIISIIEENSESKETGLSHLCEFIEDCEFTVLATRILHLLGQEGPKTTNPSKYIRF IYNRVVLEHEEVRAGAVSALAKFGAQNEEMLPSILVLLKRCVMDDDNEVRDRATFYLNVL EQKQKALNAGYILNGLTVSIPGLERALQQYTLEPSEKPFDLKSVPLATAPMAEQRTESTP ITAVKQPEKVAATRQEIFQX >gi568815595r:129070456_129271757|GENSCAN_predicted_CDS_7|1860_bp atggaggaggcagacatcaaccagataatcacacgtgtaattatttgcaaatgtgtcatc tgctatgaaggaactagttacaacgagatggcccgtgtatttaatgaaactcccatcaac cctcggaaatgtgcccacatcctcaccaagattctttatctcataaaccagggggagcac ctggggaccacggaagcgaccgaggccttctttgccatgaccaagctctttcagtccaat gatcccacactccgtcggatgtgctacttgaccatcaaggagatgtcttgcattgcagag gatgtcatcattgtcaccagcaggcaagtcatggggttcctaacaaaagacatgactggg aaagaagacaactaccggggcccggccgtgcgagccctctgccagatcactgatagcacc atgctgcaggctattgagcgctacatgaaacaagccattgtggacaaggtgcccagtgtc tccagctctgccctcgtgtcttccttgcacctgctgaagtgcagctttgacgtggtcaag cgctgggtgaatgaggctcaggaggcagcatccagtgataacatcatggtccagtaccac gcactagggctcctgtaccatgtgcgtaagaatgaccgcctagccgtcaataagatgatc agcaaggtcacacggcatggccttaagtctccctttgcctactgcatgatgatccgggtg gccagcaagcagctggaagaggaggatggcagccgtgacagcccactgtttgacttcatc gagagctgcttgcgcaacaagcacgagatggtggtgtatgaagccgcctcggccatcgtc aatctgccaggctgcagtgccaaagagctggccccggctgtgtcagtgctccagcttttc tgcagctcacccaaggctgctctccgctatgctgctgttcgtaccctcaataaggttgcc atgaagcatccgtcagctgtgacagcttgtaatctggatctggagaacctggtcacagat tcaaaccgcagcattgccacgctggccatcaccaccctccttaagacgggcagcgagagc agcatcgaccgcctcatgaagcagatctcctccttcatgtcagaaatctcggatgaattc aaggtggtggttgtccaggccatcagtgccctgtgtcagaaatatcctcgcaaacacgcc gtccttatgaacttcctgttcaccatgctgcgggaagagggtggctttgagtataagcgc gctatcgtggactgcatcatcagcatcattgaagagaactcagagagcaaggagacaggg ctgtcacatctgtgcgagttcatcgaggactgcgagttcacagtgctggccacccgtatt ctacatctcctgggccaggaggggcccaagaccaccaatccctcaaagtacatccgcttc atctataaccgagtggtcttggagcatgaggaggtccgggcaggtgctgtgagtgctctg gcgaagtttggagcccagaatgaagagatgttacccagtatcttggtgttgctgaagagg tgtgtgatggatgatgacaatgaagtaagggaccgagccaccttctacctaaatgtcctg gagcagaagcagaaggcccttaatgcaggctatatcctaaatggtctgactgtgtccatc cctggtctggagagggctctgcagcagtacactctagaaccatcagaaaaaccttttgac ctcaagtctgtgcccctggccacggcgcccatggcagagcagagaacagaaagtaccccc atcacagcagtcaaacagcctgagaaagtggcagctaccaggcaggagatcttccaggnn