GENSCAN 1.0 Date run: 4-Nov-116 Time: 19:55:43 Sequence gi568815586f:4811379_5012863 : 201485 bp : 44.43% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3347 3405 59 1 2 62 92 19 0.166 0.50 1.02 Intr + 8288 8411 124 2 1 84 2 97 0.013 1.29 1.03 Intr + 19762 19911 150 0 0 59 -8 141 0.068 1.96 1.04 Term + 27919 27987 69 0 0 115 36 97 0.245 5.14 1.05 PlyA + 29943 29948 6 1.05 2.06 PlyA - 32122 32117 6 1.05 2.05 Term - 34153 34021 133 0 1 90 38 95 0.222 2.26 2.04 Intr - 36584 36460 125 2 2 88 66 36 0.144 0.78 2.03 Intr - 44757 44497 261 0 0 64 60 106 0.586 3.08 2.02 Intr - 45026 44921 106 1 1 50 96 29 0.677 0.12 2.01 Init - 45511 45414 98 1 2 65 121 71 0.528 7.88 2.00 Prom - 52397 52358 40 -3.86 3.06 PlyA - 53846 53841 6 1.05 3.05 Term - 64840 64829 12 1 0 121 39 12 0.334 -2.20 3.04 Intr - 68251 68151 101 2 2 58 95 74 0.309 4.93 3.03 Intr - 69283 69147 137 0 2 74 103 -2 0.609 0.11 3.02 Intr - 70446 70309 138 2 0 64 94 98 0.920 7.58 3.01 Init - 80297 80236 62 2 2 66 103 -5 0.136 -0.48 3.00 Prom - 82086 82047 40 -5.76 4.00 Prom + 82161 82200 40 -5.16 4.01 Sngl + 100001 101488 1488 1 0 104 42 2681 0.989 260.59 4.02 PlyA + 101689 101694 6 1.05 5.03 PlyA - 101710 101705 6 1.05 5.02 Term - 121271 120823 449 0 2 -3 48 368 0.338 19.08 5.01 Init - 141885 141753 133 0 1 98 45 60 0.222 3.00 5.00 Prom - 146079 146040 40 -4.16 6.00 Prom + 161107 161146 40 -3.26 6.01 Init + 165724 166034 311 0 2 61 66 151 0.407 7.25 6.02 Intr + 167766 167880 115 0 1 98 69 41 0.519 3.65 6.03 Term + 171318 171371 54 2 0 78 43 90 0.527 1.06 6.04 PlyA + 172058 172063 6 1.05 7.04 PlyA - 172901 172896 6 1.05 7.03 Term - 173657 173539 119 0 2 132 55 77 0.939 7.50 7.02 Intr - 177361 177211 151 1 1 75 80 81 0.480 5.74 7.01 Init - 190304 190218 87 2 0 99 72 31 0.093 3.25 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:4811379_5012863|GENSCAN_predicted_peptide_1|133_aa MRDIRPQGARRDSSPSLQVSSLPQPLRLLNKGPIPGQGDVAALMVEDSELAGLPAMGVGE LVRGFARTAEEYGTDELSFLKTEGALVQMHKVVLEVACGPERLAWWIFTVLPTRCEDNED EDLYDDTLPLNNQ >gi568815586f:4811379_5012863|GENSCAN_predicted_CDS_1|402_bp atgcgggatattcgtccccaaggagcgaggagggacagctcgccttctttgcaggtgagc tcactgccacagccactacggcttcttaacaaaggtcctatcccagggcaaggtgacgtg gctgccctcatggttgaggattctgagcttgctggcttgcctgccatgggagttggagaa ttggtgagaggatttgcaaggactgcggaggaatatgggaccgacgagctctcctttctc aaaacagagggagccttggtgcagatgcacaaagtggtccttgaagttgcatgtggcccg gagcggctggcttggtggatctttactgttctgcctactcggtgtgaagacaatgaggat gaagacctttacgatgatacacttccacttaataaccagtaa >gi568815586f:4811379_5012863|GENSCAN_predicted_peptide_2|240_aa MKQYLEIKSSEDSVTSRRGGLTEPPELAPAGTRGGNQGPEWKTDTRGKRVAYKVTFHVSP PHSGSSDQSTLSQSSQGVQPLRGVSRNTSCCFLYDLFLEVAGTHKQGLSKMSTCFHRPGD REPLRPQLPYIYLESPSSHLYFCTQYLRRSGAEGQGNIKQAVGEYFPGKTKGFQEKKACR HWRPPVRQPDPSFITLQNRVPNVQHLHNNARNVTAEHPEREKLNNLQRDRSSRTKQSEKR >gi568815586f:4811379_5012863|GENSCAN_predicted_CDS_2|723_bp atgaagcagtacttggaaattaaaagcagtgaggactctgtcaccagcaggcgaggtggc ctcacagagccccctgagctggcaccggctggcacaagagggggaaaccaaggccctgag tggaagactgacaccagaggaaaaagggttgcatacaaggtcaccttccatgtctcccct cctcactccgggtccagtgatcagagcacactgagccaaagttcccagggcgtccagcct ctgcgtggagtttccaggaacacttcctgctgcttcctctacgacctcttcctggaagtt gcaggcacccataagcaaggactctctaagatgtccacttgctttcacaggccaggagat cgggagcctttgaggcctcagctcccctacatctacttggaatccccgagcagccacctt tacttctgcacccaatacctccggcggagtggggcagaagggcagggtaatatcaaacag gcagttggagaatattttcctggaaaaaccaaaggattccaagaaaaaaaggcctgtaga cactggaggcccccagtgagacagcctgatcccagcttcatcactctgcagaaccgtgtg cctaatgtccaacacctccacaacaatgcacgtaacgtgactgcggagcatccagaaaga gaaaagcttaacaacctgcagagggacagaagttcacgtacaaagcaatcagaaaagcgt tga >gi568815586f:4811379_5012863|GENSCAN_predicted_peptide_3|149_aa MLQNPTLFEHQHDIINGKFPMHLIPGCGSNKTNRDCLTGPLWSGYFPFRCRQPSEHAASK SSEITAGAEWSTHLPLSQRALVLLCNKASLTTKHVLKWEFVCASSLFACHYTCAPAVAER AQCRIQAMASEGASIKPWQPPLGVEPPLY >gi568815586f:4811379_5012863|GENSCAN_predicted_CDS_3|450_bp atgctccaaaatccaacactttttgaacaccaacatgatatcataaatggaaaattcccc atgcacctgattcctggatgtggcagcaataaaaccaaccgagattgtttgactggcccc ctctggagcgggtatttcccattccgatgccgtcagccatctgagcatgctgcttccaag agctccgaaataacagccggtgctgaatggagcacacaccttcctctcagccagagagca cttgtcctcctgtgcaataaagcttcactgacgaccaagcatgttttaaaatgggagttt gtctgtgcaagctctctctttgcctgccactatacatgtgctccagctgtggctgaaagg gcccaatgtagaattcaggccatggcttcagagggtgcaagcatcaagccttggcagcct ccacttggtgttgagcctccactctactag >gi568815586f:4811379_5012863|GENSCAN_predicted_peptide_4|495_aa MTVMSGENVDEASAAPGHPQDGSYPRQADHDDHECCERVVINISGLRFETQLKTLAQFPN TLLGNPKKRMRYFDPLRNEYFFDRNRPSFDAILYYYQSGGRLRRPVNVPLDMFSEEIKFY ELGEEAMEKFREDEGFIKEEERPLPEKEYQRQVWLLFEYPESSGPARVIAIVSVMVILIS IVIFCLETLPELKDDKDFTGTVHRIDNTTVIYNSNIFTDPFFIVETLCIIWFSFELVVRF FACPSKTDFFKNIMNFIDIVAIIPYFITLGTEIAEQEGNQKGEQATSLAILRVIRLVRVF RIFKLSRHSKGLQILGQTLKASMRELGLLIFFLFIGVILFSSAVYFAEAEEAESHFSSIP DAFWWAVVSMTTVGYGDMYPVTIGGKIVGSLCAIAGVLTIALPVPVIVSNFNYFYHRETE GEEQAQLLHVSSPNLASDSDLSRRSSSTMSKSEYMEIEEDMNNSIAHYRQVNIRTANCTT ANQNCVNKSKLLTDV >gi568815586f:4811379_5012863|GENSCAN_predicted_CDS_4|1488_bp atgacggtgatgtctggggagaacgtggacgaggcttcggccgccccgggccacccccag gatggcagctacccccggcaggccgaccacgacgaccacgagtgctgcgagcgcgtggtg atcaacatctccgggctgcgcttcgagacgcagctcaagaccctggcgcagttccccaac acgctgctgggcaaccctaagaaacgcatgcgctacttcgaccccctgaggaacgagtac ttcttcgaccgcaaccggcccagcttcgacgccatcctctactactaccagtccggcggc cgcctgcggaggccggtcaacgtgcccctggacatgttctccgaggagatcaagttttac gagttgggcgaggaggccatggagaagttccgggaggacgagggcttcatcaaggaggag gagcgccctctgcccgagaaggagtaccagcgccaggtgtggctgctcttcgagtacccc gagagctcggggcccgccagggtcatcgccatcgtctccgtcatggtcatcctcatctcc atcgtcatcttttgcctggagacgctccccgagctgaaggatgacaaggacttcacgggc accgtccaccgcatcgacaacaccacggtcatctacaattccaacatcttcacagacccc ttcttcatcgtggaaacgctgtgtatcatctggttctccttcgagctggtggtgcgcttc ttcgcctgccccagcaagacggacttcttcaaaaacatcatgaacttcatagacattgtg gccatcattccttatttcatcacgctgggcaccgagatagctgagcaggaaggaaaccag aagggcgagcaggccacctccctggccatcctcagggtcatccgcttggtaagggttttt agaatcttcaagctctcccgccactctaagggcctccagatcctgggccagaccctcaaa gctagtatgagagagctagggctgctcatctttttcctcttcatcggggtcatcctgttt tctagtgcagtgtactttgccgaggcggaagaagctgagtcgcacttctccagtatcccc gatgctttctggtgggcggtggtgtccatgaccactgtaggatacggtgacatgtaccct gtgacaattggaggcaagatcgtgggctccttgtgtgccatcgctggtgtgctaacaatt gccctgcccgtacctgtcattgtgtccaatttcaactatttctaccaccgagaaactgag ggggaagagcaggctcagttgctccacgtcagttcccctaacttagcctctgacagtgac ctcagtcgccgcagttcctctactatgagcaagtctgagtacatggagatcgaagaggat atgaataatagcatagcccattatagacaggtcaatatcagaactgccaattgcaccact gctaaccaaaactgcgttaataagagcaagctactgaccgatgtttaa >gi568815586f:4811379_5012863|GENSCAN_predicted_peptide_5|193_aa MGQRPQKIKPIRKIFSKGNNLSHRTLAGRRDLEEHQERSKDDDIEKKKKKKKKKKKKKKK KKKKKKKKKKKKKKKKERKNALRPNVNISPSQKFSRHSFQNQDFLFWAPLACSWLPLLLV FLAYPNVRGHVCLSHCTKLLVELEFRIPEFRMPEFRIPVKYNSSRERTKRMYSSPNVWGS AYCEPIFKPQENE >gi568815586f:4811379_5012863|GENSCAN_predicted_CDS_5|582_bp atggggcagagaccacaaaaaatcaaacccatccggaagatcttttccaagggaaataat ctcagtcacagaactttggcaggcaggagagatttggaggaacatcaagaaagatctaag gatgatgatattgagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaag aagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaaagaaagaaagaat gctctaaggcccaatgtaaacatttccccctctcagaagttctctagacattcctttcag aaccaggacttcctcttctgggctcccctagcttgttcttggcttcctctgctcctagtg ttcttggcttatcccaatgtccgtggtcatgtctgtctgtctcactgcaccaagctcctt gtggagcttgaatttagaataccagaatttagaatgccagagtttagaataccagtcaaa tacaattcgtcccgagaacggaccaagagaatgtattcctcacccaacgtttgggggtca gcttactgtgagcctatttttaagccacaagaaaatgagtga >gi568815586f:4811379_5012863|GENSCAN_predicted_peptide_6|159_aa MPFACMPAGVVVPSCSPVLFLSMVLSTGGGPIIQDCGGVEENTREPSEWWHRHLTYLENC LETGDSLRALLPGPIIAGSQAACRKGEYTTALHGLGEDDFSTSCTGVREEEVEAAVLGSG FIPCDSSGLPQVLLSTYTTGHQELAYSAIDLTINTLMET >gi568815586f:4811379_5012863|GENSCAN_predicted_CDS_6|480_bp atgccatttgcctgcatgcctgctggtgtggtggtgccgtcctgttctcccgtgctgttc ctcagcatggttctctccacgggtggaggacccatcattcaggattgtggtggcgtagaa gaaaacacacgggagccttctgagtggtggcaccggcatcttacataccttgaaaactgc ctggaaacaggtgattcattacgggctctgcttccaggcccgatcatagcaggatcccag gcagcctgcagaaagggagagtacactactgctcttcatggtctgggggaggatgacttc agtacctcatgcacaggggtgcgagaggaggaagtggaggctgcagtgctgggctctggg ttcattccctgtgattcctctgggctgccccaagtcctgctcagcacatacacaacagga caccaggaacttgcctattctgccattgacctcaccatcaacactttgatggagacctaa >gi568815586f:4811379_5012863|GENSCAN_predicted_peptide_7|118_aa MGSSEGRVTEHLQPQSSPFSIAASRARSQLSGGLAELMSGINQQGGKSVSSAAPLPPRNH LLKLLSSIHLPEAPVMQAAGALLPAIICHCKQPVQQCAAVSTSSERRHLLPEDTFRYA >gi568815586f:4811379_5012863|GENSCAN_predicted_CDS_7|357_bp atgggctcctcggagggcagagtcacagagcacctgcagccacagagttctcccttctcc attgctgcttctcgtgctaggagccagctgagcggaggactggctgagctgatgagtggt ataaatcaacaaggtggcaaatcggtgagttctgctgctcctttgcccccaaggaaccac ttgttaaagctcctgagcagcatccacctccctgaagcacctgtcatgcaggctgcgggg gccttactgcctgccatcatctgccactgcaaacaacctgttcagcagtgtgcagctgtg agcacatcctctgagagaaggcatctgctacccgaagataccttcaggtatgcctga