GENSCAN 1.0 Date run: 5-Nov-116 Time: 17:34:21 Sequence gi568815586r:12230030_12443314 : 213285 bp : 42.80% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 20 15 6 1.05 1.03 Term - 10837 10716 122 2 2 54 49 136 0.435 3.96 1.02 Intr - 14626 14237 390 2 0 109 39 437 0.393 34.67 1.01 Init - 21831 21825 7 0 1 74 80 0 0.249 -0.94 1.00 Prom - 24909 24870 40 -4.95 2.06 PlyA - 25858 25853 6 1.05 2.05 Term - 37089 36978 112 2 1 81 55 208 0.595 13.85 2.04 Intr - 37557 37306 252 2 0 40 0 223 0.405 4.33 2.03 Intr - 45076 45032 45 0 0 128 74 11 0.108 0.31 2.02 Intr - 50361 50238 124 1 1 45 76 118 0.825 5.02 2.01 Init - 54783 54768 16 0 1 65 99 12 0.254 0.59 2.00 Prom - 59418 59379 40 -6.05 3.00 Prom + 66219 66258 40 -1.65 3.01 Init + 67777 67822 46 0 1 62 92 8 0.399 -0.50 3.02 Intr + 70925 71153 229 0 1 59 57 167 0.731 6.71 3.03 Term + 75628 75835 208 1 1 62 48 127 0.340 1.93 3.04 PlyA + 78824 78829 6 1.05 4.07 PlyA - 78865 78860 6 1.05 4.06 Term - 100929 99998 932 1 2 68 49 532 0.895 38.61 4.05 Intr - 108531 108391 141 1 0 73 95 110 0.094 9.60 4.04 Intr - 120345 120049 297 1 0 8 103 186 0.240 8.02 4.03 Intr - 121195 121077 119 0 2 35 72 67 0.286 -1.01 4.02 Intr - 124070 123891 180 1 0 83 80 69 0.119 3.66 4.01 Init - 152233 152133 101 1 2 61 48 137 0.501 6.78 4.00 Prom - 153064 153025 40 -5.85 5.04 PlyA - 153934 153929 6 1.05 5.03 Term - 173476 173312 165 1 0 87 36 122 0.283 3.83 5.02 Intr - 177166 176936 231 1 0 103 7 126 0.161 3.05 5.01 Init - 183695 183648 48 2 0 79 78 56 0.016 2.72 5.00 Prom - 189500 189461 40 -3.65 6.02 PlyA - 189784 189779 6 -0.45 6.01 Sngl - 190093 189803 291 1 0 71 43 179 0.887 7.10 6.00 Prom - 190147 190108 40 -5.65 7.02 PlyA - 190264 190259 6 1.05 7.01 Sngl - 191458 190805 654 1 0 43 48 333 0.991 20.72 7.00 Prom - 193279 193240 40 -3.45 8.02 PlyA - 193312 193307 6 1.05 8.01 Term - 199848 199683 166 2 1 58 54 162 0.051 6.21 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 183695 183513 183 2 0 79 43 173 0.838 4.70 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:12230030_12443314|GENSCAN_predicted_peptide_1|172_aa MGAAPLLLYANRRDLRLVDATNGKENATIVVGGLEDAAAVDFVFSHGLIYWSDVSEEAIK RTEFNKTESVQNVVVSGLLSPDGLACDWLGEKLYWTDSETNRIEVSNLDGSLRKVLFWQE LDQPRAIALDPSMSRGDSYVAAPNAGAPQSDSTVTMLSNRTNLGDVEDLCQV >gi568815586r:12230030_12443314|GENSCAN_predicted_CDS_1|519_bp atgggagcggcccctttgttgctttatgcaaacagacgggacttgcgattggttgatgct acaaatggcaaagagaatgctacgattgtagttggaggcttggaggatgcagctgcggtg gactttgtgtttagtcatggcttgatatactggagtgatgtcagcgaagaagccattaaa cgaacagaatttaacaaaactgagagtgtgcagaatgttgttgtttctggattattgtcc cccgatgggctggcatgtgattggcttggagaaaaattgtactggacagattctgaaact aatcggattgaagtttctaatttagatggatctttacgaaaagttttattttggcaagag ttggatcaacccagagctattgccttagatccttcaatgagtagaggtgattcctatgtg gctgctcctaatgctggagcccctcagtctgattccacagttaccatgctaagcaaccgt actaatcttggtgatgttgaggacctgtgtcaagtttaa >gi568815586r:12230030_12443314|GENSCAN_predicted_peptide_2|182_aa MKMTGVQYHKPLNNILEPMQSATSDAGSAPKKQKKVMTLQEKSGIAWLPSIADILSSCLG QTSTLKAGVNSVQKFRPPKKRMLKPISNPEDGAHHPATGSAVAPRACTAAGQSQVAGGAR PEPHLALPAAAAQRAGQTAARPSRRKRQYREKGLVARRDTVAATLGGGADRIPAPLREAE AE >gi568815586r:12230030_12443314|GENSCAN_predicted_CDS_2|549_bp atgaagatgactggagtgcaataccataaacccttgaataacatcctggaacccatgcaa agtgccactagtgatgctggaagtgctcccaagaagcagaaaaaagtcatgaccttacaa gaaaaaagtggaattgcttggttaccctctattgcagatatcctttcatcctgtttggga caaacatccacgctgaaagccggtgtaaacagcgttcaaaaattccgccctcccaaaaaa aggatgctgaaaccgatttccaaccccgaagacggagctcatcaccccgcaacgggatcc gccgtggctccccgcgcgtgcaccgcggctgggcagtctcaggtggctgggggcgcgaga ccggagccccacctggccctgccggcggcggccgcacaaagagcggggcagacggctgcc cggccgagccgcagaaagaggcagtaccgggagaaggggctcgttgcgcgcagggacacg gtcgcggcgaccctaggaggcggtgccgaccgtatccctgcgccgctgcgcgaggcagag gctgaatga >gi568815586r:12230030_12443314|GENSCAN_predicted_peptide_3|160_aa MQTSRGPILQAEGAAGVSGTRYYHLTPQVVWERQQQKVDSLLGTQQMLEAQGREFQGPAS PYLSPQMHQLLRPHLSLLHRTSNPPTISSCGWSLCYHSSASTEINTPVGNGMSLMLVGQG RSLNAGGTTTLAGVQALDTITRRNSRMSQKIVKVWRFIAK >gi568815586r:12230030_12443314|GENSCAN_predicted_CDS_3|483_bp atgcaaacatccaggggaccaatactccaagctgagggagcggcaggtgtgtctggaact cggtactaccaccttactccccaggtggtctgggagaggcagcaacagaaagtagacagt ttattagggacccagcagatgctggaagcccagggaagggagtttcagggccctgcatcc ccctacctgtccccacaaatgcaccagctgctaagaccacacctcagcctccttcaccgg acctctaaccccccaactatctcttcctgtggttggtctctttgttaccatagttcggcc tctaccgaaattaatacaccagtggggaatgggatgagcttgatgctagtgggccaggga aggtccctaaacgctggtgggaccacaaccctggctggtgtccaggctcttgacaccatc acaagaaggaattcaagaatgagtcagaaaatagtgaaagtgtggagatttattgcaaaa tga >gi568815586r:12230030_12443314|GENSCAN_predicted_peptide_4|589_aa MPAPPSPSSMIESFLRSLPEADAGATLLIQLAELLPSQKPAYCQHCIQPIVVSWRASPLN FSSICMPISGPYHFRFLEERPLASRPSIHPSCGSYWLRLGCVAVGRTGLDLTCNCIALQV CKLFQNRGEDTVKALGPGARSLRGEGRANVAGKQLRVAGQERALPPRQTCRRCDGSRNLL TTKPARPVRELRSIRCGPRRDPRADSPVLPAGATELSSHLDGSRGAGVLSTPGDKACNLM IFDTRKTARQPNCYLFFCPNEEACPLKPAKGLMSYRIITDFPSLTRNLPSQELPQEDSLL HGQFSQAVTPLAHHHTDYSKPTDISWRDTLSQKFGSSDHLEKLFKMDEASAQLLAYKEKG HSQSSQFSSDQEIAHLLPENVSALPATVAVASPHTTSATPKPATLLPTNASVTPSGTSQP QLATTAPPVTTVTSQPPTTLISTVFTRAAATLQAMATTAVLTTTFQAPTDSKGSLETIPF TEISNLTLNTGNVYNPTALSMSNVESSTMNKTASWEGREASPGSSSQGSVPENQYGLPFE KWLLIGSLLFGVLFLVIGLVLLGRILSESLRRKRYSRLDYLINGIYVDI >gi568815586r:12230030_12443314|GENSCAN_predicted_CDS_4|1770_bp atgcctgctcctccttcaccttcctccatgattgaaagttttctgaggtccttaccagaa gcagatgctggtgccacacttcttatacagcttgcagaactgctcccaagccagaagcct gcttactgtcagcactgcatccaaccaatcgttgtgtcctggagagcctctcctcttaat ttctcctctatctgcatgcccatttcagggccttatcacttccgtttcttggaagagagg cctctggcctcacgtccctcaatccatccctcctgtggcagttattggttaagactaggg tgtgtagctgttggccgtacaggcctagacttaacttgcaattgcatagcgcttcaggtc tgcaagctctttcaaaacagaggtgaagatactgtcaaagcgctggggccgggagcccgg agcctccgcggggagggacgcgctaatgttgccgggaagcagctccgggttgcagggcag gaacgtgcccttcccccgcggcaaacttgccgtcgctgcgacggaagcaggaacttgcta accacaaaacccgccaggccggtgcgggagctgcggagcatccgctgcggtcctcgccga gacccccgcgcggattcgccggtccttcccgcgggcgcgacagagctgtcctcgcacctg gatggcagcaggggcgccggggtcctctcgacgccaggggacaaagcatgtaacttgatg atcttcgacactcgaaaaacagctagacaacccaactgctacctatttttctgtcccaac gaggaagcctgtccattgaaaccagcaaaaggacttatgagttacaggataattacagat tttccatctttgaccagaaatttgccaagccaagagttaccccaggaagattctctctta catggccaattttcacaagcagtcactcccctagcccatcatcacacagattattcaaag cccaccgatatctcatggagagacacactttctcagaagtttggatcctcagatcacttg gagaaactatttaagatggatgaagcaagtgcccagctccttgcttataaggaaaaaggc cattctcagagttcacaattttcctctgatcaagaaatagctcatctgctgcctgaaaat gtgagtgcgctcccagctacggtggcagttgcttctccacataccacctcggctactcca aagcccgccacccttctacccaccaatgcttcagtgacaccttctgggacttcccagcca cagctggccaccacagctccacctgtaaccactgtcacttctcagcctcccacgaccctc atttctacagtttttacacgggctgcggctacactccaagcaatggctacaacagcagtt ctgactaccacctttcaggcacctacggactcgaaaggcagcttagaaaccataccgttt acagaaatctccaacctaactttgaacacagggaatgtgtataaccctactgcactttct atgtcaaatgtggagtcttccactatgaataaaactgcttcctgggaaggtagggaggcc agtccaggcagttcctcccagggcagtgttccagaaaatcagtacggccttccatttgaa aaatggcttcttatcgggtccctgctctttggtgtcctgttcctggtgataggcctcgtc ctcctgggtagaatcctctcggaatcactccgcaggaaacgttactcaagactggattat ttgatcaatgggatctatgtggacatctaa >gi568815586r:12230030_12443314|GENSCAN_predicted_peptide_5|147_aa MVGGSAPRPASRPIREDQDYFKIICKKEYIQSEKALTLTVKINITNIHNSDEKIFQQLSL SEGRKPTLSHLPGRMNAILEEQATWVPPTLQQLFPQKQEGAPHSGLGRYGAITERKSLVC PSHGEEQLGLPQKSQQKQQTSNSETAK >gi568815586r:12230030_12443314|GENSCAN_predicted_CDS_5|444_bp atggtgggggggtcagccccccgcccggccagccgccccatccgggaggatcaggactac ttcaaaattatttgcaaaaaggaatatattcaaagtgagaaggccctcactcttactgtg aaaataaacattacaaacattcacaattcagatgagaaaatctttcaacagctctcactg agtgaaggcagaaagcccactttgtcccacttgcctggaagaatgaatgccatcctggaa gaacaggccacttgggtgccccccacacttcagcagctgtttcctcagaagcaggaaggt gccccccacagcggactagggagatatggggcaatcacagaaaggaaatccttggtgtgc ccaagccacggggaagagcagctgggattgccacagaaaagccagcagaagcaacagacc tctaacagtgaaacagccaaataa >gi568815586r:12230030_12443314|GENSCAN_predicted_peptide_6|96_aa MGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKIFAIYPSDKEL MSKIYKEFKQIYKKKTNNPINKWAKDMNRHSSKEDI >gi568815586r:12230030_12443314|GENSCAN_predicted_CDS_6|291_bp atgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgac aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtg aacaggcaacctacagaatgggagaaaatttttgcaatctatccatcagacaaagagcta atgtccaaaatctacaaagaattcaaacaaatttacaagaaaaaaacaaacaaccccatc aacaagtgggcaaaagatatgaacagacactcctcaaaagaagacatctaa >gi568815586r:12230030_12443314|GENSCAN_predicted_peptide_7|217_aa MNIDAKILNKILANRIQEHIKKLTYHDQVGFIPGMQGWFNIHKSINVIHHINRTDHKKHM IISIDAEKAFNKIQQRFMLKTLNKLGIDGTYLKIIRAINDKPTASIIPNGQKLEAFPLKI GTRQECSLSPLLFNTVLEVLARAIRQEKEIKDIQLGNEEVKLSLFADDMIVYLENSIISA PNLLKLISNFSKVSGYKISVQKSQSFLYTNNRQTAKL >gi568815586r:12230030_12443314|GENSCAN_predicted_CDS_7|654_bp atgaacatcgatgcgaaaatcctcaataaaatactggcaaaccgaatccaggagcacatc aaaaagcttacctaccacgatcaagtcggcttcatccctgggatgcaaggctggttcaac atacacaaatcaataaacgtaatccatcacataaacagaaccgaccacaaaaaacacatg attatctcaatagatgcagaaaaggccttcaacaaaattcaacagcgcttcatgctaaaa actctcaataaactgggtattgatggaacgtatctcaaaataataagagctattaatgac aaacccacagccagtatcataccgaatgggcaaaaactggaagcattccctttgaaaatc ggcacaagacaagaatgctccctctcaccactcctattcaacacagtgttggaagttctg gccagggcaatcaggcaagaaaaagaaataaaggatattcaattaggaaatgaggaagtc aaattgtccctgtttgcagatgacatgattgtatatttagaaaactccatcatctcagcc ccaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcagtgtg caaaaatcacaatcattcctatacaccaataacagacaaacagccaaattatga >gi568815586r:12230030_12443314|GENSCAN_predicted_peptide_8|55_aa XCMRKLSGASEDYWEPESQNEVEEGYRASPSCQVAEAEGPEPNLSSQNKTLTSTS >gi568815586r:12230030_12443314|GENSCAN_predicted_CDS_8|168_bp nngtgcatgaggaagctctccggtgcttcagaagactattgggagcctgaatcgcagaat gaagtggaggaaggctacagggccagcccaagctgccaggtggctgaagcagagggccca gaacccaaccttagctcacaaaacaaaactctcacttctacctcttga