GENSCAN 1.0 Date run: 7-Nov-116 Time: 03:09:02 Sequence gi568815580r:26070726_26271286 : 200561 bp : 39.67% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 3898 3893 6 1.05 1.03 Term - 19934 19521 414 2 0 22 34 344 0.591 16.68 1.02 Intr - 20710 20550 161 2 2 21 69 145 0.381 4.69 1.01 Init - 28393 28387 7 1 1 62 89 0 0.096 -1.42 1.00 Prom - 40665 40626 40 -6.15 2.00 Prom + 41193 41232 40 -4.35 2.01 Init + 50870 51152 283 1 1 83 74 246 0.370 19.95 2.02 Intr + 52539 52732 194 1 2 17 106 72 0.256 0.19 2.03 Term + 55687 56103 417 0 0 -7 42 219 0.264 2.09 2.04 PlyA + 57819 57824 6 1.05 3.00 Prom + 61265 61304 40 -4.45 3.01 Init + 63241 63342 102 0 0 74 77 96 0.324 7.39 3.02 Intr + 73834 73960 127 0 1 46 80 121 0.578 6.43 3.03 Intr + 81133 81257 125 2 2 57 115 101 0.937 9.08 3.04 Intr + 87397 87519 123 0 0 66 113 93 0.975 9.46 3.05 Term + 90809 90907 99 1 0 50 54 47 0.307 -5.35 3.06 PlyA + 91848 91853 6 1.05 4.02 PlyA - 92343 92338 6 1.05 4.01 Sngl - 100561 99998 564 1 0 73 34 422 0.907 31.19 4.00 Prom - 103151 103112 40 -2.35 5.00 Prom + 103827 103866 40 -9.85 5.01 Init + 103958 103960 3 1 0 56 95 0 0.451 -2.45 5.02 Intr + 108105 108224 120 2 0 70 100 147 0.978 13.87 5.03 Intr + 108343 108405 63 0 0 71 115 30 0.773 2.00 5.04 Term + 112756 112923 168 0 0 34 44 140 0.607 1.10 5.05 PlyA + 112992 112997 6 1.05 6.00 Prom + 115980 116019 40 -5.15 6.01 Init + 134519 134665 147 1 0 92 29 141 0.953 8.74 6.02 Intr + 134714 134872 159 1 0 -58 -16 462 0.114 20.46 6.03 Intr + 139807 139937 131 0 2 63 30 81 0.016 -1.63 6.04 Term + 156100 156709 610 1 1 75 29 584 0.389 44.18 6.05 PlyA + 157592 157597 6 1.05 7.04 PlyA - 158178 158173 6 1.05 7.03 Term - 161739 161560 180 0 0 103 36 122 0.922 5.13 7.02 Intr - 164306 163339 968 0 2 26 73 575 0.932 39.60 7.01 Init - 167089 166186 904 1 1 77 0 270 0.692 11.52 7.00 Prom - 168349 168310 40 -3.25 8.02 PlyA - 169110 169105 6 1.05 8.01 Sngl - 170120 169485 636 2 0 42 49 276 0.968 15.03 8.00 Prom - 175270 175231 40 -3.35 9.02 PlyA - 175984 175979 6 1.05 9.01 Sngl - 185632 185306 327 1 0 50 37 258 0.769 12.66 9.00 Prom - 189043 189004 40 -3.25 10.02 PlyA - 190132 190127 6 1.05 10.01 Term - 198578 198411 168 2 0 65 48 146 0.861 5.20 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 134714 134896 183 1 0 -58 53 444 0.814 22.86 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815580r:26070726_26271286|GENSCAN_predicted_peptide_1|193_aa MSWFVPRGELFSVRSSQAGRLPLSTGHNLGVPVSGQGYFPIVRHDFVRACETVDSEVVQP ERPASLPQFAVHPERSGLADSGDGGNMSVAFAAPRQRGKGEITPAAIQKVKPRGVADARP LPPLGETDAGGRAGPGVDRAEAEAGLLGISSVWEHGLLAVPEWVAEEGEPVFLGVFHAFT RFSTDTKAQTAAA >gi568815580r:26070726_26271286|GENSCAN_predicted_CDS_1|582_bp atgtcttggtttgtcccaagaggcgagctcttctctgtccgctccagccaagctggccga ttgcccctgagcactggacacaaccttggtgtcccggtttcgggtcagggttactttcca attgttaggcacgactttgtgcgtgcgtgcgagactgtggactcggaggtggttcagccc gagaggccggcgtctctcccccagtttgccgttcacccggagcgctcgggacttgccgat agtggtgacggcggcaacatgtctgtggctttcgcggccccgaggcagcgaggcaagggg gagatcactcccgctgcgattcagaaggtgaaaccgcgcggggttgcggatgccaggccc ttaccgcctctgggagagacagacgcggggggaagggccgggcccggagtcgaccgggcc gaggcggaggcgggcctgctgggaatcagcagtgtttgggaacacggactgctggctgtg cctgagtgggtggcggaagagggtgaaccggtttttctcggagtctttcacgcatttacg cgtttttctacagacacaaaagcccaaacagccgccgcctga >gi568815580r:26070726_26271286|GENSCAN_predicted_peptide_2|297_aa MAEMTETGFRMWIKMNFSELKEHVATEGKEAKNHDKTMQELTAKIARIERNITDLIELKN TLQELHNAITSINSRIDQVEKRISELEDCLSEMTAYGTYSKIDHIIGNKTLLSKCKRTEI ITNSLLDHTRIKLEPKIKKFTQNLTTTWKLNKLLLNDFWKLNKNRKTLHKAKNEDHNYVL KEWIHWCGSKHMPLNGILIRKQAKIYHDELKTEEDCEYSTSWLQKFKKQHGIKILKLCGD KVSDDHKAVEKIIHNFAKVIADKNLIPELLLMKILSQNKSIMLIKHHCFGITAPERT >gi568815580r:26070726_26271286|GENSCAN_predicted_CDS_2|894_bp atggctgaaatgacagaaacaggcttcagaatgtggataaaaatgaacttcagtgagcta aaggagcatgttgcaaccgaaggcaaggaagccaagaatcatgataaaacaatgcaagag ctgacagccaaaatagccagaatagagaggaacataactgacctgatagagctgaaaaac acactacaagaacttcacaatgcaattacaagcattaacagcagaatagaccaagtggag aaaagaatctcagagcttgaagactgtctttctgaaatgacagcatatggcacttactct aaaattgatcacataatcggaaataaaacactcctcagcaaatgtaaaagaaccgaaatc ataacaaacagtctcttggaccacaccagaatcaaattagaacccaagattaagaaattc actcaaaaccttacaactacatggaaattaaacaagctgctcctgaatgacttttggaag ttaaacaaaaatagaaaaacgctacataaagctaaaaatgaggatcacaattatgtattg aaagagtggatccactggtgtggcagtaaacacatgcctcttaatggtatactgatcagg aaacaagcaaagatctatcatgatgaactgaaaactgaggaggactgtgaatactcaaca agctggttgcagaaatttaagaaacaacatggcattaaaattttaaagctctgtggtgat aaagtatctgatgaccacaaagcagtggaaaaaatcattcacaactttgccaaagtcatt gctgataaaaatcttatcccagagttactgctgatgaaaatcttatcccagaacaagtct ataatgctgataaaacatcattgttttggcattactgcaccagaaagaacctga >gi568815580r:26070726_26271286|GENSCAN_predicted_peptide_3|191_aa MASRYDRAITVFSPDGHLFQVEYAQEAVKKGSTAVGIRGTNIVVLGVEKKSVAKLQDERT VRKICALDDHVCMAFAGLTADARVVINRARVECQSHKLTVEDPVTVEYITRFIATLKQKY TQSNGRRPFGISALIVGFDDDGISRLYQTDPSGTYHAWKFLAQRLCFLEYYDGEESYSGQ LKWCMAVVLAT >gi568815580r:26070726_26271286|GENSCAN_predicted_CDS_3|576_bp atggcgtctcgatatgacagggcgatcactgtcttctccccagacggacacctttttcaa gttgaatatgcccaggaagcggtgaagaaaggatccaccgcggtcggaattcgaggtacc aatatagttgttcttggggtagaaaaaaaatctgttgccaagcttcaagatgaaagaact gtgaggaaaatttgtgcccttgatgaccatgtctgcatggcttttgcaggacttactgct gatgctagagtagtaataaacagagcccgtgtggagtgccagagccataagcttacggtt gaggacccagtcactgtagaatacataactcgcttcatagcaactttaaagcagaaatat acccaaagcaatggacgaagaccttttggtatttctgccttaattgtaggttttgatgat gatggtatctcaagattgtatcagacagatccttctggtacttatcatgcttggaagttc cttgctcagaggctatgttttctagaatactatgatggtgaagaaagctattcaggtcag ctaaagtggtgcatggcagtagtcctagctacttga >gi568815580r:26070726_26271286|GENSCAN_predicted_peptide_4|187_aa MVGSLNCIVAVSQNMGIGKNGDLPWPPLRNEFRYFQRMTTTSSVEGKQNLVIMGKKTWFS IPEKNRPLKGRINLVLSRELKEPPQGAHFLSRSLDDALKLTEQPELANKVDMVWIVGGSS VYKEAMNHPGHLKLFVTRIMQDFESDTFFPEIDLEKYKLLPEYPGVLSDVQEEKGIKYKF EVYEKND >gi568815580r:26070726_26271286|GENSCAN_predicted_CDS_4|564_bp atggttggttcgctaaactgcatcgtcgctgtgtcccagaacatgggcatcggcaagaac ggggacctgccctggccaccgctcaggaatgaattcagatatttccagagaatgaccaca acctcttcagtagaaggtaaacagaatctggtgattatgggtaagaagacctggttctcc attcctgagaagaatcgacctttaaagggtagaattaatttagttctcagcagagaactc aaggaacctccacaaggagctcattttctttccagaagtctagatgatgccttaaaactt actgaacaaccagaattagcaaataaagtagacatggtctggatagttggtggcagttct gtttataaggaagccatgaatcacccaggccatcttaaactatttgtgacaaggatcatg caagactttgaaagtgacacgttttttccagaaattgatttggagaaatataaacttctg ccagaatacccaggtgttctctctgatgtccaggaggagaaaggcattaagtacaaattt gaagtatatgagaagaatgattaa >gi568815580r:26070726_26271286|GENSCAN_predicted_peptide_5|117_aa MANAIGRSAKTVREFLEKNYTEDAIASDSEAIKLAIKALLEVVQSGGKNIELAIIRRNQP LKEFKTLVEEVTADVVQIARELKLEVEPEYATELLQSCDKTDDELLLKDEKRMWFLG >gi568815580r:26070726_26271286|GENSCAN_predicted_CDS_5|354_bp atggcaaatgcaataggccgaagtgctaaaactgttcgagaatttctagaaaagaattac acagaagatgccatagcaagtgacagtgaagctatcaagttagcaataaaagctttgcta gaagttgtccagtctggtggaaaaaacattgaacttgctataataagaagaaatcaacct ttgaaggagttcaagactttagtggaggaagtcactgcagatgtggtacaaatagcaaga gaactaaaattagaagtggagcctgaatatgcaactgaattgctgcaatcttgtgataaa acagatgatgagttgcttcttaaagatgagaaaagaatgtggtttcttggatga >gi568815580r:26070726_26271286|GENSCAN_predicted_peptide_6|348_aa MVFQLKCGSGPVYISGQHLVSVEEDAESEDEEEEDVKLLSISGKQSALERKKKEEEEERR KKKEEEGGEEEEEEEEEDEEEEEEVEEEEEEEDFDDEETEEKLSGLWTWDLHQQPPQVLR PSALDGELTHSEASLQVAIMGLLIVVDAVEPEPHRSRLPRAKPPLTSAPGTAAPKLPLSP WGMPAGLTEPAGAAPPAAVSASGTVTMAPAGALPVRVESTPVALGAVTKAPVSVCVEPTA SQPLRSPVGTLVTKVAPVSAPPKVSSGPRLPAPQIVAVKAPNTTTIQFPANLQLPPGMLI TLSSLVCRSSWRRRENSQLQIAPKKTEPREHFIILTLQLTYSFCVITT >gi568815580r:26070726_26271286|GENSCAN_predicted_CDS_6|1047_bp atggtctttcagttgaagtgtggttcagggccagtgtatattagtggacagcacttagta tctgtggaggaagatgcagagtcagaagatgaagaagaggaggatgtgaaactcttaagt atatctggaaagcaatctgccctggagaggaagaagaaagaggaggaggaggaaagaagg aagaagaaagaagaagaaggaggagaagaagaagaggaggaggaggaagaggatgaggag gaagaggaggaggtggaagaggaagaggaagaagaggattttgatgatgaggaaactgaa gaaaaattatcgggtctttggacctgggacttgcaccagcagcctccccaggttctcagg ccttcagccttagatggagagttaacccattctgaggccagcttgcaggtggctatcatg ggacttctcatagttgtcgatgctgtggaacccgaaccgcaccggagtcggctgccgcgc gccaagcctcccctcacctctgctcccggaaccgcagcgccaaagctgccgctgagcccc tgggggatgcccgccggcctcaccgaacccgccggcgccgctcccccggctgctgtgagc gcctcggggaccgtgaccatggccccggccggggcgctgccggtgcgggtggagagcact ccggtggccctgggcgccgtgactaaggctcctgtcagcgtctgcgtggagcccacggcg tcccagcccctgcggtcccccgtggggaccctggtgaccaaagtggctccggtcagcgcc cctcctaaagtcagcagcggccctaggctgcctgctcctcagatagtcgccgtgaaagcc cccaacaccacgacaatccagtttcctgctaatttgcagcttcctccaggtatgttgata accctttccagccttgtctgtcggtccagctggcggcgacgggaaaattcgcagctccag attgctcctaaaaaaactgagccacgggaacatttcataatcttgactttacagttaaca tattctttttgtgtcattactacatag >gi568815580r:26070726_26271286|GENSCAN_predicted_peptide_7|683_aa MAWPQYSLSDGEKWPTEGSINYNTILQFDLFCKREGKWSEVSYVQAFFSLKDNSQLCKAC NLHPTGGPLSLPPYPSFPTAPLPINDKPPLISPTQKETSKKSPRDHKKNPRLSVTSPSSC RGREIWPNLDTCPFYLSDLKQIKVDLGKFSDNPDRYIDVLQGLGQTFNLTWRDVLLLLDQ TLAFNEKNVALAAAQEFEDTWSLSQVNDRITAKERDKFTTSQQAIPSTDPHWDPDSDHGD WSHKHLLTCVLEGLRRIRKKPMNYSMMSTITQGKEENLTALLQRLREALRKYTPLSPDTL EVAYLSKEIDVVVKDWPHCLRVVAAVAILVSEGIKITQGKDLTVWTTHDVNGTLRAKGSL WLSDNCLLRYQALLLKGPVLQICTCAALNPATFLLEDGEPIEHDCQQIVTQTYATQEDLL EVPLANPDLNLYIDESPFVENGIQRAGYAIVSDVTVLESKPLPPGTSTQLAELVALTRAL ELGKEKRINVYTDSKYVYLILHAHAAIRKEREFPTSGRTPIKYLKEIMKLLHTVQKPKEV AVLHCQSHQKGEGEKAEGNRQADAEAKIAARQNLPLGIPMEEPLVWNNPLQEINPQYSLT ETEWGLSRGHIFLPSGWLTTEEGKDRGHQATDGLTNGTPNELNSQLLLRTPGPTHWPFDW PREFTSRGHYNCRAPSSPLSSKK >gi568815580r:26070726_26271286|GENSCAN_predicted_CDS_7|2052_bp atggcctggccccagtattctctctctgatggggaaaaatggccaactgagggaagtata aattacaatactatcctgcagtttgaccttttctgtaagagggaaggcaaatggagtgag gtatcttacgtccaagctttcttttcattgaaggataattcacaactatgcaaagcttgc aatctacatcccacaggaggacctctcagcttacctccatatcctagcttccctacagct ccccttcctattaatgataagcctcctctaatctcccccacccagaaggaaacaagcaag aaatctccaagggaccacaaaaaaaaccccaggctatcggttacatccccttcaagctgt aggggaagggaaatttggcccaacctggatacatgtcccttctacctctctgatttaaag cagatcaaggtagacctggggaagttttcagataatcctgataggtatatagatgtccta cagggtctagggcaaaccttcaacctcacttggagagatgtcttgctattgttagatcaa accctggcctttaatgagaagaatgtggctttagctgcagcccaagagttcgaagatacc tggtctcttagtcaagtaaatgacagaataacagccaaagaaagggataaattcactacc agtcagcaagccatccctagtacggatccccactgggacccagactcagatcatggggac tggagtcacaaacatctgttgacctgtgttctagaaggactaaggagaattaggaaaaag cccatgaattattcaatgatgtccaccataactcagggaaaggaagaaaatcttactgcc ttgctccagcggctacgggaggccttaagaaaatatactcccctgtcacccgacaccctt gaggtggcatacctaagtaaggaaattgatgtagtagtaaaagactggcctcactgttta cgggtagttgcagcggtggccatcttagtatcagagggtatcaaaataacacaaggaaag gatctcactgtctggactactcatgatgtaaatggcacactacgtgccaaaggaagttta tggctatcagacaactgcctgcttagataccaggcgctactccttaagggaccggtgctt caaatatgtacttgtgcagccctcaaccctgctacttttctcctagaggatggagaacca atcgagcatgactgccaacaaattgtgacccagacttacgccacccaagaggatctctta gaagtccccttagctaatcctgaccttaacctatatatcgatgaaagtccatttgtggag aatgggatacaaagggcaggttatgccatagttagtgatgtaacagtacttgaaagtaag cctcttcccccagggaccagcacccagttagcagaactagtggcacttacccgagcctta gaactgggaaaggaaaaaagaataaatgtgtatacagatagcaagtatgtttatctaatc ctacatgcccatgctgcaatacggaaagaaagggagttcccaacctctgggagaaccccc attaaatacctcaaggaaatcatgaagttattgcacacagtgcaaaaacccaaggaggtg gcagtcttacactgccaaagccatcaaaaaggtgaaggagaaaaggcagaaggaaaccgt caggcagacgctgaggccaaaattgctgccaggcagaatctcccattaggaatacctatg gaggaacccttggtatggaacaaccctctccaagagattaacccccagtattccctgact gaaacagaatggggactttcacgggggcatatttttctcccctcggggtggttaacaaca gaagagggaaaggatcgaggccatcaagctacagatggtcttacaaatggcaccccaaat gagctcaactcacaacttctactgaggacccctggaccaacccactggccctttgactgg cctagagaattcacctccagaggacactacaactgcagggccccttcttcgcccctatcc agcaagaagtaa >gi568815580r:26070726_26271286|GENSCAN_predicted_peptide_8|211_aa MIISIDVEKAFDKLQQPFMLKTLNKLGIDVTYLKIIRAIYDKLTANIILNVQKLEAFPLK TGTRQGCPLSPLLFNIVLEVLARAIRQEKERKGIQLGKEEVKLSLFADDMIVYLENPIVS AQNLLKLTSNFSKVSGYKTNVQKSQAFLYTNNRQTESQILRELPFTIASKRIKYLGIQLT RDVKDLFKENYKPLLNEIKEDTNGRTFHAHG >gi568815580r:26070726_26271286|GENSCAN_predicted_CDS_8|636_bp atgattatctcaatagatgtggaaaaggcctttgacaaacttcaacagcccttcatgcta aaaactctcaataagttaggtattgatgtgacgtatctcaaaataataagagctatttat gacaaactcacagccaatatcatactgaatgtacaaaaactggaagcattccctttgaaa actggcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagtt ctggccagggcaatcaggcaggagaaagaaagaaagggtattcaattaggaaaagaggaa gtcaaattgtccctgtttgcagatgacatgattgtatatttagaaaaccccattgtctca gcccaaaatctccttaagctgacaagcaacttcagcaaagtctcaggatacaaaaccaat gtgcaaaaatcacaagcattcctatacaccaataacagacaaacagagagccaaatcctg agggaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttaca agggacgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaagag gacacaaatggaagaacattccatgctcatggatag >gi568815580r:26070726_26271286|GENSCAN_predicted_peptide_9|108_aa MFGFHTPKMYRSIEGCCICRAKSSSSRFTDSKRYEKDFHSCSGLHETPSGDICNACVLLV KRWKKLPAGSKKKLESCGRCKGWTQSKDYIETKESDSIWEQDKEQPYQ >gi568815580r:26070726_26271286|GENSCAN_predicted_CDS_9|327_bp atgtttggttttcacacgccaaagatgtaccgaagtatagagggctgctgtatttgcaga gctaagtcctctagttctcgattcactgacagtaaacgctatgaaaaggacttccacagc tgttctggattgcatgagactccttcaggagacatctgcaatgcctgtgtcctgcttgtg aaaagatggaagaagttgccagcaggatcaaaaaaaaaactggaatcatgtggtagatgc aagggctggacccagtctaaagactacattgaaaccaaagaaagtgactctatctgggaa caagataaagagcaaccatatcagtaa >gi568815580r:26070726_26271286|GENSCAN_predicted_peptide_10|55_aa DKPIDQGIKEKKHEWPPLTITHNKHSLEDFVFPISAHRKQTPLPEDKARVPVNRH >gi568815580r:26070726_26271286|GENSCAN_predicted_CDS_10|168_bp gacaaacccatagatcaaggaatcaaggagaagaagcatgaatggcccccacttaccatc actcacaacaaacactcactggaggactttgtgtttcccatttctgcccacaggaagcaa acacctctaccagaggacaaagcaagagttcctgtaaatagacactaa