GENSCAN 1.0 Date run: 3-Nov-116 Time: 03:45:11 Sequence gi568815595r:165087900_165290830 : 202931 bp : 34.78% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 Intr - 18284 18123 162 0 0 31 89 156 0.354 9.03 1.02 Intr - 34368 34227 142 0 1 47 77 34 0.008 -2.79 1.01 Init - 35195 35046 150 2 0 52 60 175 0.019 11.19 1.00 Prom - 35487 35448 40 -5.45 2.02 PlyA - 36427 36422 6 -1.75 2.01 Sngl - 37643 37053 591 2 0 65 33 207 0.866 8.84 2.00 Prom - 46999 46960 40 -3.65 3.02 PlyA - 47740 47735 6 1.05 3.01 Sngl - 48550 47996 555 1 0 68 47 222 0.710 11.97 3.00 Prom - 49856 49817 40 -5.55 4.04 PlyA - 50347 50342 6 1.05 4.03 Term - 51423 50689 735 0 0 -8 36 558 0.589 33.47 4.02 Intr - 60499 60417 83 2 2 92 68 -19 0.007 -5.06 4.01 Init - 70016 69899 118 2 1 81 115 35 0.542 5.91 4.00 Prom - 75006 74967 40 -3.25 5.02 PlyA - 75170 75165 6 1.05 5.01 Sngl - 81027 80689 339 0 0 88 32 270 0.663 17.18 5.00 Prom - 99599 99560 40 -6.15 6.02 PlyA - 99947 99942 6 1.05 6.01 Sngl - 102931 99998 2934 1 0 66 42 1857 0.624 170.99 6.00 Prom - 104135 104096 40 -11.74 7.03 PlyA - 104644 104639 6 1.05 7.02 Term - 105820 105459 362 2 2 23 43 231 0.178 5.81 7.01 Init - 108933 108693 241 0 1 62 111 94 0.427 6.77 7.00 Prom - 113603 113564 40 -4.15 8.00 Prom + 123972 124011 40 -4.35 8.01 Init + 157824 157963 140 2 2 73 72 98 0.523 6.36 8.02 Term + 161867 162158 292 2 1 25 42 172 0.666 0.13 8.03 PlyA + 162948 162953 6 1.05 9.03 PlyA - 164174 164169 6 1.05 9.02 Term - 182424 182000 425 1 2 -64 47 397 0.006 14.87 9.01 Intr - 192117 191919 199 0 1 -7 83 239 0.541 11.90 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 35195 35007 189 2 0 52 48 253 0.959 12.56 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:165087900_165290830|GENSCAN_predicted_peptide_1|152_aa MFLKRVKLPQEEPQVGPLISIPEEGTVIIGDDGSMGVIALIDLPVGQDVELQANIQILKM QRTQVRYSTGKSSPTHIIIGFSKIDMKLKILKATREKAVSSSLESGFSAGKFDRQSELGQ SSSKSGLTTEEVFQRADWYLDADLEKRMGPGX >gi568815595r:165087900_165290830|GENSCAN_predicted_CDS_1|456_bp atgtttctgaaaagagtgaaacttcctcaagaagagcctcaggtaggtcctttaataagt attccagaagaaggcactgttatcataggagatgacggctccatgggtgttattgccctt atagaccttccagtgggacaagatgtagagctacaggccaacattcaaattctgaaaatg cagagaacccaagtaagatactccacaggaaaatcatctccaacacatataatcatagga ttctctaagattgacatgaaattaaaaatattaaaggcaactagagagaaagcagtttca agttctctggaatctggatttagcgcaggtaaatttgacagacaaagtgaattgggtcag tcttcgagcaaaagtggtttgacaactgaagaggtgttccaaagagcagattggtatttg gatgcagacctagaaaagagaatgggaccaggtgnn >gi568815595r:165087900_165290830|GENSCAN_predicted_peptide_2|196_aa MSGKSISADVKAAEEFWETVHKLIVEENYLPEQIFNMNETSLFWKQMPERTFIYNEVSSM PGVKAFKDRKTALLGGNIAGYKLKPFVIWHHRDSRAFKHINKCTLIVNYRNSMKLHIIQF PFQDTLWNCYASKMEKCFVKNDIPFKIVFIVHNAPAHPPFIGDFLPNIKEVLFFSKHHLF GPTNGTRSYSSFSSSF >gi568815595r:165087900_165290830|GENSCAN_predicted_CDS_2|591_bp atgagtggtaagtctataagtgctgatgtgaaagcagctgaagaattttgggaaactgta cataagctgattgtggaggaaaattacttgccagagcaaatcttcaatatgaatgaaacc tccctattctggaaacagatgcctgaaagaactttcatctacaatgaagtgtcatcaatg ccaggtgtgaaggcttttaaggacaggaaaacagccttgcttgggggcaatattgcaggc tacaaattaaaaccttttgtgatctggcaccatagggactcccgggccttcaagcatatt aataagtgcacactgatagtgaactacagaaacagtatgaagttacatattatccagttc cccttccaagataccctctggaactgctatgcaagcaaaatggagaagtgctttgtaaaa aatgacatacctttcaaaattgtgtttattgttcataatgctcccgcacatcctcctttt ataggtgattttcttcctaatattaaagaggtgctttttttctccaaacaccatctcttt ggtccaaccaatggaacaaggagttacagcagcttttcttcttccttttag >gi568815595r:165087900_165290830|GENSCAN_predicted_peptide_3|184_aa MPFLTTPTHHSVGSFGPVNQQEKEVKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLK LISNFSKASGYKINVQKSQAFLHTNNRQTESQIMSELPFTIASKRIIQLTRDVKDLFKEN YKSLLHEIKKDTNKWKNIPCSCIGRINTMKMGILPKVIYRFNAIPIKLPMTFFTELEKTT LKFI >gi568815595r:165087900_165290830|GENSCAN_predicted_CDS_3|555_bp atgccctttctcaccactcctactcatcatagtgttggaagttttggcccggtcaaccag caggagaaagaagtaaagggcattcaattaggaaaagaggaagtcaaattgtcccttttt gcagatgacatgattgtatatttagaaaaccccatcgtctcagcccaaaatctccttaag ctgataagcaacttcagcaaagcctcaggatacaaaatcaatgtgcaaaaatcacaagca ttcctacacaccaataatagacaaacagagagccaaatcatgagtgaactcccattcaca attgcttcaaagagaataatccaacttacaagggatgtgaaggacctcttcaaggagaac tacaaatcactgctccatgaaataaaaaaggacactaacaaatggaagaacattccatgc tcatgtataggaagaatcaataccatgaaaatgggcatactgcccaaggtaatttataga ttcaatgccatccccatcaagctaccaatgactttcttcacagaactggaaaaaactact ttaaagttcatatga >gi568815595r:165087900_165290830|GENSCAN_predicted_peptide_4|311_aa MHLLGKCFWEECEWWEKTEKWFWARGQWRFLPELAEKTDGSIVISCFIPDVDNFCVFSRF FLPVLLEKDVRTHRKEAKNLEKRLDEWQTRINSIEKTLNYLKELKTMTRQLRDACTSFSS QFDQVEERVSVIEDQMNEMKQEEKFREKRVKRNEQSLQEIWDYVKRPNLRLIGVPESDGE NGTKLENTLQDIIQENLPNLARQANIQIQEIQRMPQRYSSRRATPRHIIVRFTKVEMKEK MLRAATDKGRVTHKGKPIRLTADLSAETLQARRKWGPIFNKLKEKTFQPRISYPAKLSFI SEGEIKSFTDE >gi568815595r:165087900_165290830|GENSCAN_predicted_CDS_4|936_bp atgcacctgctaggtaagtgcttctgggaagagtgtgaatggtgggagaagactgaaaaa tggttctgggctagaggacagtggcgatttttacctgaactagctgagaaaaccgatggt tctatagtcatatcctgtttcatccctgatgttgataatttttgtgttttctctcgattt tttttgccagttttgctagagaaggatgttcgaacccatcgcaaagaagctaaaaacctt gaaaaaagattagacgaatggcaaactagaataaacagcatcgagaagaccttaaattac ctgaaggagctgaaaaccatgacacgacaactacgagatgcatgcacaagcttcagtagc caattcgatcaagtagaagaaagggtatcagtcattgaagatcaaatgaatgaaatgaag caagaagagaagtttagagaaaaaagagtgaaaagaaatgaacaaagcctccaagaaata tgggactatgtgaaaagaccaaatctacgtctgattggtgtacctgaaagtgatggtgag aatggaaccaagttggaaaacactcttcaggatattatccaggagaacctccccaaccta gcaaggcaggccaacattcaaattcaggaaatacagagaatgccacaaagatactcctcg agaagagctactccaagacacataattgtcagattcaccaaagttgaaatgaaggaaaaa atgttaagggcagccacagataaaggtcgggttacccacaaagggaaacccatcagacta acagctgatctctcagcagaaactctacaagccagaagaaagtgggggccaatattcaac aagcttaaagaaaagacttttcaacccagaatttcatatccagccaaactaagcttcata agtgaaggagaaataaaatcctttacagatgagtaa >gi568815595r:165087900_165290830|GENSCAN_predicted_peptide_5|112_aa MGRNQSRKAENSKYQSTFPPPEDCSSLTAMEQNWMENDIDKLTEIGFRRTVITNFSELKE QVLTHHKEAKNLEKTLDEWLTRINSTEKTLNDLMELKTMKRKPCDSCTSFSS >gi568815595r:165087900_165290830|GENSCAN_predicted_CDS_5|339_bp atggggagaaaccaaagcagaaaagctgaaaattctaaataccagagcacctttcctccc ccagaggattgcagctccttgacagcaatggaacaaaactggatggagaatgacattgac aagttgacagaaataggcttcagaaggacagtaataacaaacttctctgagctaaaggag caggttctaacccatcacaaggaagctaaaaaccttgaaaaaacattagacgaatggctg actagaataaacagtacagagaagaccttaaatgacctgatggagctgaaaaccatgaaa cgaaaaccttgtgactcatgcacaagcttcagtagctaa >gi568815595r:165087900_165290830|GENSCAN_predicted_peptide_6|977_aa MKPSIAEMLHRGRMLWIILLSTIALGWTTPIPLIEDSEEIDEPCFDPCYCEVKESLFHIH CDSKGFTNISQITEFWSRPFKLYLQRNSMRKLYTNSFLHLNNAVSINLGNNALQDIQTGA FNGLKILKRLYLHENKLDVFRNDTFLGLESLEYLQADYNVIKRIESGAFRNLSKLRVLIL NDNLIPMLPTNLFKAVSLTHLDLRGNRLKVLFYRGMLDHIGRSLMELQLEENPWNCTCEI VQLKSWLERIPYTALVGDITCETPFHFHGKDLREIRKTELCPLLSDSEVEASLGIPHSSS SKENAWPTKPSSMLSSVHFTASSVEYKSSNKQPKPTKQPRTPRPPSTSQALYPGPNQPPI APYQTRPPIPIICPTGCTCNLHINDLGLTVNCKERGFNNISELLPRPLNAKKLYLSSNLI QKIYRSDFWNFSSLDLLHLGNNRISYVQDGAFINLPNLKSLFLNGNDIEKLTPGMFRGLQ SLHYLYFEFNVIREIQPAAFSLMPNLKLLFLNNNLLRTLPTDAFAGTSLARLNLRKNYFL YLPVAGVLEHLNAIVQIDLNENPWDCTCDLVPFKQWIETISSVSVVGDVLCRSPENLTHR DVRTIELEVLCPEMLHVAPAGESPAQPGDSHLIGAPTSASPYEFSPPGGPVPLSVLILSL LVLFFSAVFVAAGLFAYVLRRRRKKLPFRSKRQEGVDLTGIQMQCHRLFEDGGGGGGGSG GGGRPTLSSPEKAPPVGHVYEYIPHPVTQMCNNPIYKPREEEEVAVSSAQEAGSAERGGP GTQPPGMGEALLGSEQFAETPKENHSNYRTLLEKEKEWALAVSSSQLNTIVTVNHHHPHH PAVGGVSGVVGGTGGDLAGFRHHEKNGGVVLFPPGGGCGSGSMLLDRERPQPAPCTVGFV DCLYGTVPKLKELHVHPPGMQYPDLQQDARLKETLLFSAGKGFTDHQTQKSDYLELRAKL QTKPDYLEVLEKTTYRF >gi568815595r:165087900_165290830|GENSCAN_predicted_CDS_6|2934_bp atgaaaccttccatagctgagatgcttcacagaggaaggatgttgtggataattcttcta agcacaattgctctaggatggactaccccgattcccctaatagaggactcagaggaaata gatgagccctgttttgatccatgctactgtgaagttaaagaaagcctctttcatatacat tgtgacagtaaaggatttacaaatattagtcagattaccgagttctggtcaagacctttt aaactgtatctgcagaggaattctatgaggaaattatataccaacagttttcttcatttg aataatgctgtgtctattaatcttgggaacaatgcattgcaggacattcagactggagct ttcaatggtcttaagattttaaagagactatatctacatgaaaacaaactagatgtcttc agaaatgacaccttccttggcttggaaagtctagaatatctgcaggcagattacaatgtc attaaacgtattgagagtggggcatttcggaacctaagtaaattgagggttctgatttta aatgataatctcatccccatgcttccaaccaatttatttaaggctgtctctttaacccat ttggacctacgtggaaataggttaaaggttcttttttaccgaggaatgctagatcacatt ggcagaagcctgatggagctccagctggaagaaaacccttggaactgtacatgtgaaatt gtacaactgaagagttggctggaacgcattccttatactgccctggtgggagacattacc tgtgagacccctttccacttccatggaaaggacctacgagaaatcaggaagacagaactc tgtcccttgttgtctgactctgaggtagaggctagtttgggaattccacattcgtcatca agtaaggagaatgcatggccaactaagccttcctcaatgctatcctctgttcattttact gcttcttctgtcgaatacaagtcctcaaataaacagcctaagcccaccaaacagcctcga acaccaaggccaccctccacctcccaagctttatatcctggtccaaaccagcctcccatt gctccttatcagaccagaccaccaatccccattatatgccccactgggtgtacctgtaat ttgcacatcaatgaccttggcttgactgtcaactgcaaagagcgaggatttaataacatt tctgaacttcttccaaggcccttgaatgccaagaaactgtatctgagtagcaatctgatt cagaaaatataccgttctgatttttggaatttttcttccttggatctcttgcatctgggg aacaatcgtatttcctatgtccaagatggggcctttatcaacttgcccaacttaaagagc ctcttccttaatggcaacgatatagagaagctgacaccaggcatgttccgaggcctacag agtttgcactacttgtactttgagttcaatgtcatccgggaaatccagcctgcagccttc agcctcatgcccaacttgaagctgctattcctcaataataacttactgaggactctgcca acagacgcctttgctggcacatccctggcccggctcaacctgaggaagaactacttcctc tatcttcccgtggctggtgtcctggaacacttgaatgccattgtccagatagacctcaat gagaatccttgggactgcacctgtgacctggtcccctttaaacagtggatcgaaaccatc agctcagtcagtgtggttggtgatgtgctttgcaggagccctgagaacctcacgcaccgt gatgtgcgcactattgagctggaagttctttgcccagagatgctgcacgttgcaccagct ggagaatccccagcccagcctggagattctcaccttattggggcaccaaccagtgcatca ccttatgagttttctcctcctgggggccctgtgccactttctgtgttaattctcagcctg ctggttctgtttttctcagcagtctttgttgctgcaggcctctttgcctacgtgctccga aggcgtcgaaagaagctgcccttcagaagcaagcggcaggaaggtgtggaccttactggc atccaaatgcaatgccacaggctgtttgaggatggtggaggtggtggtggcggaagtggg ggtggtggtcgaccaactctttcctctccagagaaggcccctcccgtgggtcatgtgtat gagtacatcccccacccggttacccaaatgtgcaacaaccccatctacaagcctcgtgag gaggaggaggtggctgtttcatcagcccaagaagcagggagtgcagaacgtgggggtcca gggacacaaccaccgggaatgggtgaggctctcctaggaagtgagcagtttgctgagaca cccaaggagaaccatagtaactaccggaccttgctggaaaaagagaaggagtgggcccta gcagtgtccagctcccagcttaacaccatagtgacggtgaatcaccatcaccctcaccac ccagcagttggtggggtttcaggagtagttgggggaactgggggagacttggcagggttc cgccaccatgagaaaaatggtggggtggtgctgtttcctcctgggggaggctgtggtagt ggcagtatgctactagatcgagagaggccacagcctgccccctgcacagtgggatttgtg gactgtctctatggaacagtgcccaaattaaaggaactgcacgtgcaccctcctggcatg caatacccagacttacagcaggatgccaggctcaaagaaacccttctcttctcggctgga aagggcttcacagaccaccaaacccaaaaaagtgattacctcgagttaagggccaaactt caaaccaagccggattacctcgaagtcctggagaagacaacatacaggttctaa >gi568815595r:165087900_165290830|GENSCAN_predicted_peptide_7|200_aa MRGERTAAADLHSWRGGGAGGAGGTRWRSSGDGLIWGLPSAAPLVRDLGAADGFAFRFQP CVSYIDSLYTTWRSSLGGDVVVCISFQRIGLSLFLKSGRKEMLLNPGAIEGFDLQHNKRI PGATGSKQNAGETRSKSFRLSELLFGTPRQNQTDSAFLLWGPGFPPIPVIAGNGCPVKMI LTKDSLTFQTTCTREMSFGN >gi568815595r:165087900_165290830|GENSCAN_predicted_CDS_7|603_bp atgcgcggcgagaggactgcggcggctgatttacacagttggcgcggaggaggggccggg ggtgcgggggggacgcgctggcggagctccggcgatgggctgatctgggggctcccgagc gccgctcccttggtgcgcgatttgggggcggctgatggatttgcattcaggttccagccc tgcgtttcctatattgactccttatacacgacctggcgctccagtttaggaggagacgtt gtggtctgtatctccttccagagaattggcttgtctctgtttttaaaaagtggaagaaaa gaaatgctgttgaatccaggggctatcgaaggctttgacctgcagcacaacaaaagaatt cctggtgcaacgggatcgaagcaaaatgcaggagaaactcggtccaagtctttcaggctg tctgagctcttattcgggacacccaggcagaatcagacagattcagcttttctgctttgg ggacctggatttccccccatccccgtcattgcagggaatggctgtcctgtgaaaatgata cttacgaaggatagtctcacttttcaaaccacctgcactagagagatgagctttggaaac taa >gi568815595r:165087900_165290830|GENSCAN_predicted_peptide_8|143_aa MGSFLLTEKIQSNFEKGKNRETSNTGGRIPSLELQDKHFSIRPYQQRYPYLNQTPNIECL LPQYPYLNQTPNNIECLLPQRERRHQREDSPGQKRQAMEGENAKNPSDYCTRVPKITSSS QDCSFPVPISDTICVNVNNREAL >gi568815595r:165087900_165290830|GENSCAN_predicted_CDS_8|432_bp atgggaagtttcctgcttacggagaaaattcagtcaaattttgaaaaaggcaagaataga gaaacaagtaatactggaggaagaattccttctttagaacttcaagataaacatttcagc atcaggccatatcagcagaggtatccttatctaaatcagacccctaatatcgaatgtctt ctaccacaatatccttatctaaaccagacccctaacaatatcgaatgtcttctaccacaa cgagagaggcgccatcagagggaagactcaccagggcagaaaaggcaggccatggaaggg gagaacgcaaagaaccccagtgactactgcacacgggttccaaaaatcacaagttcctcc caggattgctcctttccagtccccatttctgacaccatatgtgtcaatgtgaataacaga gaagcactataa >gi568815595r:165087900_165290830|GENSCAN_predicted_peptide_9|207_aa ELRVGGYKVEHTFWAKNKEEGGGGEEEGGGGGRRGGGGRKEKRRKKEKKEKEEEKKREEG GGEGGGDEWITRITNAEKSLKDLMELKTMAGELRDECTSLSSRLDQLEERVSVMEDQMNE MKREEKFREKRIKRNEQSLREIWDYVKRPNLRLIGVPESYRANGTKLENTLQDIIQENFP NLARQANIQIQEIQRTPQRYSWTQDIH >gi568815595r:165087900_165290830|GENSCAN_predicted_CDS_9|624_bp gaattgcgggtggggggatataaagtagaacatacattttgggcaaaaaataaggaagaa ggaggaggaggtgaagaagaaggaggagggggaggaagaagaggaggaggaggaaggaag gagaaaagaaggaagaaggagaagaaggagaaggaggaggagaagaagagagaagaagga ggaggagaaggaggaggagatgaatggataacaagaataaccaacgcagagaagtcctta aaggacctgatggagctgaaaaccatggcaggagaactacgtgacgaatgcacaagcctc agtagccgattggatcaactggaagaaagggtatcagtgatggaagatcaaatgaatgaa atgaagcgagaagagaagtttagagaaaaaagaataaaaagaaacgaacaaagcctccga gaaatatgggattatgtgaaaagaccaaatctacgtctgattggtgtacccgaaagttac agggcgaatggaaccaagttggaaaacactctgcaggatattatccaggagaacttcccc aatctagcaaggcaggccaacattcaaattcaggaaatacagagaacgccacaaagatac tcctggacacaggatatacattga