GENSCAN 1.0 Date run: 12-Oct-118 Time: 17:03:16 Sequence gi568815586r:10930409_11131335 : 200927 bp : 36.13% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 220 654 435 0 0 78 33 483 0.814 37.82 1.02 PlyA + 1892 1897 6 1.05 2.06 PlyA - 3437 3432 6 1.05 2.05 Term - 16015 15891 125 2 2 34 49 96 0.004 -2.13 2.04 Intr - 29749 29643 107 0 2 81 81 88 0.042 6.34 2.03 Intr - 43313 43247 67 0 1 72 93 46 0.005 0.54 2.02 Intr - 44384 44340 45 0 0 91 94 39 0.679 2.26 2.01 Init - 52705 52564 142 1 1 67 64 71 0.567 2.95 2.00 Prom - 81934 81895 40 -2.85 3.00 Prom + 86592 86631 40 -3.35 3.01 Init + 87375 87401 27 2 0 107 94 6 0.665 2.33 3.02 Intr + 96285 96383 99 2 0 53 40 92 0.016 0.29 3.03 Intr + 118489 118701 213 0 0 97 36 127 0.223 6.39 3.04 Intr + 133213 133326 114 0 0 80 76 42 0.057 1.92 3.05 Intr + 141630 141702 73 2 1 51 99 4 0.008 -4.14 3.06 Intr + 141755 142042 288 0 0 96 61 96 0.099 3.99 3.07 Term + 159005 159189 185 0 2 77 36 127 0.545 3.12 3.08 PlyA + 160345 160350 6 1.05 4.00 Prom + 161587 161626 40 -5.35 4.01 Init + 161820 162042 223 2 1 55 96 124 0.239 8.67 4.02 Term + 174953 175155 203 0 2 45 44 128 0.270 0.67 4.03 PlyA + 176969 176974 6 1.05 5.00 Prom + 177069 177108 40 -8.05 5.01 Sngl + 181212 182417 1206 2 0 62 43 428 0.992 31.27 5.02 PlyA + 182654 182659 6 1.05 6.00 Prom + 183772 183811 40 -3.45 6.01 Init + 183899 184102 204 1 0 66 72 99 0.423 5.00 6.02 Term + 199166 199294 129 1 0 57 42 104 0.009 -0.10 6.03 PlyA + 199580 199585 6 -0.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 22355 22774 420 1 0 65 54 150 0.901 5.38 S.002 Init + 27955 28041 87 0 0 58 78 76 0.877 4.29 S.003 Term + 147475 147573 99 0 0 83 50 98 0.839 2.65 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:10930409_11131335|GENSCAN_predicted_peptide_1|144_aa MSSALLVSTHTDGGDSEQFIDEERQGPPLGGQQSQPSAGDGNQNDGPQQGPPQQGGQQQQ GPPPPQGKPQGPPQQGGHPPPPQGRPQGPPQQGGHPRPPRGRPQGPPQQGGHQQGPPPPP PGKPQGPPPQGGRPQGPPQGQSPQ >gi568815586r:10930409_11131335|GENSCAN_predicted_CDS_1|435_bp atgagctcagctcttcttgtttcaactcacacagatggaggagactctgagcagttcata gatgaggagcgtcagggaccacctttgggaggacagcaatctcaaccctctgctggtgat gggaaccagaatgatggccctcagcagggaccaccccaacaaggaggccagcagcaacaa ggtccaccacctcctcagggaaagccacaaggaccaccccaacagggaggccatccccct cctcctcaaggaaggccacaaggaccaccccaacagggaggccatccccgtcctcctcga ggaaggccacaaggaccaccccaacagggaggccatcagcaaggtcctcccccacctcct cctggaaagccccagggaccacctccccaagggggccgcccacaaggacctccacagggg cagtctcctcagtaa >gi568815586r:10930409_11131335|GENSCAN_predicted_peptide_2|161_aa MHACSNFEFCYIIWHNGQATWARLYICYRARFCFGFDLRQAALANSFTWSRTIAAELVWL FGAAAEEYFACWNLLYLLKSGSFHSYLAKYEASSTIGLSAQSNEAVAVTTSLNLTFVKIT NKLENLEEIDKFLDTYTLPAAGSQRPQTEGPAEAMAEEHKL >gi568815586r:10930409_11131335|GENSCAN_predicted_CDS_2|486_bp atgcatgcctgtagcaactttgaattctgttacatcatctggcacaatggccaagcaact tgggccagactctatatctgctatagagcccgtttttgtttcgggtttgacttgagacaa gcagcccttgcaaactcctttacctggtcaagaaccattgctgctgaactagtgtggttg tttggagctgctgcagaagaatattttgcctgctggaacctgctgtatctactcaagagt ggaagttttcacagttatttagcaaagtatgaagcctcttcaactattggcctatcagcc caatccaatgaggctgttgctgtcaccacttcattaaatctgacatttgttaagatcacc aacaaactagaaaacctagaagagatagataaattcctggacacatacaccctccctgct gctggaagtcagagaccccaaacggagggaccagctgaagccatggcagaagaacataaa ttgtga >gi568815586r:10930409_11131335|GENSCAN_predicted_peptide_3|332_aa MPSRNILFKTGYRMNADYQKLKSTLTNAFQDVLSSLSRAELLVEKNKVGEIGSLEQMKYA EASNTQMLKCLVIAETLEPKILTSNLCEAKLNTVAYQCNNMTQSKPTLDTTRAKNITDKS TLLNQECGKYVQPCCVRNSGAKSKNRWEIAKQLTRILFSGSREYPASSFCVTGTTVFVET SSHHVAQGGLELLASIDIPNCAYQSSGIASKSHHTKPALGVFEGQMLPRFLTHVHPKVDL ISLLLLPLPEHVIEMDKHGSWQDSHLLPDLQTLLEKVLEAKYTHLMTVMLSLSPHCEVAA SRAIPDITTHCFTFLPVSISTRPCHFCLEFDF >gi568815586r:10930409_11131335|GENSCAN_predicted_CDS_3|999_bp atgccgagcagaaacatcctgtttaagacgggttatagaatgaatgcagactaccagaaa cttaaatcaacactcacaaatgctttccaggatgtgctgtcttcactaagcagagcagag cttctggtggagaaaaataaggttggagaaattggcagtcttgagcaaatgaaatatgct gaggctagtaacacccagatgctgaaatgcttggttattgctgagacattagaaccaaaa attcttacttctaatctatgtgaagccaaattaaacacagttgcataccaatgtaataat atgacccagagtaaaccaactctggacaccaccagagcaaaaaacataactgataaatca actctgctaaaccaagagtgtgggaaatatgtacaaccttgttgtgtcaggaattcagga gccaaaagcaaaaacagatgggaaattgcaaagcaactgactcgcatattgttttcaggt tcaagggagtatcctgcctcatccttctgtgtaactgggactacagtttttgtagagaca agctctcaccatgtagcccagggtggtctggaactcctggcctcaatagatattcccaac tgtgcctaccaaagttccgggattgcaagcaagagccatcataccaagcctgcactggga gtttttgaaggtcagatgctacctagatttttgactcatgtacatcccaaggtggatcta atcagtttgctgttattacctcttcctgaacatgtaattgaaatggacaaacatggcagc tggcaggactctcaccttcttcctgacctgcagacccttctggaaaaagtgctggaagcc aagtacacgcatctgatgaccgtcatgctgtctctctcacctcactgtgaagtggctgcc agcagagccataccagacatcaccacacattgtttcacatttcttcctgtctcaatttcc acacgtccttgccatttttgtcttgaatttgacttctaa >gi568815586r:10930409_11131335|GENSCAN_predicted_peptide_4|141_aa MSEQTKKIFLNAGVVSGIGSCRWVRGLADFKREPLTFTVSVAALKDGVDPKSEQQQGLLR REKGQSFHRVKGNPGLDLNIYVYLIPEFAGMSLFLFKFCDQCQTGKHLNMPVDEINAVFM ENMMISKTAQINSYSNTTSLL >gi568815586r:10930409_11131335|GENSCAN_predicted_CDS_4|426_bp atgtctgaacagacaaaaaaaatttttttaaatgctggtgttgtgtccggaattggttcc tgcaggtgggttcgtggtctcgctgacttcaaaagggagccactgaccttcacggtgagt gttgctgctcttaaagatggtgtggacccaaagagtgagcaacagcaaggtttattgaga agagagaaaggacaaagcttccacagagtgaaaggcaacccaggtcttgaccttaacatc tatgtgtacctgattcctgaatttgcaggaatgtccttgttcctttttaaattctgtgac caatgtcaaacaggaaagcatctcaatatgccagtggatgaaatcaatgctgtctttatg gaaaacatgatgatttccaaaacagctcaaattaactcctattcaaacacaacgtccttg ctgtaa >gi568815586r:10930409_11131335|GENSCAN_predicted_peptide_5|401_aa MYQNLWDTAKAVFRGKFIALNAHRRKWERSKVDTLASQLKELEKQEQTNLKASRRQEITK VRAQLKEIETPKTLQKINESRSWFFEKFNKIERQLARLVKKKRENNPIDTIKNNKGDITT DFREIQTIIREYYKHLYANTLENLEEMDKILDTYNIPSLNQEEVKSLNRPITSSETETVI NSLATKKSPGTDIFTTKFYQRYKEEMVPFFLKLFKTLEKEGLLLNSFYEASIILIPKPDR DTAKRENFRPISLMNIDAKILNKILASQIQQHIKRLIHQDQVVFNPGMQGWFNICKSINI IHHINRTNDKNHMIISTDAEKAFDKIQHAFMLKTLKKLGTDGMYLKILSAIYDKPTANIL NGQKREAFPLKIGTKQGCHLSPLLFNIVLEILARTIRKRKK >gi568815586r:10930409_11131335|GENSCAN_predicted_CDS_5|1206_bp atgtaccagaatctctgggacacagctaaagcagtgtttagagggaaatttatagcacta aatgcccacaggagaaagtgggaaagatctaaagttgacaccctagcatcacaattaaaa gaactggagaagcaagagcaaacaaatttaaaagctagcagaagacaagagataactaag gtcagagcacaactgaaggagatagaaacacctaaaacccttcagaaaatcaatgaatcc aggagctggttttttgaaaagttcaacaaaatagaaagacagctagccagactagtaaag aagaaaagagagaataatccaatagacacaataaaaaataataaaggagatattaccact gacttcagagaaatacaaactatcatcagagaatactataaacacctctatgcaaatacg ctagaaaatttagaagaaatggataaaatactggacacatacaacatcccaagtctaaac caggaagaagtcaaatccctgaatagaccaataacaagttctgaaactgagacagtaatt aatagcctagcaaccaaaaaaagtccaggaacagacatattcacaaccaaattctaccag aggtacaaagaggagatggtaccattctttctgaaactattcaaaacactggaaaaagag ggactactccttaactcattttatgaggccagcatcatcctgattccaaaacctgataga gacacagcaaaaagagaaaattttaggccaatatccttgatgaacatcgatgcgaaaatc ctcaataaaatactggcaagccaaatccagcagcatataaaaaggcttatccaccaagat caagtcgtcttcaaccctgggatgcaaggctggttcaacatatgcaaatcaataaacata atccatcacataaacagaaccaatgacaaaaaccacatgattatctcaacagatgcagaa aaggcctttgataaaattcaacatgccttcatgctaaaaacactcaagaaactaggtact gatggaatgtatctcaaaatattaagtgctatttatgacaaacccacagccaatatactg aatgggcaaaaacgggaagcattccctttgaaaatcggcacaaaacaaggatgccatctc tcaccactcctgttcaacatagtattggaaattctggccaggacaatcaggaagagaaag aaataa >gi568815586r:10930409_11131335|GENSCAN_predicted_peptide_6|110_aa MEYYAAIKKDEFMSFAGTWMKLETIILSKLTQEQKTKHHIITHNWELNNENTWTQGGHRE RHTTGPFRMGYRMNADYHKLKSTLTNAFQDVLSSLSRAELLVLFVGLVIW >gi568815586r:10930409_11131335|GENSCAN_predicted_CDS_6|333_bp atggaatactatgcagccataaaaaaggatgagttcatgtcctttgcagggacctggatg aagctggaaaccatcattctcagcaaactaacacaggaacagaaaaccaaacaccacatt atcactcataattgggagctaaacaatgagaacacatggacacagggaggacacagggaa cgtcacacaactgggccttttaggatgggttatcgaatgaatgcagactaccataaactt aaatcaacactcacaaatgctttccaggatgtgctatcttcactgagcagagcagagctt ctggtactttttgtggggctagtgatttggtga