GENSCAN 1.0 Date run: 6-Nov-116 Time: 14:41:15

Sequence gi568815586r:122629302_122830339 : 201038 bp : 45.54% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 Intr -   6296   6198   99  0  0  118   98    55 0.424   9.58
 1.01 Init -  27151  27016  136  1  1   54   46    56 0.060  -1.70
 1.00 Prom -  29492  29453   40                              -1.56

 2.02 PlyA -  29552  29547    6                               1.05
 2.01 Sngl -  46682  46344  339  2  0   83   43   259 0.992  16.94
 2.00 Prom -  46782  46743   40                              -3.56

 3.03 PlyA -  46818  46813    6                               1.05
 3.02 Term -  47882  47773  110  0  2    5   55    98 0.156  -3.13
 3.01 Init -  73982  72935 1048  2  1   57  105  1343 0.927 127.19
 3.00 Prom -  74088  74049   40                              -6.36

 4.03 PlyA -  74815  74810    6                               1.05
 4.02 Term -  76263  76184   80  1  2   26   34   121 0.119  -1.57
 4.01 Init -  87436  86389 1048  1  1   57  105  1279 0.835 120.79
 4.00 Prom -  89145  89106   40                              -6.86

 5.02 PlyA -  89534  89529    6                               1.05
 5.01 Sngl - 101038  99998 1041  1  0   83   47   809 0.994  73.44
 5.00 Prom - 118577 118538   40                              -6.16

 6.00 Prom + 118754 118793   40                              -0.16
 6.01 Init + 123250 123291   42  0  0   68   85     7 0.477  -1.17
 6.02 Intr + 124392 124516  125  2  2   92   -4   129 0.014   3.48
 6.03 Intr + 132886 132905   20  1  2   77  107    19 0.024  -1.05
 6.04 Intr + 136003 136086   84  2  0   58   94    73 0.919   4.69
 6.05 Intr + 139541 139620   80  0  2   30  105    52 0.382   0.37
 6.06 Term + 139733 139777   45  1  0   73   38   109 0.935   1.61
 6.07 PlyA + 140116 140121    6                               1.05

 7.00 Prom + 142514 142553   40                              -1.96
 7.01 Init + 145370 145405   36  1  0   35  115    50 0.443   0.47
 7.02 Intr + 148190 148382  193  1  1   60   77   213 0.717  16.47
 7.03 Intr + 151863 151909   47  1  2   47   93    -7 0.215  -6.17
 7.04 Intr + 156418 156519  102  0  0   75   84    43 0.719   2.97
 7.05 Intr + 159457 159628  172  0  1   75  116   147 0.951  15.72
 7.06 Intr + 162719 162820  102  0  0   17   58   118 0.036   1.85
 7.07 Intr + 168006 168094   89  1  2   39   91    18 0.008  -3.11
 7.08 Intr + 168784 168899  116  0  2   38   89    76 0.013   1.95
 7.09 Intr + 171823 172551  729  1  0   47  113   282 0.019  17.15
 7.10 Intr + 176850 176994  145  0  1   50   83    34 0.250  -0.62
 7.11 Intr + 183993 184118  126  2  0   70   22   168 0.546   9.28
 7.12 Term + 191834 191878   45  1  0  114   36    44 0.443  -0.99
 7.13 PlyA + 198060 198065    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 124392 124520  129  2  0   92   42   118 0.949   5.78
S.002 Term + 162719 162849  131  0  2   17   37   159 0.895   2.04

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:122629302_122830339|GENSCAN_predicted_peptide_1|79_aa
MVSVYDRMYYYLVMKWNKALTQAVPWRNFENMMSSEKSQVQKAPHECSNHILQGKSLDQE
SNLILHFPDFLLSRLHQEX

>gi568815586r:122629302_122830339|GENSCAN_predicted_CDS_1|237_bp
atggtgtctgtgtacgacagaatgtattattatttggtcatgaaatggaataaagcactg
acacaagctgtcccatggaggaactttgaaaacatgatgtcaagtgaaaaaagccaggtg
caaaaggccccgcatgagtgttccaaccacatcctccagggaaaatccctcgaccaagag
tccaatctgattttgcacttccctgacttcttgctaagccgtctacaccaggaggnn

>gi568815586r:122629302_122830339|GENSCAN_predicted_peptide_2|112_aa
MHLKSCNTTKVCCFTPEPARPRTHQKEETPKTSEHQKEQTPHTLPLRTVTLTARVRGFIL
EVSKTKNPPIPDTLCPKPMAAEAVPPTSTWGFTHVGKHQQPDLTDRVTDQED

>gi568815586r:122629302_122830339|GENSCAN_predicted_CDS_2|339_bp
atgcaccttaagagctgtaacaccacgaaggtctgctgcttcactcctgagccagcgaga
ccacgaacccaccagaaggaagaaactccaaagacatccgaacatcagaaggaacaaact
ccgcacacgctgcctttaagaactgtaacactcaccgcgagggtccgcggcttcattctt
gaagtcagtaagaccaagaacccaccaattccggacacactatgtcccaagcccatggca
gcagaggctgttccacctacttctacctggggatttacccatgtaggaaagcatcaacaa
ccggatttgactgaccgagtgactgaccaagaggattaa

>gi568815586r:122629302_122830339|GENSCAN_predicted_peptide_3|385_aa
MNRHHLQDHFLEIDKKNCCVFRDDFIVKVLPPVLGLEFIFGLLGNGLALWIFCFHLKSWK
SSRIFLFNLAVADFLLIICLPFLMDNYVRRWDWKFGDIPCRLMLFMLAMNRQGSIIFLTV
VAVDRYFRVVHPHHALNKISNRTAAIISCLLWGITIGLTVHLLKKKMPIQNGGANLCSSF
SICHTFQWHEAMFLLEFFLPLGIILFCSARIIWSLRQRQMDRHAKIKRAITFIMVVAIVF
VICFLPSVVVRIRIFWLLHTSGTQNCEVYRSVDLAFFITLSFTYMNSMLDPVVYYFSSPS
FPNFFSTLINRCLQRKMTGEPDNNRSTSVELTGDPNKTRGAPEALMANSGEFYQLLLRGD
SVLAVLTALARSRRLLCLGSLFGRT

>gi568815586r:122629302_122830339|GENSCAN_predicted_CDS_3|1158_bp
atgaatcggcaccatctgcaggatcactttctggaaatagacaagaagaactgctgtgtg
ttccgagatgacttcattgtcaaggtgttgccgccggtgttggggctggagtttatcttc
gggcttctgggcaatggccttgccctgtggattttctgtttccacctcaagtcctggaaa
tccagccggattttcctgttcaacctggcagtggctgactttctactgatcatctgcctg
cccttcctgatggacaactatgtgaggcgttgggactggaagtttggggacatcccttgc
cggctgatgctcttcatgttggctatgaaccgccagggcagcatcatcttcctcacggtg
gtggcggtagacaggtatttccgggtggtccatccccaccacgccctgaacaagatctcc
aatcggacagcagccatcatctcttgccttctgtggggcatcactattggcctgacagtc
cacctcctgaagaagaagatgccgatccagaatggcggtgcaaatttgtgcagcagcttc
agcatctgccataccttccagtggcacgaagccatgttcctcctggagttcttcctgccc
ctgggcatcatcctgttctgctcagccagaattatctggagcctgcggcagagacaaatg
gaccggcatgccaagatcaagagagccatcaccttcatcatggtggtggccatcgtcttt
gtcatctgcttccttcccagcgtggttgtgcggatccgcatcttctggctcctgcacact
tcgggcacgcagaattgtgaagtgtaccgctcggtggacctggcgttctttatcactctc
agcttcacctacatgaacagcatgctggaccccgtggtgtactacttctccagcccatcc
tttcccaacttcttctccactttgatcaaccgctgcctccagaggaagatgacaggtgag
ccagataataaccgcagcacgagcgtcgagctcacaggggaccccaacaaaaccagaggc
gctccagaggcgttaatggccaactccggggaattctaccagcttctactgagaggtgac
agcgtgctggcagtcctcacagcccttgctcgctctcggcgcctcctctgcctgggctcc
ctctttggccgcacttga

>gi568815586r:122629302_122830339|GENSCAN_predicted_peptide_4|375_aa
MNRHHLQDHFLEIDKKNCCVFRDDFIAKVLPPVLGLEFIFGLLGNGLALWIFCFHLKSWK
SSRIFLFNLAVADFLLIICLPFVMDYYVRRSDWKFGDIPCRLVLFMFAMNRQGSIIFLTV
VAVDRYFRVVHPHHALNKISNWTAAIISCLLWGITVGLTVHLLKKKLLIQNGTANVCISF
SICHTFRWHEAMFLLEFFLPLGIILFCSARIIWSLRQRQMDRHAKIKRAITFIMVVAIVF
VICFLPSVVVRIHIFWLLHTSGTQNCEVYRSVDLAFFITLSFTYMNSMLDPVVYYFSSPS
FPNFFSTLINRCLQRKITGEPDNNRSTSVELTGDPNKTRGAPEALIANSGATIIFFIIVF
ISPGSSSSTCSSMGI

>gi568815586r:122629302_122830339|GENSCAN_predicted_CDS_4|1128_bp
atgaatcggcaccatctgcaggatcactttctggaaatagacaagaagaactgctgtgtg
ttccgagatgacttcattgccaaggtgttgccgccggtgttggggctggagtttatcttt
gggcttctgggcaatggccttgccctgtggattttctgtttccacctcaagtcctggaaa
tccagccggattttcctgttcaacctggcagtagctgactttctactgatcatctgcctg
ccgttcgtgatggactactatgtgcggcgttcagactggaagtttggggacatcccttgc
cggctggtgctcttcatgtttgccatgaaccgccagggcagcatcatattcctcacggtg
gtggcggtagacaggtatttccgggtggtccatccccaccacgccctgaacaagatctcc
aattggacagcagccatcatctcttgccttctgtggggcatcactgttggcctaacagtc
cacctcctgaagaagaagttgctgatccagaatggcactgcaaatgtgtgcatcagcttc
agcatctgccataccttccggtggcacgaagctatgttcctcctggagttcttcctgccc
ctgggcatcatcctgttctgctcagccagaattatctggagcctgcggcagagacaaatg
gaccggcatgccaagatcaagagagccatcaccttcatcatggtggtggccatcgtcttt
gtcatctgcttccttcccagcgtggttgtgcggatccacatcttctggctcctgcacact
tcgggcacgcagaattgtgaagtgtaccgctcggtggacctggcgttctttatcactctc
agcttcacctacatgaacagcatgctggaccccgtggtgtactacttctccagcccatcc
tttcccaacttcttctccactttgatcaaccgctgcctccagaggaagataacaggtgag
ccagataataaccgcagcacgagcgtcgagctcacaggggaccccaacaaaaccagaggc
gctccagaggcgttaatcgccaactccggtgccaccatcatcttcttcatcatcgtcttc
atctcacctggatcatcatcaagtacttgtagttctatgggaatctaa

>gi568815586r:122629302_122830339|GENSCAN_predicted_peptide_5|346_aa
MYNGSCCRIEGDTISQVMPPLLIVAFVLGALGNGVALCGFCFHMKTWKPSTVYLFNLAVA
DFLLMICLPFRTDYYLRRRHWAFGDIPCRVGLFTLAMNRAGSIVFLTVVAADRYFKVVHP
HHAVNTISTRVAAGIVCTLWALVILGTVYLLLENHLCVQETAVSCESFIMESANGWHDIM
FQLEFFMPLGIILFCSFKIVWSLRRRQQLARQARMKKATRFIMVVAIVFITCYLPSVSAR
LYFLWTVPSSACDPSVHGALHITLSFTYMNSMLDPLVYYFSSPSFPKFYNKLKICSLKPK
QPGHSKTQRPEEMPISNLGRRSCISVANSFQSQSDGQWDPHIVEWH

>gi568815586r:122629302_122830339|GENSCAN_predicted_CDS_5|1041_bp
atgtacaacgggtcgtgctgccgcatcgagggggacaccatctcccaggtgatgccgccg
ctgctcattgtggcctttgtgctgggcgcactaggcaatggggtcgccctgtgtggtttc
tgcttccacatgaagacctggaagcccagcactgtttaccttttcaatttggccgtggct
gatttcctccttatgatctgcctgccttttcggacagactattacctcagacgtagacac
tgggcttttggggacattccctgccgagtggggctcttcacgttggccatgaacagggcc
gggagcatcgtgttccttacggtggtggctgcggacaggtatttcaaagtggtccacccc
caccacgcggtgaacactatctccacccgggtggcggctggcatcgtctgcaccctgtgg
gccctggtcatcctgggaacagtgtatcttttgctggagaaccatctctgcgtgcaagag
acggccgtctcctgtgagagcttcatcatggagtcggccaatggctggcatgacatcatg
ttccagctggagttctttatgcccctcggcatcatcttattttgctccttcaagattgtt
tggagcctgaggcggaggcagcagctggccagacaggctcggatgaagaaggcgacccgg
ttcatcatggtggtggcaattgtgttcatcacatgctacctgcccagcgtgtctgctaga
ctctatttcctctggacggtgccctcgagtgcctgcgatccctctgtccatggggccctg
cacataaccctcagcttcacctacatgaacagcatgctggatcccctggtgtattatttt
tcaagcccctcctttcccaaattctacaacaagctcaaaatctgcagtctgaaacccaag
cagccaggacactcaaaaacacaaaggccggaagagatgccaatttcgaacctcggtcgc
aggagttgcatcagtgtggcaaatagtttccaaagccagtctgatgggcaatgggatccc
cacattgttgagtggcactga

>gi568815586r:122629302_122830339|GENSCAN_predicted_peptide_6|131_aa
MILGLINRESFVTQVCEMAADISESSGADCKGDPRNSAKLDADYPLRVLYCGGKCCLFIT
NRENSPKQEAGISEGQGTAGEEEEKKKQKRVTGEDEIIIQGDFTDDIIDVIQEKWPEVDD
DSIEDLGEVKK

>gi568815586r:122629302_122830339|GENSCAN_predicted_CDS_6|396_bp
atgatcctcggactcattaatagggagtcatttgtgactcaggtttgtgaaatggctgct
gacatttctgaatccagcggggctgactgcaaaggagacccaaggaacagtgccaagtta
gatgccgattacccacttcgagtcctttattgtggaggcaagtgttgtctgttcattacc
aacagagaaaattcacccaaacaagaagctggaattagtgagggtcaaggaacagcaggg
gaagaagaggagaagaaaaaacagaagagagtaacaggggaggatgaaattatcattcag
ggagattttacagatgacataattgatgtcattcaggaaaaatggccagaggtagatgat
gacagcatcgaagatcttggagaagtaaagaagtga

>gi568815586r:122629302_122830339|GENSCAN_predicted_peptide_7|633_aa
MNPPAAFLAGRQNIGSEVEISTIEKQRKELQLLIGELKDRDKELNDMVAVHQQQLLSWEE
DRQKVLTLEERCSKLEGELHKRTEIIRSLTKKARNETLSNTLVELSAQVGQLQAREQALT
TMIKLKDKDIIEAVNHIADCSGKFKMLEHALRDAKMAETCIVKEKQDYKQKLKALKIEVN
KLKEDLNEKTTENNEQREEIIRLKQEKSCLHDELLFTVEREKRKDELLNIAKSKQERTNS
ELHNLRQIYVKQQSDLQFLNFNVENSQELIQMYDSKMEESKALDSSRDMCLSDLENNHPK
VDIKREKNQKSLFKDQKFEAMLVQQNRSDKSSCDECKEKKQQIDTVFGEKSVITLSSIFT
KDLVEKHNLPWSLGGKTQIEPENKITLCKIHTKSPKCHGTGVQNEGKQPSETPTLSDEKQ
WHDVSVYLGLTNCPSSKHPEKLDVECQDQMERSEISCCQKNEACLGESGMCDSKCCHPSN
FIIEAPGHMSDVEWMSIFKPSKMQRIVRLKSGCTCSESICGTQHDSPASELIAIQDSHSL
GSSKSALREDETESSSNKKNSPTSLLIYKDAPAFNEKDDFSPTSKLQRLLAESRQMVTDL
ELSTLLPISHENLTGSATNTPKKANEALRVADA

>gi568815586r:122629302_122830339|GENSCAN_predicted_CDS_7|1902_bp
atgaaccctccggcagccttccttgccgggcgccagaacatcgggtcagaagttgagatt
tccactatcgagaaacaacggaaggagctgcagttgctcattggagaattaaaagatcga
gataaagagctcaatgacatggttgcagtgcaccagcaacagcttctttcatgggaagag
gatcggcagaaagtgttgacactggaagaacgttgcagcaaattagaaggtgaactacat
aaaagaactgaaataatcaggtcactcacgaagaaggctagaaacgaaactctcagcaac
acgttagtggaactttctgcccaggtaggacagctacaagctcgagaacaagctcttacg
acaatgataaagctaaaggacaaagatattattgaggcagttaatcacattgcagattgt
tcgggtaaatttaaaatgctagagcatgccctacgtgatgccaagatggcggagacttgt
attgtgaaagaaaagcaagattataagcagaaattgaaggcacttaagattgaagtcaac
aaactaaaagaggacctcaatgaaaagacgacagaaaataatgagcaacgagaagagatc
attcgcctcaagcaagagaaaagttgcctgcacgatgaattgctttttactgtagagaga
gaaaagaggaaagatgaattgcttaatattgcgaagtcaaagcaagaacgcacaaattca
gaactgcacaatctgagacagatttatgtaaaacaacagagtgatctgcagtttcttaat
ttcaatgtggaaaattctcaggaattaatacagatgtatgactcaaagatggaggaatca
aaggctctggactccagcagagacatgtgtttatcagaccttgaaaataaccacccaaaa
gtcgatattaagagggaaaaaaatcagaagtcactgtttaaggaccagaaatttgaagcc
atgttggttcagcaaaataggtcagacaagagctcttgcgatgaatgcaaagagaagaaa
caacagatcgatactgtgtttggggagaaaagtgtaattacgctgtcatccatattcacc
aaagacttagtagagaaacacaacctcccttggtctctgggaggaaaaacccagattgaa
cccgaaaacaaaattacattgtgcaagatccacacaaaatcaccaaaatgtcatggcact
ggggttcagaacgaaggaaaacaaccctcagaaacacccactttatctgatgagaagcag
tggcatgatgtcagtgtttacctgggcctgaccaactgtccaagttcaaaacatccagaa
aagctggatgtagaatgtcaagatcagatggaaaggtccgaaatctcatgctgccagaaa
aatgaagcctgtctgggcgaaagtggcatgtgtgactccaagtgctgccacccgagtaac
ttcataattgaagccccaggccacatgtctgacgtggagtggatgagtattttcaagcct
tccaaaatgcagagaattgtccgcctcaaatctgggtgcacctgttcagaaagcatctgt
ggcacacaacatgactccccggcaagtgagctaattgccatccaagattcccactctttg
ggttcttcaaaatctgccttgagagaagatgagacggagtcctcttccaataaaaagaac
tcacctacgagtttgttaatctacaaagatgcaccagcattcaatgaaaaggatgatttc
tcgcccacgagcaagctccagcgtttgctggcggaatctcgtcagatggtgacggacctg
gagctgagcacactgctgcccatcagccatgagaatctcactggcagtgccacaaataca
cctaagaaggctaacgaggcacttagagttgccgatgcttaa