GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:01:58 Sequence gi568815591r:131561949_131762624 : 200676 bp : 46.82% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5437 5604 168 2 0 87 107 61 0.816 7.82 1.02 Intr + 40482 40694 213 1 0 103 99 126 0.060 13.89 1.03 Intr + 64795 64897 103 2 1 60 115 16 0.033 0.73 1.04 Intr + 65815 65840 26 1 2 119 41 38 0.032 -0.13 1.05 Intr + 68022 68121 100 1 1 89 100 13 0.017 1.77 1.06 Intr + 81641 81776 136 2 1 103 86 42 0.209 6.07 1.07 Intr + 90990 91017 28 2 1 80 72 37 0.157 -1.01 1.08 Term + 93033 93157 125 1 2 118 55 32 0.223 1.45 1.09 PlyA + 94781 94786 6 1.05 2.06 PlyA - 96424 96419 6 1.05 2.05 Term - 100575 99998 578 1 2 -20 42 578 0.739 37.03 2.04 Intr - 100697 100592 106 2 1 73 5 211 0.014 11.09 2.03 Intr - 105544 105394 151 0 1 84 53 34 0.001 -0.34 2.02 Intr - 113432 113305 128 2 2 65 56 70 0.137 0.98 2.01 Init - 113805 113803 3 0 0 113 81 0 0.341 1.80 2.00 Prom - 115252 115213 40 -3.06 3.12 PlyA - 117299 117294 6 1.05 3.11 Term - 119617 119511 107 2 2 37 41 246 0.998 13.37 3.10 Intr - 121999 121823 177 2 0 81 45 116 0.634 6.49 3.09 Intr - 127267 127226 42 2 0 135 44 20 0.334 0.51 3.08 Intr - 130270 130076 195 2 0 41 76 88 0.076 2.29 3.07 Intr - 160369 160349 21 2 0 98 82 33 0.023 1.22 3.06 Intr - 170914 170791 124 1 1 35 103 34 0.013 -0.24 3.05 Intr - 172120 172002 119 2 2 20 119 61 0.061 2.58 3.04 Intr - 176298 176278 21 1 0 137 64 12 0.349 1.22 3.03 Intr - 183105 183007 99 1 0 81 68 64 0.722 3.78 3.02 Intr - 192203 191854 350 1 2 85 55 263 0.713 17.70 3.01 Init - 196996 196932 65 1 2 22 36 113 0.303 0.12 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 43430 43254 177 2 0 19 51 145 0.801 1.59 S.002 Init - 100772 100592 181 2 1 57 5 251 0.826 13.05 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:131561949_131762624|GENSCAN_predicted_peptide_1|299_aa XQHLAQVEGSVVAILAFSNKNNVNAIWSPPILTVCASTELRVLHGWAHWIFTTTPRGPFC FSANHHMGPPGHWLQENLIGSGNHQLLDTGQPIERQHPGQVPTLGPISCGPTAVGPMVQV VFLKGAAEPPLCRPRVGGLVTRVKGNTLHLTLYWKEQLHFPWWPIACVCTRCGDGEGRQR REQNGGSESTCSWQEEETKAHLARSGMWRLGKGRSSPQPFRDSSECFSATPGYGAVFKAL FFLREPSGKEPPEGAMDAASCLYLLLSALYQGGRISGFPKRLSVNGSEVSKAACLPGQS >gi568815591r:131561949_131762624|GENSCAN_predicted_CDS_1|900_bp ncacagcatctggcccaggttgagggctcagtagttgcaatattggcattcagtaataag aacaacgtgaatgctatctggagtccaccaatccttaccgtgtgtgctagcactgagcta cgtgttttacatggatgggcccattggatcttcacaacaactccacgagggccattctgc ttctctgccaaccaccacatgggtccccctgggcattggctgcaggagaacctgattggc tcaggtaatcaccagctcttggacactggccaaccaatagagcggcagcatccaggtcag gtgcccacccttggtccaatcagctgtggcccaactgccgtagggcccatggtgcaggtg gtgtttttaaagggggctgcagagccacctctgtgcaggcctcgggttggtggccttgtg acgagagtgaaggggaacactctacacctcaccctttactggaaagagcagcttcacttc ccatggtggcctattgcctgcgtgtgcacacgctgcggggatggggagggcaggcagagg agggaacagaatggaggctccgagtctacctgctcatggcaagaggaggaaactaaagcc cacctggcaagatcggggatgtggagactggggaaaggcaggtctagcccccagcccttc cgtgacagttcagagtgtttctcagctactcccggatatggtgctgtcttcaaggccctc ttctttctgagagagccttctggaaaggagccacctgaaggtgctatggatgcagcgtct tgcctctacctccttctctcagctctgtaccaagggggtcggatttcaggtttccctaaa cgtctcagcgtgaatgggagtgaagtctccaaggctgcctgcctcccagggcagtcgtga >gi568815591r:131561949_131762624|GENSCAN_predicted_peptide_2|321_aa METEAQKGEVIFTRLYPKFVKEMRLRLGISSPEPAAELAESKPVPSSLWALEDPMQFSCT SGTTDLMFFTTANYGSTGSPPVKPSTLVPSLKYLLSDTADTMGFGDLKSPTGLQVLNDYL ADKSYIKGYDVAVFEAVSGPPPADLCHALRWYNHIKSYEKEKAGLPGVKKALSKYGPADV EDTTGSGATDSKDDDDIDLFGSDYEEESEEAKRLREEHLAQYESKKAKKPALVAKSSILL DVKPWDDETDMAKLEECVRSIQADGLVWGSSKLVPVGYGIKKLQIQCVVEDDKVGTDMLE EQITAFEDYVQSMDVAAFNKI >gi568815591r:131561949_131762624|GENSCAN_predicted_CDS_2|966_bp atggaaacagaggctcagaaaggagaagtaattttcactaggctgtaccccaagtttgtg aaagagatgaggctcagacttggtatctccagccctgaacctgcagcagagctggctgag agcaagcctgtgccatcttctctgtgggctctagaagatcctatgcagttctcttgcact tcaggcacaactgatctcatgttcttcactacagcaaactatgggagtacagggagcccc ccagtgaagccatctaccttagttccatctttaaagtacctgctctcggatacagccgac accatgggtttcggagacctgaaaagccccaccggcctccaggtgctcaacgattacctg gcggacaagagctacatcaaggggtatgatgtggcagtatttgaagccgtgtccggccca ccacctgccgacttgtgtcatgccctacgttggtataatcacatcaagtcttacgaaaag gaaaaggccggcctgccaggagtgaagaaagctttgagcaagtatggtcctgccgatgtg gaagacactacaggaagtggagctacagacagtaaagatgatgatgacattgatctcttt ggatccgattatgaggaggaaagtgaagaagcaaagaggctaagggaagaacatcttgca caatatgaatcaaagaaagccaaaaaacctgcacttgttgccaagtcttccatcttacta gatgtgaaaccttgggatgatgagacagatatggcgaaattagaggagtgtgtcagaagc attcaagcagacggcttagtctggggctcatctaaactagttccagtgggatacggaatt aagaaacttcaaatacagtgtgtagttgaagatgataaagttggaacagatatgctggag gagcagatcactgcttttgaggactatgtgcagtccatggatgtggctgctttcaacaag atctaa >gi568815591r:131561949_131762624|GENSCAN_predicted_peptide_3|439_aa MLPGDVTQLKQYKKLESRTKLGDKYRYFACLMRAQSEEHTNEKDMMKAIRLLKEAEEEFW YHQHPQPCIFPDSPRGTSYVTHECYKVPEWCLDDWHPSEKAMYPDYFAKRKQWKKLRRES WKREVKQLQEETPPAGKEECKLHEGMNYVVLTCIPGAIKQYLACSRYSKTPGPRDSCWDN CGDLTSRYAQGSCFDVSPLKITSVSDPGKLQIVFKRLLMSLRPTEFRGPHDKQWPIASPC ANYKKKPFESGQTPVEAPCAPKKQLQASGTPRADDKTEYAERPPPFLHTALQQGTFRGAA LSGDVELAAARITPSLQRGNGQHRKGEQGEEGSCIWALHLQGHTEEELKLLIWNQWVSSP TFTYPSAEELKLPTAPGLQALSSWCPQTTLFTTGTESQTAQHPGISIVITIITIIITITT LITITIITIIIILSLQMRY >gi568815591r:131561949_131762624|GENSCAN_predicted_CDS_3|1320_bp atgttacctggagatgtcacgcagctgaagcagtacaaaaaacttgaaagtcggaccaag ctaggggacaaataccgatactttgcttgtttgatgagagcccagtctgaagaacatacg aacgaaaaggatatgatgaaggccatccggctgctgaaggaggctgaggaagaattctgg taccatcagcatccacagccatgcatcttccctgactctcctaggggtacctcctatgtg acacacgagtgctacaaggtcccagaatggtgcttagatgactggcatccttctgagaag gcaatgtatcctgattactttgccaagagaaaacagtggaagaaactgcggagggaaagc tggaaaagagaggttaagcagctgcaggaggaaacgccacctgccggaaaggaagaatgt aagctccatgaaggcatgaattatgttgtgctcacttgtatccctggtgccattaaacag tacctggcatgtagcaggtattcaaaaacaccaggtccccgcgacagctgttgggataat tgtggggacctgacatcccgttatgctcaggggagctgctttgatgtcagcccattgaaa atcacatcagtctctgacccagggaagcttcaaattgtttttaaaaggctactgatgtct ctgaggcccacagagtttaggggtccacatgacaagcagtggcccatagcaagcccctgt gcaaattacaaaaagaagccttttgagagtggacaaacccctgtggaagcaccatgcgcc cccaagaagcagcttcaagcttctggaacaccgcgggctgatgacaagacagaatacgca gaaaggcctcctcccttcctccacacagctttacaacagggaacattcaggggtgcggcc ctcagtggggacgtggaactggctgcggccagaataactcccagtttgcagagagggaac gggcagcacagaaaaggagagcagggggaggagggatcctgtatctgggccctgcacctc cagggccatactgaagaggagctgaaacttctcatatggaaccagtgggtttcctccccg accttcacctacccgtctgcagaggagctgaaacttcccacagccccaggtctccaggca ctgagcagctggtgtccccagacaactctctttacaactggcactgaaagccagacagcc cagcaccccggcataagtatagtcatcaccatcatcaccattatcatcaccatcaccacc ctcatcaccatcactatcatcaccatcatcatcatcctcagtttacagatgcggtattaa