GENSCAN 1.0	Date run: 5-Nov-116	Time: 08:24:49

Sequence gi568815583r:85668044_85869810 : 201767 bp : 45.55% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1679   1787  109  1  1   90   79    11 0.450   0.26
 1.02 Intr +   8933   9055  123  0  0   80   92    77 0.975   7.86
 1.03 Intr +  14115  14169   55  1  1  105   64    53 0.988   2.64
 1.04 Intr +  16698  16830  133  0  1   64   87   109 0.989   9.05
 1.05 Intr +  25234  25408  175  0  1   97   71   275 0.809  26.21
 1.06 Intr +  39976  40043   68  2  2  114  116    21 0.531   6.12
 1.07 Intr +  42536  42602   67  1  1   68   94    39 0.508   0.98
 1.08 Intr +  47745  47880  136  1  1   79  106   113 0.987  11.83
 1.09 Intr +  49247  49359  113  2  2  102   90    84 0.999  10.02
 1.10 Intr +  49970  50116  147  0  0  -34   81   156 0.530   2.91
 1.11 Intr +  51033  51283  251  1  2  115   94   216 0.998  22.06
 1.12 Intr +  53948  54073  126  1  0  115   67   124 0.999  13.88
 1.13 Intr +  54187  54304  118  0  1  104  115   -46 0.983  -0.26
 1.14 Intr +  55029  55277  249  1  0   53  116   229 0.991  19.61
 1.15 Intr +  58367  58443   77  0  2   74   99    10 0.520  -0.07
 1.16 Intr +  59023  59204  182  0  2  112   95   140 0.997  15.77
 1.17 Intr +  59338  59420   83  1  2   41  100    83 0.798   4.08
 1.18 Intr +  62470  62664  195  2  0   82   53    91 0.887   4.39
 1.19 Intr +  66949  67107  159  2  0   68   94   103 0.686   8.86
 1.20 Intr +  67517  67587   71  0  2   43   63    42 0.940  -3.90
 1.21 Intr +  72179  72229   51  1  0   62   96    71 0.825   4.40
 1.22 Intr +  73003  73452  450  0  0   82   95   807 0.608  74.30
 1.23 Intr +  79651  79766  116  0  2   77  102    27 0.118   2.25
 1.24 Intr +  83259  83387  129  0  0  102   26    76 0.011   2.61
 1.25 Intr +  90414  90735  322  0  1   53  100   156 0.312   9.16
 1.26 Intr +  91443  91675  233  2  2   37   84    50 0.509  -3.93
 1.27 Intr +  91875  92061  187  0  1   63   48   134 0.460   6.59
 1.28 Intr +  92652  92748   97  2  1   62   87    61 0.847   2.98
 1.29 Intr +  93160  93286  127  2  1   19   81    96 0.799   1.74
 1.30 Intr +  95405  95507  103  2  1   72   37   124 0.712   5.88
 1.31 Term +  96100  96261  162  0  0   45   40   105 0.750  -0.66
 1.32 PlyA +  97817  97822    6                               1.05

 2.02 PlyA -  98903  98898    6                              -0.45
 2.01 Sngl - 101767  99998 1770  1  0   91   55  2994 0.999 291.02
 2.00 Prom - 108562 108523   40                              -7.26

 3.00 Prom + 114162 114201   40                              -4.66
 3.01 Init + 118276 118336   61  0  1   97   80     8 0.553   2.36
 3.02 Intr + 122623 122711   89  2  2  124   99    40 0.980   8.39
 3.03 Intr + 124234 124257   24  0  0  116   81     4 0.605   0.72
 3.04 Intr + 125460 125639  180  2  0   80    8   111 0.619   2.36
 3.05 Intr + 126829 126993  165  0  0   67   35   145 0.672   7.26
 3.06 Intr + 152287 152320   34  0  1   67   89    16 0.012  -2.60
 3.07 Intr + 164531 164671  141  0  0   10   94   171 0.079  10.22
 3.08 Intr + 166720 166822  103  2  1  103   84    62 0.150   6.53
 3.09 Intr + 170056 170087   32  1  2   90  115   -49 0.021  -4.23
 3.10 Intr + 172500 172568   69  1  0   99   80    41 0.269   3.55
 3.11 Intr + 188862 188903   42  1  0   75  103    62 0.326   4.61
 3.12 Term + 191820 191893   74  1  2   68   43    60 0.122  -2.43
 3.13 PlyA + 194327 194332    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  84160  84467  308  2  2  116   32   195 0.884  11.98

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583r:85668044_85869810|GENSCAN_predicted_peptide_1|1537_aa
ICHRSKQQGFNYCTSAISSPLTKSISLMTISHPGLDTQQQMRPARRISISISPLLLKSKT
LFSLGSSYSSDEEEELHNSRPFHSTFHNTSANLTESITEENYNFLPHSPSKKDSEWKSGT
KVSRTFSYIKNKMSSSKKSKEKEKEKDKIKEKEKDSKDKEKDKKTVNGHTFSSIPVVGPI
SCSQCMKPFTNKDAYTCANCSAFVHKGCRESLASCAKVKMKQPKGSLQAHDTSSLPTVIM
RNKPSQPKERPRSAVLLVDETATTPIFANRRSQQSVSLSKSVSIQNITGVGNDENMSNTW
KFLSHSTDSLNKISKVNESTESLTDEGTDMNEGQLLGDFEIESKQLEAESWSRIIDSKFL
KQQKKDVVKRQEVIYELMQTEFHHVRTLKIMSGVYSQGMMADLLFEQQMVEKLFPCLDEL
ISIHSQFFQRILERKKESLVDKSEKNFLIKRIGDVLVNQFSGENAERLKKTYGKFCGQHN
QSVNYFKDLYAKDKRFQAFVKKKMSSSVVRRLGIPECILLVTQRITKYPVLFQRILQCTK
DNEVEQEDLAQSLSLVKDVIGAVDSKVASYEKKVRLNEIYTKTDSKSIMRMKSGQMFAKE
DLKRKKLVRDGSVFLKNAAGRLKEVQAVLLTDILVFLQEKDQKYIFASLDQKSTVISLKK
LIVREVAHEEKGLFLISMGMTDPEMVEVHASSKEERNSWIQIIQDTINTLNRDEDEGIPS
ENEEEKKMLDTRARELKEQLHQKDQKILLLLEEKEMIFRDMAECSTPLPEDCSPTHSPRV
LFRSNTEEALKGGPLMKSAINEVEILQGLVSGNLGGTLGPTVSSPIEQDVVGPVSLPRRA
ETFGGFDSHQMNASKGGEKEEGDDGQDLRRTESDSGLKKQVVQSVVHLYELLSALQGVVL
QQDSYIEDQKLVLSERALTRSLSRPSSLIEQEKQRSLEKQRQDLANLQKQQAQYLEEKRR
REREWEARERELREREALLAQREEEVQQGQQDLEKEREELQQKKGTYQYDLERLRAAQKQ
LEREQEQLRREAERLSQRQTERDLCQKMERVKNMQTATFNLCQYSLYPRESYLGGRILTK
VFLESSCSLESSETPLSLKFDSVSSGAQWQIIQRVLEQSSSSAVLPPLGRDEALCALGSS
TRTILTSETHPLKIIPFSKESGVMPLRQHQKHFLHCEGAIWILWASTPLSTTEATAESHR
AGAPPGSCSQQSHPARRLRHGMTFSGCPLSEGRLKAPECASRAVPWDHSQVPSGSWPDTG
GVCWVYRVQGAPCSQCPMAGVSVGKPCPKVPPPRGLQDPHQGMGDTRGCLSNCPGVREVL
VRDPSDSGGDSNEQMFSSSLGAILILELREQMGAKTQEDGEGPVTVLHTEMDAALSRTCQ
TGDLKALQVSFSPTPGSVLVAVHAPLGGDSGVNLIGFGDTQTPGDTLRILSAGRETQAAW
YLIRMARPPQVLVGALQQQDLRQTALENDRELDADAHTYEELQQGRPGMPGQRLLGCRPF
YGEPSSLPAFEENSDTELSRQKLDRRSFAEHGNNPGS

>gi568815583r:85668044_85869810|GENSCAN_predicted_CDS_1|4614_bp
atatgtcacagatctaagcagcagggatttaattactgtacatcagccatttcctctcca
ttgacaaaatccatctcattaatgacaatcagccatcctggattggacactcagcagcag
atgcggccagcaaggcggatctccatcagcatctctccgctcctccttaagtctaaaact
ctcttttctcttggctcttcttattcgagtgatgaggaggaggagttgcataattcacgg
cccttccacagtaccttccacaataccagtgctaatctgactgagagtataacagaagag
aactataatttcctgccacatagcccctccaagaaagattctgaatggaagagtggaaca
aaagtcagtcgtacattcagctacatcaagaataaaatgtctagcagcaagaagagcaaa
gaaaaggaaaaagaaaaagataagattaaggagaaggagaaagattctaaagacaaggag
aaagataagaagactgtcaacgggcacactttcagttccattcctgttgtgggtcccatc
agctgtagccagtgtatgaagcccttcaccaacaaagatgcctatacttgtgcaaattgc
agtgcttttgtccacaaaggctgccgagaaagtctagcctcctgtgcaaaggtcaaaatg
aagcagcccaaagggagccttcaggcacatgacacatcatcactgcccacggtcattatg
agaaacaagccctcacagcccaaggagcgtcctcggtccgcagtcctcctggtggatgaa
accgctaccaccccaatatttgccaatagacgatcccagcagagtgtctcgctctccaaa
agtgtctccatacagaacattactggagttggcaatgatgagaacatgtcaaacacctgg
aaattcctgtctcattcaacagactcactaaataaaatcagcaaggtcaatgagtcaaca
gaatcacttactgatgagggtacagacatgaatgaaggacaactactgggagactttgag
attgagtccaaacagctggaagcagagtcttggagtcggataatagacagcaagtttcta
aaacagcaaaagaaagatgtggtcaaacggcaagaagtaatatatgagttgatgcagaca
gagtttcatcatgtccgcactctcaagatcatgagtggtgtgtacagccaggggatgatg
gcggatctgctttttgagcagcagatggtagaaaagctgttcccctgtttggatgagctg
atcagtatccatagccaattcttccagaggattctggagcggaagaaggagtctctggtg
gataaaagtgaaaagaactttctcatcaagaggataggggatgtgcttgtaaatcagttt
tcaggtgagaatgcagaacgtttaaagaagacatatggcaagttttgtgggcaacataac
cagtctgtaaactacttcaaagacctttatgccaaggataagcgttttcaagcctttgta
aagaagaagatgagcagttcagttgttagaaggcttggaattccagagtgcatattgctt
gtaactcagcggattaccaagtacccagttttattccaaagaatattgcagtgtaccaaa
gacaatgaagtggagcaggaagatctagcacagtccttgagcctggtgaaggatgtgatt
ggagctgtagacagcaaagtggcaagttatgaaaagaaagtgcgtctcaatgagatttat
acaaagacagatagcaagtcaatcatgaggatgaagagtggtcagatgtttgccaaggaa
gatttgaaacggaagaagcttgtacgtgatgggagtgtgtttctgaagaatgcagcagga
aggttgaaagaggttcaagcagttcttctcactgacattttagttttccttcaagaaaaa
gaccagaagtacatctttgcatcattggaccagaagtcaacagtgatctctttaaagaag
ctgattgtgagagaagtggcacatgaggagaaaggtttattcctgatcagcatggggatg
acagatccagagatggtagaagtccatgccagctccaaagaggaacgaaacagctggatt
cagatcattcaggacacaatcaacaccctgaacagagatgaagatgaaggaattcctagt
gagaatgaggaagaaaagaaaatgttggacaccagagcccgagaattaaaagaacaactt
caccagaaggaccaaaaaatcctactcttgttggaagagaaggagatgattttccgggac
atggctgagtgcagcacccctctcccagaggattgctccccaacacatagccctagagtt
ctcttccgctccaacacagaagaggctctcaaaggaggacctttaatgaaaagtgcaata
aatgaggtggagatccttcagggtttggtgagtggaaatctgggaggcacacttgggccg
actgtcagcagccccattgagcaagatgtggtcggtcccgtttccctgccccggagagca
gagacctttggaggatttgacagccatcagatgaatgcttcaaaaggaggcgagaaggaa
gagggagatgatggccaagatcttaggagaacggaatcagatagtggcctaaaaaagcag
gttgtccagagcgttgttcatctctacgagctcctcagcgctctgcagggtgtggtgctg
cagcaggacagctacattgaggaccagaaactggtgctgagcgagagggcgctcactcgc
agcttgtcccgcccgagctccctcattgagcaggagaagcagcgcagcctggagaagcag
cgccaggacctggccaacctgcagaagcagcaggcccagtacctcgaggagaagcgcagg
cgcgagcgtgagtgggaagctcgtgagagggagctgcgggagcgggaggccctcctggcc
cagcgcgaggaggaggtgcagcaggggcagcaggacctggaaaaggagcgggaggagctc
cagcagaagaagggcacataccagtatgacctggagcgactgcgtgctgcccagaaacag
cttgagagggaacaggagcagctgcgccgggaggcagagcggctcagccagcggcagaca
gaacgggacctgtgtcagaaaatggagagggttaaaaacatgcaaactgccactttcaac
ctttgccagtattccctctacccccgtgagagctatctggggggaagaatccttaccaag
gtttttttggaaagttcctgttctttggagtcttcggagactcctttgtccttaaaattt
gactctgtttcttcaggggcacagtggcagatcatccagagggtcctggagcagtccagt
tccagcgccgtcctgccacccctggggagggacgaggccctctgtgctcttggctcttct
acacgcaccatactcacctctgagacacacccgctcaagatcattccattttctaaggaa
tctggcgtcatgcccctacgccagcatcagaagcacttcttacactgtgaaggcgctatt
tggattttgtgggccagcacaccactgagcaccacggaagccactgctgagagccaccgt
gctggggcacctccgggttcctgttcacagcagagccaccctgcacgcaggctgcgccac
ggcatgaccttctcgggctgcccgctgtctgaggggcgcctgaaggctcctgagtgtgcc
agcagagccgtcccctgggaccacagccaagtgccttcgggcagctggcctgacacaggc
ggggtctgctgggtctacagggtccaaggagccccatgcagccagtgccccatggcgggc
gtgtcagtgggcaaaccctgcccaaaggtcccacccccaagaggcctccaggacccgcac
caaggcatgggggacactcgtggctgcttaagtaactgcccaggtgtcagggaggtgctg
gtgagggaccccagtgactcaggaggtgacagcaatgaacaaatgttcagcagttctctc
ggggccattcttatcttggagctcagggagcaaatgggggccaaaacgcaagaggatggt
gagggtcccgtcactgtcctccacactgagatggatgcggccctgtccaggacctgccaa
acaggtgatctcaaggctctgcaagtgtccttctccccgacgccagggtcagtcctggtg
gcagtccatgcccctctgggaggtgactcaggtgtcaacttgattggattcggggatacc
cagacgcccggggacacattacgtattctcagtgctggaagggaaacccaggcggcttgg
tacttgattcgaatggccaggccgccccaagtcctagtgggcgccctccaacagcaggac
ctacgacaaacagctctggagaatgacagggaactcgatgcagacgctcacacctatgaa
gagctccaacagggaaggccaggcatgcccgggcagcggcttctgggctgcaggcccttc
tatggagagcccagctcactccctgcctttgaagagaactcggacactgagctgtccagg
cagaagctggacaggaggagctttgcagagcatgggaacaacccagggtcatag

>gi568815583r:85668044_85869810|GENSCAN_predicted_peptide_2|589_aa
MSVSVHETRKSRSSTGSMNVTLFHKASHPDCVLAHLNTLRKHCMFTDVTLWAGDRAFPCH
RAVLAASSRYFEAMFSHGLRESRDDTVNFQDNLHPEVLELLLDFAYSSRIAINEENAESL
LEAGDMLQFHDVRDAAAEFLEKNLFPSNCLGMMLLSDAHQCRRLYEFSWRMCLVHFETVR
QSEDFNSLSKDTLLDLISSDELETEDERVVFEAILQWVKHDLEPRKVHLPELLRSVRLAL
LPSDCLQEAVSSEALLMADERTKLIMDEALRCKTRILQNDGVVTSPCARPRKAGHTLLIL
GGQTFMCDKIYQVDHKAKEIIPKADLPSPRKEFSASAIGCKVYVTGGRGSENGVSKDVWV
YDTVHEEWSKAAPMLIARFGHGSAELENCLYVVGGHTSLAGVFPASPSVSLKQVEKYDPG
ANKWMMVAPLRDGVSNAAVVSAKLKLFVFGGTSIHRDMVSKVQCYDPSENRWTIKAECPQ
PWRYTAAAVLGSQIFIMGGDTEFTAASAYRFDCETNQWTRIGDMTAKRMSCHALASGNKL
YVVGGYFGTQRCKTLDCYDPTSDTWNCITTVPYSLIPTAFVSTWKHLPA

>gi568815583r:85668044_85869810|GENSCAN_predicted_CDS_2|1770_bp
atgtcggtcagtgtccatgagacccgcaagtcgcggagcagcacggggtccatgaacgtc
accctcttccacaaggcctcccacccggactgtgtgctggcccacctcaacacgcttcgc
aagcactgcatgttcaccgacgtcacactctgggcgggcgaccgtgccttcccctgtcac
cgtgccgtgctggccgcctctagccgctattttgaggccatgttcagccatggccttcgg
gagagccgggatgacactgtcaacttccaggacaacctgcacccggaggtgctggagctg
ctgctggactttgcctactcctcacgcatcgccatcaacgaggagaacgctgagtcactg
ctggaggcaggcgacatgctgcagttccacgatgtgcgggatgctgccgccgagttcctg
gagaagaaccttttcccctccaactgcctgggcatgatgctgctctcggacgcccaccag
tgccgccggctgtatgagttctcctggcgcatgtgcctggtgcactttgagacggtgagg
cagagcgaggacttcaacagcctgtccaaggacacactgctggacctcatctcgagtgat
gagctggagaccgaggacgagcgggtggtcttcgaggccatcctccagtgggtgaagcac
gacctggagccacggaaggtccacttgcccgagctcctccgcagcgtgcgtctggccttg
ctgccgtccgactgcctgcaggaggccgtctccagcgaggccctcctcatggcagacgag
cgcaccaagcttatcatggatgaggccctgcgctgcaagaccaggatcctgcagaatgat
ggcgtggtcaccagcccctgtgcccggccacgcaaggcgggccacacgctactcatcctg
gggggccagaccttcatgtgtgacaagatctaccaggtggaccacaaggccaaggagatc
atccccaaggccgacctgcccagcccccggaaggagttcagcgcctcagcgatcggctgc
aaggtctatgtgacggggggcaggggctccgagaacggggtctccaaggatgtctgggtg
tacgacaccgtacatgaggaatggtccaaggcggcgcccatgctgattgcccgctttggc
catggctcagctgagctggagaactgcctctatgtggtggggggacacacatccctggca
ggggtcttcccggcctcgccttctgtctccctgaaacaagtggagaaatacgaccctggg
gccaacaagtggatgatggtggcccccttgcgggatggcgtcagcaatgccgcagtggtg
agtgccaagctgaagctctttgttttcggaggaaccagcatccaccgggacatggtgtcc
aaggtccagtgctatgacccctcggagaacaggtggacgatcaaggccgagtgcccccag
ccttggcggtacacagccgctgccgtcctgggcagccagatcttcatcatgggaggtgac
acggaattcacagccgcctcggcctaccgctttgactgtgagaccaaccagtggacgcgg
attggggacatgactgccaagcgcatgtcctgccatgccctggcttccggcaacaagctc
tatgtggtcgggggctactttgggacccagaggtgtaagactctggactgctatgacccc
acttcagatacatggaactgcatcaccacagtgccctactcacttatccccacggccttt
gtcagcacctggaagcacctgcccgcgtga

>gi568815583r:85668044_85869810|GENSCAN_predicted_peptide_3|337_aa
MAKPGCCFLKGGESLAKTSPGSRSEGSSYGMVKRRSWPRALVTLKQEVAQLVPTKCFLTP
TQAFTNMPGILTAQISTLSISSSYEWGDWDPGSQRAWPHNGHWNLVQDTDRGILPPLLAK
TLSLRYSLGAANKLVSPAFPPRLPARRAAANQRHRLCDGWAAEPFSAAAEKKGGHVKWVD
LNVGGNKGTKSYLITTATTITTIIIRMEVMRWGFLALLFTACEELCAAARPGPTFMDENL
PEREFGIAQHLQPQGYSADALSSLIMPLGPGGWVCSESLPGQAAGADSRTVRFEELFSAV
RKLGTKKDTINPTWTCGAVADEDRSEDAGGSMLPFDL

>gi568815583r:85668044_85869810|GENSCAN_predicted_CDS_3|1014_bp
atggccaaacctggatgttgcttcctgaagggtggggagagcctggccaaaacgtcccca
ggttccaggtctgagggctctagctatgggatggtgaagagacggagttggcctcgggcc
ttggtcacgctgaaacaagaagtggcccagcttgttccaactaagtgtttcctgaccccc
acgcaagccttcaccaacatgccaggaatcctcacagctcagatcagcacactgagcatc
agttcctcctatgaatggggagactgggacccagggtcccagagggcttggccccataac
ggtcactggaatctggtccaggatacagaccgtggaatcctgcctcctctcctggcaaaa
acgctctcgctgcgatacagcctaggcgccgccaacaaactagtttctccggccttcccg
ccccgcctcccggcccgaagagcagcagccaatcagaggcatcgcctctgtgacggatgg
gcagccgaacccttcagtgcagccgcagaaaagaagggggggcatgtgaagtgggtggat
ttgaacgttggtggaaataaaggaactaagagttacctcatcaccactgccaccaccatc
accaccatcatcatcagaatggaggtgatgcggtggggcttcctggcgctgctcttcacg
gcttgtgaggagctctgtgcagcagctaggccaggccctaccttcatggatgaaaacctc
ccagagagggagttcggaatagcccaacacctacagcctcagggttattctgcagatgcc
ttgagcagcctcatcatgccgctgggcccaggaggctgggtctgttcagaatctctgccc
ggccaggcagcaggcgctgactctcgcactgtccggttcgaagagctgttttctgcagtt
aggaagctgggaaccaagaaagacaccatcaaccccacatggacctgcggggcagtggct
gatgaagacaggagtgaggatgctgggggcagcatgttgccctttgacctctga