GENSCAN 1.0 Date run: 3-Nov-116 Time: 00:50:48

Sequence gi568815596r:20151588_20427360 : 275773 bp : 44.51% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -    432    427    6                               1.05
 1.02 Term -   4252   4046  207  1  0   21   43   288 0.622  14.94
 1.01 Init -   4470   4279  192  0  0   85   50   180 0.874  12.92
 1.00 Prom -  10082  10043   40                              -4.36

 2.07 PlyA -  10147  10142    6                               1.05
 2.06 Term -  17346  16969  378  0  0   63   32   209 0.495   7.59
 2.05 Intr -  23059  22958  102  1  0   73   56    66 0.032   2.27
 2.04 Intr -  26447  26300  148  1  1   82   -6    92 0.010  -0.56
 2.03 Intr -  33010  32824  187  2  1   37  115    79 0.033   4.25
 2.02 Intr -  35696  35554  143  1  2   76   99    25 0.057   2.40
 2.01 Init -  44976  44900   77  0  2   36   57    92 0.043   1.48
 2.00 Prom -  48095  48056   40                              -7.06

 3.08 PlyA -  49237  49232    6                               1.05
 3.07 Term -  51348  51179  170  1  2   64   48   297 0.999  21.34
 3.06 Intr -  51635  51500  136  2  1  119  105   169 0.999  21.84
 3.05 Intr -  52704  52226  479  1  2  120  109   238 0.998  21.87
 3.04 Intr -  53837  53756   82  2  1   80   94    89 0.827   7.91
 3.03 Intr -  59048  58928  121  1  1   37   72    67 0.254   0.50
 3.02 Intr -  72121  71943  179  1  2   42   64   151 0.965   6.82
 3.01 Init -  73280  73215   66  2  0   93  117   237 0.999  26.17
 3.00 Prom -  86096  86057   40                              -2.76

 4.20 PlyA -  86204  86199    6                               1.05
 4.19 Term -  88941  88714  228  0  0    2   44   246 0.671   8.23
 4.18 Intr - 102427 102235  193  0  1   55   85   172 0.928  13.09
 4.17 Intr - 103397 103276  122  2  2   39   72    66 0.947  -0.61
 4.16 Intr - 103754 103629  126  2  0   92   75    26 0.841   2.58
 4.15 Intr - 104583 104446  138  0  0  112   55    94 0.998   9.16
 4.14 Intr - 106784 106656  129  2  0   59   91    45 0.427   2.79
 4.13 Intr - 110055 109969   87  0  0   91   -1   122 0.486   3.77
 4.12 Intr - 111049 111043    7  0  1   92   92     0 0.052  -5.46
 4.11 Intr - 111873 111606  268  1  1   57   82   106 0.057   3.49
 4.10 Intr - 131899 131756  144  2  0   37  119    43 0.207   2.55
 4.09 Intr - 139203 139065  139  0  1   62   74    74 0.333   3.44
 4.08 Intr - 142931 142789  143  0  2   -5   96    75 0.428  -0.93
 4.07 Intr - 146091 145966  126  1  0   60   92    62 0.415   4.45
 4.06 Intr - 156484 156391   94  1  1   68   95    51 0.868   3.34
 4.05 Intr - 156997 156727  271  0  1   69   98   184 0.719  15.04
 4.04 Intr - 160076 159907  170  2  2   50   60   170 0.947   9.14
 4.03 Intr - 167058 166950  109  2  1   68   99    31 0.045   2.49
 4.02 Intr - 199692 199585  108  2  0   49   90    45 0.002   0.20
 4.01 Init - 213776 213724   53  2  2   95   91    43 0.044   4.10
 4.00 Prom - 215590 215551   40                              -5.26

 5.00 Prom + 219637 219676   40                              -3.56
 5.01 Init + 226634 226850  217  1  1   54   68   102 0.186   3.66
 5.02 Intr + 235119 235330  212  1  2   34    3   185 0.017   3.23
 5.03 Intr + 238995 239173  179  2  2   55   70   101 0.021   3.72
 5.04 Intr + 239553 239641   89  0  2   23  121    31 0.047  -0.49
 5.05 Term + 244781 244971  191  0  2   95   37   141 0.550   7.31
 5.06 PlyA + 245630 245635    6                               1.05

 6.05 PlyA - 246006 246001    6                               1.05
 6.04 Term - 247871 247851   21  2  0  104   44    21 0.010  -2.39
 6.03 Intr - 260055 259955  101  1  2   76   88    74 0.774   6.03
 6.02 Intr - 263244 263058  187  0  1   37   67    81 0.042   0.16
 6.01 Init - 270931 270587  345  1  0   56   86   144 0.222   8.22

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  76817  76611  207  2  0   82   38   146 0.970   6.34
S.002 Term - 100129  99998  132  1  0   61   32   124 0.823   2.19
S.003 Init + 116126 116153   28  1  1   76   86    14 0.820  -0.24
S.004 Init + 239446 239641  196  0  1   92  121    54 0.862   8.52

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:20151588_20427360|GENSCAN_predicted_peptide_1|132_aa
MLSKGPLQYAQVFGCKKRATAVAPGKHSNGLIKVNGWHLEMIELHMLQYKLLEPLLLWAK
SDLLGGSHVAQICAVHQSISKALVAYYQKYMDEASKKDIKDTLIQNDQTLLVAGLLRCES
KKFGSSGACARY

>gi568815596r:20151588_20427360|GENSCAN_predicted_CDS_1|399_bp
atgctgtccaagggcccactgcagtatgcgcaggtctttggatgcaagaaaagggccaca
gctgtggcgcccggcaaacacagcaatggcctcatcaaggtgaatgggtggcacctggag
atgatcgagctgcatatgctacaatacaagctgctggaaccacttctgctctgggcaaag
agcgatttgctgggtggtagtcatgtggcccagatttgtgctgtccatcagtccatctcc
aaagccctggtggcctattaccagaaatatatggatgaggcttccaagaaggatatcaaa
gacaccctcatccagaatgaccagaccttgctggtagctggcctccttcgctgtgagtcc
aaaaagtttggaagctctggtgcctgtgctcgctactag

>gi568815596r:20151588_20427360|GENSCAN_predicted_peptide_2|344_aa
MCQAQCQALDIASDPGPVLTFKAVARASPLEKECRAAGQQAQGPAWYLGCFPSLLGPQIS
SSGNQVQKGAKCTEGSSELLPQNLLGGLWLTTSKEEASGSGCSQQEGLLMSPTLLKAPGK
LDLKKGLGNPVLSPSSLSSSKQALRISSEKLTSPELLVAENEASCVALPATLTFQGQSSC
PYVRSLVAASPKESGDCHHSLMLRDVAAWGMALCKLEALGQIPAHQPGEAPIPCPITADW
LRPLPGWSYSADCTFYPLMTSPTRKFAQVQNSEPHEEAAEEKDDSSGALRPSGSNPSCPV
PTLQPEFPIRHRALQQEDLPARASGPGRSICLPLAWMSARADAI

>gi568815596r:20151588_20427360|GENSCAN_predicted_CDS_2|1035_bp
atgtgccaggcacagtgccaggccctggacattgccagtgacccaggccccgtccttacc
ttcaaggctgttgcgagggcaagccctctggagaaggagtgcagagctgcaggccagcag
gcccagggtcccgcctggtatcttgggtgctttccttccctcctggggcctcagatctcc
tcatctggaaatcaagttcagaaaggtgcaaaatgtactgagggctcttctgagttgctg
ccgcagaacctgcttggtggcctgtggctcacgacgtcaaaggaggaggcatcgggaagt
ggatgttcccagcaagagggtcttctgatgtcccccaccctcctcaaagcaccagggaaa
ttggatcttaaaaagggccttgggaacccagtgctgagcccttcaagcctctcctcatcc
aaacaggctctcagaatatcctctgagaaactcacttctcctgagctgttggtggctgaa
aatgaggcctcctgtgtggccctcccagccactctcaccttccagggacagtcctcctgt
ccctacgtgcgcagcctggtagctgcatcacccaaggaatctggtgactgccaccactct
ctgatgctcagagatgttgctgcatggggcatggccctgtgcaagctggaggcactgggc
cagatccctgcccaccaaccgggcgaggctccgatcccctgccccatcacagctgactgg
ctgaggcccctccctggctggtcctacagcgctgactgcacgttctacccactcatgact
tctcccacaaggaagtttgcccaggtgcagaactcagagccacatgaagaggctgcagaa
gaaaaggacgactcgtctggggctcttagaccctcaggaagcaacccaagctgccctgtg
cccacactccagcctgaatttcctatcagacacagagctttacagcaggaagacctgccc
gccagagcctcggggcccggccgctccatctgcctgcccttggcctggatgtctgcccga
gcagatgcaatttaa

>gi568815596r:20151588_20427360|GENSCAN_predicted_peptide_3|410_aa
MRRAALWLWLCALALSLQPALPPRNAGLDGGWDLGRSQVSARAARCPSLPPEFASPAPAP
GAGVPGSLLRAGSAIPRDAEGRSRGFRGHTGQVQLTDQLPFPAPPRMTANQRPPALPKLV
PGQIVATNLPPEDQDGSGDDSDNFSGSGAGALQDITLSQQTPSTWKDTQLLTAIPTSPEP
TGLEATAASTSTLPAGEGPKEGEAVVLPEVEPGLTAREQEATPRPRETTQLPTTHLASTT
TATTAQEPATSHPHRDMQPGHHETSTPAGPSQADLHTPHTEDGGPSATERAAEDGASSQL
PAAEGSGEQDFTFETSGENTAVVAVEPDRRNQSPVDQGATGASQGLLDRKEVLGGVIAGG
LVGLIFAVCLVGFMLYRMKKKDEGSYSLEEPKQANGGAYQKPTKQEEFYA

>gi568815596r:20151588_20427360|GENSCAN_predicted_CDS_3|1233_bp
atgaggcgcgcggcgctctggctctggctgtgcgcgctggcgctgagcctgcagccggcc
ctgccgccccgtaacgccggccttgacgggggctgggacctgggccgcagccaggtgtca
gcgcgggccgcccggtgcccgagtctgccgcccgagttcgcctccccagcccccgcgccc
ggtgctggcgtcccggggtccctcctgcgggccggctcggccatccccagggatgcagag
gggagaagccggggatttcgtggccacacggggcaggtacagctgacagatcagcttccc
tttcctgctccaccacgaatgactgcgaatcagaggccccctgcgctgcccaagttagtg
cctgggcaaattgtggctactaatttgccccctgaagatcaagatggctctggggatgac
tctgacaacttctccggctcaggtgcaggtgctttgcaagatatcaccttgtcacagcag
accccctccacttggaaggacacgcagctcctgacggctattcccacgtctccagaaccc
accggcctggaggctacagctgcctccacctccaccctgccggctggagaggggcccaag
gagggagaggctgtagtcctgccagaagtggagcctggcctcaccgcccgggagcaggag
gccaccccccgacccagggagaccacacagctcccgaccactcatctggcctcaacgacc
acagccaccacggcccaggagcccgccacctcccacccccacagggacatgcagcctggc
caccatgagacctcaacccctgcaggacccagccaagctgaccttcacactccccacaca
gaggatggaggtccttctgccaccgagagggctgctgaggatggagcctccagtcagctc
ccagcagcagagggctctggggagcaggacttcacctttgaaacctcgggggagaatacg
gctgtagtggccgtggagcctgaccgccggaaccagtccccagtggatcagggggccacg
ggggcctcacagggcctcctggacaggaaagaggtgctgggaggggtcattgccggaggc
ctcgtggggctcatctttgctgtgtgcctggtgggtttcatgctgtaccgcatgaagaag
aaggacgaaggcagctactccttggaggagccgaaacaagccaacggcggggcctaccag
aagcccaccaaacaggaggaattctatgcctga

>gi568815596r:20151588_20427360|GENSCAN_predicted_peptide_4|884_aa
MTLMAGMWVQMQMSALYSTLEAKCQEYPTVGGAGADTRTPSWGKTVSNRVGSRGFCLPKS
FGNLMIQQKMDKKAYFLGMMNGERLHGELLGTRDAETDGPEKGDQKGKASPFEEDQNRDL
KQGDDDDSKINGRGLPNGMDADCKDFNRTPGSRQASPTEVVERLGPNTNPSEGLGPLPNP
TANKPLVEEFSNPETQNLDAMEQVGLESLQFDYPGNQVPMDSSGATVGLFDYNSQQQLFQ
RTNALTVQQLTAAQQQQYALAAAQQPHIAGVFSAGLAPAAFVPNPYIISAAPPGTDPYTA
AGLAAAATLAGPAVVPPQYYGVPWGVYPANLFQQQAAAAANNTASQQAASQAQPGQQQVL
RAGAGQRPLTPNQGQQGQQAESLAAAAAANPTLAFGQGLATGMPGYQVLAPTAYYDQTGA
LVVGPGARTGLGAPVRLMAPTPVLISSAAAQAGGLTNGSGRYISAAPGAEAKYRSASSTS
SLFSSSSQLFPPSRLRYNRSDIMPSGRSRLLEDFRNNRFPNLQLRDLIGHIVEFSQDQHG
SSFKEIFSSNTYGAVLSYDNSAFSGIPSEIYLRFGSLDQKLALATRIRGHVLPLALQMYG
CRVIQKALESISSDQQSEMVKELDGHVLKCVKDQNGNHVVQKCIECVQPQSLQFIIDAFK
GQVFVLSTHPYGCRVIQRILEHCTAEQTLPILEELHQHTEQLVQDQYGNYVIQHVLEHGR
PEDKSKIVSEIRGKVLALSQHKFASNVVEKCVTHASRAERALLIDEVCCQNDGPHSALYT
MMKDQYANYVVQKMIDMAEPAQRKIIMHKQPPPPNNGFKKKDKGGIHLTATCPQSELDAE
TVKSILAEYKIHNADVTLRSDATADDLIDVVEGNRVYIPCIYVK

>gi568815596r:20151588_20427360|GENSCAN_predicted_CDS_4|2655_bp
atgacgctgatggcaggcatgtgggtgcaaatgcagatgagtgcattgtacagcactctg
gaagcgaaatgccaggagtaccccacggtgggcggcgcgggcgcagacacccgcacaccc
agttgggggaaaacagtgagcaaccgggtaggaagccgaggcttttgcctaccaaaaagt
tttgggaacctgatgattcaacaaaagatggacaaaaaggcatatttcttggggatgatg
aatggagagagactgcatggggagcttctcggcactagagatgctgaaacagatggacct
gagaaaggagatcaaaaaggcaaggcttctccatttgaggaggaccaaaacagagatctt
aaacaaggagatgatgatgattctaaaataaatggcagaggtttgccaaatggaatggat
gccgattgcaaagattttaatcgtactcctggaagtcgtcaagcctctccaactgaagta
gttgagcgcttgggccccaatactaatccctcagaaggactggggcctcttcctaatcct
acagctaataaaccacttgttgaagaattttcaaatcctgaaactcagaatctggatgcc
atggaacaagttggtctggaatccttacagtttgactatcctggtaatcaggtaccaatg
gactcttcaggagctactgtaggcctttttgactacaattcccagcagcagctctttcag
aggactaatgcactaacagttcaacagttaactgcagctcaacagcagcaatatgcatta
gcagcagctcagcagccacatatagctggtgtattctcagcaggccttgctccagctgca
tttgtgccaaatccatacattattagtgctgctcctccagggaccgatccgtatactgca
gcaggattggctgcagcagctacattagcaggtccagcagtggttccacctcagtattac
ggcgttccatggggggtgtatccagccaacttatttcagcagcaagctgcagctgcggca
aataacacagccagtcagcaagcagcatcacaagctcagcctggacagcaacaggttctc
cgtgctggagcaggtcagcgtcctcttactcccaatcagggtcagcaagggcagcaagca
gaatcacttgcggcagctgcagcagcaaatccaacattggcttttggtcagggtcttgct
actggcatgccaggctatcaagtactagctccaactgcctattatgatcagactggtgcc
ttagtggttggccctggagcaaggactggccttggagctccagttcggttaatggctcca
acacctgttttaattagttcagcagcagcacaagctggaggactgacaaatggtagtggt
cgatatatctctgcagcacctggagcagaagcaaaatatcgaagtgcttcaagcacttcc
agtctatttagctccagcagccagctctttcctccttcccggcttcggtataataggtct
gatattatgccttctggccgcagtagattattggaagatttcagaaacaaccgcttccca
aaccttcagcttagagacttgattggacatatagttgagttttctcaagaccagcatggt
tctagctttaaggagatctttagcagcaacacctatggagctgtcctctcctatgataac
agtgctttttctggaataccttctgaaatctacctgaggtttgggagtctggatcaaaaa
ttagccctggctactcgtattcgtggtcatgttctacccttagccttgcagatgtatggc
tgccgcgttattcagaaagcattagaatctatttcttctgaccagcagagtgaaatggta
aaggagctggatggtcatgtgctcaaatgtgtgaaagatcagaatggaaaccatgttgta
caaaaatgtatcgaatgtgttcagccacagtcactacagttcatcattgatgctttcaag
ggacaagtatttgtgctttcaactcatccttatggctgcagagtaattcagcgcatccta
gagcattgcactgcagaacagaccttacctatcttagaagaactccaccaacatacagag
cagttggtacaggatcagtatggcaattatgttattcagcatgtactggaacacggtcga
cctgaagacaagagcaaaattgtttccgaaatcaggggaaaggttttagccctgagtcaa
cacaaatttgccagcaatgtagtagaaaagtgtgttactcatgcctcccgtgctgagaga
gctttactgattgacgaggtttgctgccagaatgatggtcctcacagtgccttatacacc
atgatgaaggaccagtatgccaattacgtggttcaaaagatgattgatatggctgaacct
gctcagagaaagataatcatgcacaagcaaccaccgccccccaacaatggctttaagaag
aaggacaagggaggcattcatctcacagccacttgcccacagagtgagctggatgctgaa
actgtgaagagcattctggctgaatacaagattcataatgccgatgtgactctacgtagt
gatgctacagctgatgacctcattgatgtggtggaaggaaacagagtttatatcccctgt
atctatgttaaataa

>gi568815596r:20151588_20427360|GENSCAN_predicted_peptide_5|295_aa
MPTEGKGRKQDWADGESELQYSFNREPSDPPGSCKTGTTLQSCASRGNGSGSLCSHIDGS
QDVDLSGKGMSLAFLVIFTYLLVARLATIASVSLSFAEHAMVPFYPGCASLPPAVLKNVA
VTSILLLMVANCWSLKLATMLTNPTRQCDWQGRAAQRGFLGEESGAKPPRADAACGNTAM
WSSMNNPDTKKGKACLRHKEICHNAVFECTVWYRAQTAAKEKALQMRAWLSQGSHLHSRH
HTPGLSLLVLAPVINHPQTEFLNIFLFLLSGFLVYFPLVHFQCQPRCLQLAALHL

>gi568815596r:20151588_20427360|GENSCAN_predicted_CDS_5|888_bp
atgcctacggaagggaaggggaggaagcaggattgggcagatggagaaagtgagctgcag
tacagcttcaacagagagccctctgaccctcccgggagctgcaaaactgggaccaccctg
cagagttgtgccagccggggcaacgggtctgggtctttgtgttctcacattgatgggtca
caggatgtggacctttctgggaaaggaatgagcttggccttcctggtcatcttcacatat
ttgctggtggccagactagctaccattgcttctgtttctctgagctttgctgagcatgca
atggttcccttttaccctggctgtgcctcactgccccctgctgtgctcaagaatgtggct
gtcaccagcatcctcctgctgatggtggctaactgctggagcttgaagctggccaccatg
ctgacaaatcccaccaggcagtgtgactggcaaggcagagcagcccaacgaggcttcctg
ggagaggagtctggagccaagcctccaagggcagatgcagcctgtggcaacacagctatg
tggagcagcatgaacaatccagacaccaaaaagggcaaggcatgtctgcgccacaaggaa
atatgtcacaacgctgtgttcgaatgcacagtttggtaccgtgcccaaacagcagcgaag
gaaaaagccttgcaaatgcgtgcctggctttcccaaggttcccaccttcattcccgccat
cacactcctggcttatctctactggtgctggcgcccgtcatcaaccatccccagacagag
ttcctaaacatcttcctcttcctgctcagtggcttcctggtgtacttcccacttgtccac
ttccagtgccagcccaggtgtttgcagctggccgctctacatctctag

>gi568815596r:20151588_20427360|GENSCAN_predicted_peptide_6|217_aa
MEVGADSKGLALSSEYLGAIMDQGLTPILGKFFKNSSVQCNLDSPVWDGSIVPQIWEVLG
HFPGPEVPCPHNPSESWGSAIQASLSRSPAFPNSHMCNGNAASQKQGPLALCLHRSTQMK
RNQKNNSGNMKKQGSLAPTKDQTSSPAMDPNQDEISELPEKEFRMLIIKLIKKAPKKEKV
VDTVLAPVWPMQLLHYVILELGLKLEYYAAGACVHDS

>gi568815596r:20151588_20427360|GENSCAN_predicted_CDS_6|654_bp
atggaagttggtgctgacagcaaggggctggctctatcctctgagtatctgggagccatc
atggatcaaggtctcactcccatattaggaaaattctttaaaaattcctccgtgcagtgc
aatcttgattctccagtgtgggatgggagcattgtgccccagatatgggaagtcctgggc
cacttccctggaccagaagtgccgtgtccccacaacccatcagagagctggggaagtgcc
atccaggcgagcctgagcaggtccccggcctttcccaattcccacatgtgcaatgggaat
gcagccagtcagaagcaagggcctctggctctgtgcctccacaggtcaacccaaatgaaa
aggaaccagaaaaacaattctggtaatatgaaaaaacaaggttctttagcacccacaaaa
gatcaaaccagctcaccagcaatggatccaaaccaagatgaaatatctgaattgccagaa
aaagaattcagaatgttgattattaagctaatcaagaaggcaccaaagaaagagaaggta
gttgatactgtgctcgcccccgtgtggcccatgcagctactgcactatgtcatcttggaa
ctgggactaaagctggagtattatgctgcaggcgcatgtgtccatgacagctag