GENSCAN 1.0 Date run: 6-Nov-116 Time: 12:50:49 Sequence gi568815575f:24365456_24634068 : 268613 bp : 41.94% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3368 3483 116 1 2 37 86 108 0.393 5.35 1.02 Intr + 7436 7580 145 2 1 41 86 114 0.521 5.86 1.03 Intr + 12526 12691 166 0 1 64 30 91 0.021 -0.49 1.04 Intr + 22892 23037 146 0 2 112 85 45 0.001 5.68 1.05 Intr + 42792 42923 132 2 0 49 92 73 0.115 3.72 1.06 Term + 46642 46683 42 0 0 144 48 47 0.592 2.48 1.07 PlyA + 48605 48610 6 1.05 2.04 PlyA - 48651 48646 6 1.05 2.03 Term - 59798 59679 120 2 0 93 53 61 0.793 0.59 2.02 Intr - 61355 61214 142 1 1 55 44 91 0.499 0.83 2.01 Init - 64465 64122 344 1 2 75 75 359 0.854 30.05 2.00 Prom - 70798 70759 40 -5.35 3.00 Prom + 78039 78078 40 -1.95 3.01 Init + 100001 100432 432 1 0 77 26 327 0.001 21.66 3.02 Intr + 115453 115484 32 0 2 76 110 21 0.008 -0.89 3.03 Intr + 120176 120210 35 2 2 59 98 60 0.126 1.05 3.04 Intr + 129287 129428 142 0 1 49 111 109 0.770 7.79 3.05 Intr + 133374 133445 72 0 0 140 97 46 0.997 8.30 3.06 Intr + 137872 138056 185 1 2 106 98 151 0.996 16.41 3.07 Intr + 139754 139843 90 0 0 95 116 44 0.906 6.95 3.08 Intr + 153478 153555 78 2 0 104 53 100 0.832 6.70 3.09 Intr + 160743 160819 77 1 2 104 115 -30 0.763 -0.38 3.10 Intr + 162119 162220 102 1 0 33 88 97 0.753 3.65 3.11 Intr + 162621 162731 111 2 0 33 99 77 0.879 2.96 3.12 Intr + 166202 166315 114 1 0 100 79 56 0.807 5.62 3.13 Intr + 171049 171085 37 0 1 77 110 6 0.039 -1.38 3.14 Term + 173679 173827 149 1 2 67 36 122 0.038 1.98 3.15 PlyA + 176493 176498 6 1.05 4.19 PlyA - 177982 177977 6 1.05 4.18 Term - 179036 178639 398 0 2 58 38 207 0.860 6.95 4.17 Intr - 180743 180584 160 2 1 57 100 117 0.673 8.44 4.16 Intr - 193385 193261 125 0 2 64 49 97 0.146 2.78 4.15 Intr - 196662 196384 279 1 0 75 23 126 0.075 1.33 4.14 Intr - 197092 196988 105 2 0 89 50 151 0.229 10.67 4.13 Intr - 197894 197715 180 0 0 117 55 106 0.786 9.12 4.12 Intr - 198365 198275 91 2 1 86 -13 84 0.027 -3.25 4.11 Intr - 200737 200654 84 1 0 76 63 56 0.026 1.00 4.10 Intr - 209863 209675 189 1 0 48 71 221 0.760 15.26 4.09 Intr - 211889 211731 159 2 0 31 26 160 0.684 3.36 4.08 Intr - 214003 213861 143 2 2 70 75 168 0.850 12.85 4.07 Intr - 221914 221786 129 2 0 73 94 128 0.667 11.65 4.06 Intr - 224719 224465 255 2 0 56 78 199 0.117 12.19 4.05 Intr - 239473 239372 102 2 0 34 111 64 0.383 2.53 4.04 Intr - 242406 242290 117 1 0 121 94 70 0.969 10.42 4.03 Intr - 246635 246582 54 0 0 73 98 45 0.655 2.03 4.02 Intr - 253629 253530 100 0 1 136 94 118 0.997 15.96 4.01 Init - 255487 255293 195 1 0 95 28 107 0.504 4.38 4.00 Prom - 260309 260270 40 -5.55 5.00 Prom + 262676 262715 40 -3.65 5.01 Init + 265188 265219 32 2 2 48 116 13 0.414 -0.63 5.02 Intr + 265801 265925 125 1 2 76 95 46 0.907 3.41 5.03 Term + 266827 267146 320 2 2 64 47 248 0.986 12.36 5.04 PlyA + 267390 267395 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 146986 147044 59 0 2 86 82 33 0.816 -0.74 S.002 Intr + 148024 148244 221 1 2 66 44 178 0.936 8.22 S.003 Term - 174141 173971 171 0 0 86 49 124 0.842 5.14 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:24365456_24634068|GENSCAN_predicted_peptide_1|248_aa MLIACSGLELLEILMAYGHLGQDLQDQVAPAMERGGWKRIFAGEEVKELKAQATGRIIYV DTKIIKNSYCWHSVEEDDNESGPRIFKNTQVTVTEVNQVGLGGWALRISISKFPGVAAAA AGSGTIHILRTCLDANHMVALQDPVWRVAASGLVQTMEDTSRSEIGRERQAYLFLQPTPC QAMLVSSFFPLCVRGNRLWVKSSLVIHPWCLRRFHKCIHIPYQGLAGQSNSRPLWGSDWH LTSDGITI >gi568815575f:24365456_24634068|GENSCAN_predicted_CDS_1|747_bp atgctgattgcatgtagtgggctggagcttcttgagatcctcatggcctatggtcatctg ggccaggacctgcaggatcaagtcgccccagccatggagcgaggagggtggaaaaggatc tttgcaggagaggaggtcaaggagttgaaggcccaggctactggaaggattatctatgtg gataccaaaatcatcaagaattcatactgctggcatagtgttgaagaggatgataatgag tcaggccctagaatcttcaagaatactcaggtaactgtaactgaagtcaaccaagtaggt ctgggtgggtgggccctgagaattagcatttccaagtttccaggtgttgctgctgctgct gctggatcagggaccatccacattttgagaacctgtctggatgcaaatcacatggtggcc ctccaggatccagtctggagagtagctgcttctggtttggtccagacaatggaggacacc agtagatctgagataggaagagagaggcaggcgtacttattcctgcagcccactccctgc caggctatgctggtcagcagcttttttcctctgtgtgtccgtggaaatcgactttgggtt aagagtagtctggttatccatccgtggtgtttgcggaggtttcacaaatgcatacacatc ccttatcaaggacttgctggccagtcaaatagtagacctttgtgggggtcagactggcat ctgacatcagatggcatcaccatctga >gi568815575f:24365456_24634068|GENSCAN_predicted_peptide_2|201_aa MTKKRRNNGRAKKGRGHVQPIRCTKCARCVPKDKAIKKFVIRNIVEAAAIRDISEASVFD AYVLPKLYVKLHYCVSCAIHSKVVRNRSREARKDRTPPPRFRPAGAAPRPPPKPMIWVQN TADIRHVLTGFAAGKPPSLGRSCVGKSGSCPLSRCLQDGCPQVIPVGCINIRQKPPLLCN HVGKCIIRSKLCGADLLAGIH >gi568815575f:24365456_24634068|GENSCAN_predicted_CDS_2|606_bp atgacaaagaaaagaaggaacaatggtcgtgccaaaaagggccgcggccacgtgcagcct attcgctgcactaaatgtgcccgatgcgtgcccaaggacaaggccattaagaaattcgtc attcgaaacatagtggaggccgcagcaatcagggacatttctgaagcgagcgtcttcgat gcctatgtgcttcccaagctgtatgtgaagctacattactgtgtgagttgtgcaattcac agcaaagtagtcaggaatcgatctcgtgaagcccgcaaggaccgaacacccccaccccga tttagacctgcgggtgctgccccacgtcccccaccaaagcccatgatctgggtccagaat actgcagacatcagacatgtgttaactgggtttgctgccggcaagccaccatccttggga aggagttgtgtgggcaaatcaggaagttgccctctcagcagatgtctccaggatggctgc cctcaggttattcctgttggttgtataaatataagacagaagccaccactgctttgcaat cacgtggggaagtgtatcatccgctccaagctctgtggagcagacctcctggccgggatc cactga >gi568815575f:24365456_24634068|GENSCAN_predicted_peptide_3|551_aa MRLFRWLLKQPVPKQIERYSRFSPSPLSIKQFLDFGEYGAKGPHGPGAAAIRAATPDLRS FGVKSVGTAGGAREYPGGSPGARISGRGGAGPGPLAPSLGLTRRHRNGVSLQDPHRRRDS PNLRSLASNLLDTNLTALKTSERDVFEIRVVSGNRTLTAGSYPYRQGRDNACEKTSYMFL RKELPVRLANTMREVNLLPDNLLNRPSVGLVQSWYMQSFLELLEYENKSPEDPQVLDNFL QVLIKVRNRHNDVVPTMAQGVIEYKEKFGFDPFISTNIQYFLDRFYTNRISFRMLINQHT LLFGGDTNPVHPKHIGSIDPTCNVADVVKDAYETAKMLCEQYYLVAPELEVEEFNAKAPD KPIQVVYVPSHLFHMLFELFKNSMRATVELYEDRKEGYPAVKTLVTLGKEDLSIKISDLG GGVPLRKIDRLFNYMYSTAPRPSLEPTRAAPLAGFGYGLPISRLYARYFQGDLKLYSMEG VGTDAVIYLKGQLNSLQVTNFPETRSRLIELSRGLNAVVLSVKKDCLLQVKQRELLSPTC SERGITVIPSA >gi568815575f:24365456_24634068|GENSCAN_predicted_CDS_3|1656_bp atgcggctgttccggtggctgctgaagcagccggtgcccaagcagatcgagcgctactcg cgcttttcgccgtcgccgctctccatcaaacaattcctggacttcggtgagtacggggcc aagggtccccatgggcccggggccgccgccattcgggctgccaccccggatctccgcagc ttcggggtaaagtctgtgggaaccgcaggcggggctagggaatatccggggggctctcct ggggctcggatttcggggcgcggaggggcagggcctggccccctcgcacccagcctggga cttacccggagacacagaaacggcgtctcattgcaggaccctcatcgcaggcgagactcc ccaaacctgcggtcgttagcttcaaatctgcttgacacaaacctcacagcccttaaaact tcagaacgagatgtttttgaaataagggtggtttctgggaacagaacattaacagcaggc agttatccctatcggcaagggagagataatgcatgtgagaaaacttcatatatgtttcta cgaaaggaacttcctgtgcggctggctaacacaatgagagaagttaatcttctgccggat aatttacttaaccgcccttcagtgggattggttcagagttggtatatgcagagttttctt gaacttttagaatatgaaaataagagccctgaggatccacaggtcttggataactttcta caagttctgattaaagtcagaaatagacacaatgatgtggttcctacaatggcacaagga gtgattgaatacaaggagaagtttgggtttgatcctttcattagcactaacatccaatat tttctggatcggttttataccaaccgcatctctttccgcatgcttattaatcagcacaca cttctgtttgggggtgacactaatcctgttcatcctaaacacataggaagtatcgatccc acctgtaacgtggcggatgtggtgaaagatgcatatgaaacagccaagatgctgtgtgaa cagtattacctggtagctccagagctggaagttgaagaattcaatgccaaagcgccagac aaacctattcaggtggtttatgtgccctcacatctgtttcatatgctatttgagttgttc aagaactcaatgagagcgacagttgaactctatgaagacagaaaagagggctaccctgct gttaaaaccctcgttactttgggtaaagaagacttatccattaagatcagtgacctaggt ggtggtgtcccacttcgaaaaatagatcgtctttttaactacatgtattctactgctcct agacccagcctggagcctaccagagctgcccctttggctggatttggttatggtttgcca atttcccgtctgtatgctagatattttcaaggagatctgaaactgtattccatggaagga gtgggtactgatgctgtcatttatttgaagggacaattaaattcccttcaagttaccaat tttccagagacaagatcaagactaatagaactttctagaggactgaatgctgtggtcctc tctgtgaagaaagattgccttctgcaagtcaaacagagagaactgctttctcccacctgc tctgagcgtggcatcacagttattcctagtgcttaa >gi568815575f:24365456_24634068|GENSCAN_predicted_peptide_4|954_aa MGRPIRAEAVGQWKASSGFNAVEGNGVSGGGTERIMCREQPSWCRAPSLGPHSAPFLLFH SFSSRTLTAPAPFADETNCQCQAPHEKLTIAQARLGTPGPFTTWQLASSKPPVQKDTDRP VRVYADGIFDLFHSGHARALMQAKTLFPNSYLLVGGLHRISDIPLMAITWVVEVKGILNY LDDHETVSFVCSDDLTHKFKGFTVMNEAERYEALRHCRYVDEVIRDAPWTLTPEFLEKHK VLLILPHSLSSKIRESGSAAKTTESPKSLSISKRDMHIHPSGDITLLSLQQIDFVAHDDI PYSSAGSDDVYKHIKEAGMFVPTQRTEGISTSDIITRIVRDYDVYARRNLQRGYTAKELN VSFINISHLAQGTVQAVRMHSLPSCDDKPGCSSEMKSNVLHGRVDKSRNLRGDRDLMPEK RYRFQNQVDKMKEKVKNVEERSKEFVNRVEEKSHDLIQKWEEKSREFIGNFLELFGPDGA WKIEVFASIAKFPIRKALLFYPLTSSNMKQSEWSFENKQQIVNPLLRILQRFLFSEETVR SSLKKRFLSGAFPEHPVCNRSCPAPSPTCYPLALFFSQTTDPSDIPYFVYCVIPPHICED CKVNRCLLSNLSLEADVPGEEQPDAAGLIPEAEPSEELGEVQRLLAGQPTQTLASRVTLT LLRKDAEFPRETCTDGVTSALVLPPPPPSKPPVAAAVPSTCYFCVHKGPARCLPPPQLCQ NKIVSNRAKKQGWDCCAISHTERLASGKHAPVCQDLTKAKKCQAQEEQDFPIGQQDNELK WAPICSVYLKPQCGDLDRVPYKVECRARHRKQTLQIQINRREEKRGQAVEISDEPCVFIP SGIQKRKSGEKQDQRRETNSGADLTNVGYMESRRAEVQTHHVPARMTTSGNGEMQRGSVS QWDPATPRAEDSTVSMEGTHRQHVPRRLRVTPYSPKSKQNENQKHPPPLQPIKS >gi568815575f:24365456_24634068|GENSCAN_predicted_CDS_4|2865_bp atgggccggcctattagagctgaagctgtggggcagtggaaagcaagcagcggctttaat gcagtggaaggaaatggagtcagtgggggtgggacagaaaggatcatgtgcagagagcag ccaagttggtgcagagcaccttctctggggccacattctgcccctttcctgcttttccac agtttcagttccaggaccctgactgcacctgccccatttgctgatgaaaccaactgccag tgtcaagcaccccatgaaaaactgaccattgctcaggcccgcttaggaacaccaggccct ttcacaacatggcagcttgcttcttcaaagccaccagtgcagaaagacactgacaggcct gtcagagtatacgccgatggaatatttgacctcttccactcaggtcatgcaagagccctt atgcaagcaaaaacactgtttcccaacagctacttgttggtaggaggtttacacagaatt tcagatattcccttaatggctattacatgggttgtggaagtcaagggcatattgaactac ttggatgaccatgagactgtatcctttgtttgcagtgatgatctcacccacaaattcaaa ggtttcaccgtgatgaatgaagccgagagatacgaagctctcagacactgtcgctacgta gacgaagttatcagagatgctccctggacactcacgccagagtttctggaaaaacacaag gtacttctcattctcccacattctctaagtagcaagattagggagtctggttctgcagcc aagactacggaaagcccaaagtctctctcaatttctaaaagagacatgcacatccatccc agtggtgacatcactcttctctctcttcaacagattgactttgtggctcatgatgacatt ccgtattcctctgctggctctgatgatgtttacaagcacataaaggaagcagggatgttc gttccaacgcagagaacagaaggcatctcaacatcggacatcattaccagaattgttcgt gactatgatgtttatgcccgacgtaacctccagagagggtatacagccaaggaactgaat gtcagctttataaatataagccacttagctcaaggaaccgtgcaggcagttcggatgcac tcactgccttcctgtgacgacaaacctggctgcagttcagagatgaaatctaatgtgttg catgggcgtgtagacaaatcacggaacctcagaggtgacagggacctgatgccagagaag aggtaccgtttccagaaccaagtggacaaaatgaaggaaaaagtcaagaatgtggaggaa agatcaaaggaatttgtgaacagagtggaagaaaagagccatgatctaattcaaaagtgg gaagagaagtcaagggaattcattggcaacttcctagaactgtttggacctgatggagca tggaaaatagaagtttttgcatctattgccaaattccccatcagaaaggcattgctgttt tatcctcttaccagcagtaacatgaagcagtcagaatggtcttttgaaaacaagcagcag attgttaatcctttgctcagaattctccagaggtttctcttctcagaggagactgtacgt tcttcactcaagaagcgttttctcagtggggccttccccgagcaccctgtttgcaatcgc agctgccctgctcccagccccacttgctatcctctcgccctctttttctcacagaccaca gatccatctgacataccatattttgtttattgtgtgattcctcctcatatctgtgaggat tgcaaagttaacagatgccttctctccaatttatctttagaagcagatgttccaggagag gagcagccggatgctgcaggccttatccccgaagcagagccctctgaagagttgggcgag gtgcagagacttctagctggacaacctacacaaaccttagcgtctagagtcactctaacc ttgctaaggaaggatgcagagttccctagagagacttgcactgatggggtgacttctgct ttggtccttccaccacccccaccaagtaaaccacctgtggcagcggctgttccctcaacc tgctacttctgcgtccataagggtcctgcacgttgcctcccaccaccccagctctgccag aataaaattgtttccaatagagctaagaaacaagggtgggattgctgtgccataagtcac actgaacgacttgcttctggcaagcatgctcctgtgtgtcaggaccttaccaaggcaaag aaatgccaggctcaagaggagcaggacttccccattggtcagcaagacaatgaactgaaa tgggctcccatctgctctgtctacctgaagccacagtgtggggacctcgacagggtcccc tacaaagttgagtgcagagccagacacagaaaacagactctccaaattcagatcaatcga cgtgaagaaaaacgaggacaggcggttgagatctcagatgaaccatgtgtgttcatcccc tccggtatccaaaagaggaagagcggcgagaagcaggaccagaggagagaaaccaactct ggagccgatctcacaaatgtgggctacatggaatccaggagggcagaagtacaaacccac catgttcctgctaggatgacgactagtgggaatggggaaatgcaaagaggatctgtctcc caatgggacccggccacccccagggcagaggattcaacagtgtctatggaaggcactcac agacaacatgttccaaggaggcttcgtgtcactccttattcccccaaatccaaacagaat gaaaaccaaaaacaccctccaccactacaacccatcaagagctag >gi568815575f:24365456_24634068|GENSCAN_predicted_peptide_5|158_aa MHITVLYISKSAFHVRQNPSTKKRFRKAFSLRGQTNVHPENTVFFSLMVGSNEGCQEGGV TKSQKDGKNHKHSHKESYSIYIYKVLKQVHLNSGISSKAMGIMNSFVNIFKCIIPDALQV LDDHLQGDPYSLCLLPREMANHTVSHNTKALTKYTSSK >gi568815575f:24365456_24634068|GENSCAN_predicted_CDS_5|477_bp atgcatatcacagtcctctatatttcaaaaagtgccttccatgtaaggcagaaccccagc accaaaaagcggttcagaaaagcattttctttaaggggccaaacgaacgtgcaccctgaa aatacagtttttttctccctgatggttggatccaatgaagggtgccaagaaggtggtgtg accaagagccagaaggatggcaagaatcacaagcatagccacaaggaaagctactctatc tacatttacaaggtattgaagcaggtccatctcaactctggcatctcatcgaaggccatg ggcatcatgaactcatttgtcaatatcttcaagtgcatcattcctgatgcattacaagta ctcgacgatcacctccagggagatccatacagcttgtgcctgctgcctagggagatggcc aatcacactgtgtctcacaacaccaaggccctcaccaagtacaccagctccaagtga