GENSCAN 1.0 Date run: 6-Nov-116 Time: 23:55:36 Sequence gi568815596f:182978934_183195655 : 216722 bp : 38.44% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.09 Intr - 2443 2311 133 0 1 -6 98 133 0.543 4.63 1.08 Intr - 3994 3888 107 1 2 73 97 100 0.508 7.49 1.07 Intr - 4449 4353 97 2 1 75 86 39 0.204 1.49 1.06 Intr - 23336 23194 143 2 2 81 71 55 0.334 1.33 1.05 Intr - 24392 24300 93 2 0 86 95 81 0.402 7.84 1.04 Intr - 44983 44873 111 1 0 86 101 114 0.993 12.16 1.03 Intr - 59145 59059 87 0 0 2 91 144 0.022 5.35 1.02 Intr - 59685 59424 262 2 1 24 23 239 0.006 7.57 1.01 Init - 67468 67341 128 1 2 54 115 120 0.499 10.98 1.00 Prom - 71529 71490 40 -4.65 2.10 PlyA - 72176 72171 6 1.05 2.09 Term - 73644 72767 878 1 2 80 28 451 0.067 30.05 2.08 Intr - 74333 74176 158 1 2 10 87 109 0.011 1.73 2.07 Intr - 81734 81530 205 0 1 68 105 74 0.152 4.44 2.06 Intr - 82131 82048 84 1 0 142 78 64 0.744 9.67 2.05 Intr - 90516 90443 74 2 2 74 39 64 0.026 -1.77 2.04 Intr - 92433 92111 323 0 2 -29 5 390 0.104 12.73 2.03 Intr - 92614 92458 157 0 1 -33 -2 265 0.429 4.49 2.02 Intr - 93131 92734 398 2 2 -13 19 589 0.337 34.35 2.01 Init - 93553 93230 324 1 0 86 87 290 0.679 26.08 2.00 Prom - 93972 93933 40 -9.25 3.00 Prom + 94031 94070 40 -7.85 3.01 Init + 96972 97092 121 2 1 88 34 100 0.608 5.00 3.02 Intr + 99987 100226 240 1 0 70 95 195 0.612 15.00 3.03 Intr + 104575 104621 47 2 2 67 93 62 0.960 1.81 3.04 Intr + 108107 108259 153 1 0 106 87 70 0.947 8.05 3.05 Term + 116498 116725 228 1 0 105 44 109 0.956 3.85 3.06 PlyA + 120673 120678 6 1.05 4.00 Prom + 126040 126079 40 -4.35 4.01 Init + 129838 129886 49 0 1 86 58 28 0.004 -1.24 4.02 Intr + 139658 139809 152 0 2 69 76 82 0.022 4.06 4.03 Intr + 145248 145494 247 2 1 28 79 148 0.026 4.01 4.04 Intr + 149354 149524 171 0 0 51 96 68 0.049 2.89 4.05 Intr + 151485 151612 128 1 2 90 90 60 0.261 5.88 4.06 Intr + 154633 154690 58 0 1 73 111 33 0.118 1.64 4.07 Intr + 172575 172716 142 1 1 34 82 95 0.056 1.99 4.08 Intr + 174729 174987 259 0 1 77 68 150 0.134 8.44 4.09 Intr + 179350 179478 129 0 0 52 106 112 0.748 9.37 4.10 Intr + 180555 180719 165 2 0 49 68 79 0.520 1.24 4.11 Intr + 193028 193115 88 1 1 93 84 77 0.034 6.42 4.12 Term + 193439 193464 26 0 2 106 47 26 0.021 -2.29 4.13 PlyA + 194161 194166 6 1.05 5.03 PlyA - 194522 194517 6 1.05 5.02 Term - 199916 199751 166 1 1 50 43 136 0.679 1.71 5.01 Intr - 200402 200255 148 0 1 57 66 122 0.842 5.27 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 50521 50519 3 1 0 85 115 0 0.860 2.45 S.002 Term - 92433 92107 327 0 0 -29 47 402 0.807 18.12 S.003 Sngl + 126096 126623 528 2 0 55 48 163 0.935 4.82 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:182978934_183195655|GENSCAN_predicted_peptide_1|387_aa MVQEASEAIGQHQSSDAKLIRSGKESLREPWARVPGGLGVAARRSGRSGPGPGPERRRRG HAEGPSAAVPRPASRGRRRCRSGDRGPVRGCGKRPRSQPPETPPPPPPPPHLVEQPGKAA RWGLGRRAPGPSQQKLAEKLTILNDRGVGMLTRLYNIKKACGDPKAKPSYLIDKNLESAV KFIVRKFPAVETRNNNQQLAQLQKEKSEILKNLALYYFTFVDVMEFKTVNFDLTKNYLDL IITYTTLMILLSRIEERKAIIGLYNYAHEMTHGASGSMHRERRKFLRSALKELATVLSDQ PGLLGPKALFVFMALSFARDEIIWLLRHADNMPKKSADDFIDKHIAELIFYMEELRAHVR KYGPVMQRYYVQYLSGFDAVVLNELVQ >gi568815596f:182978934_183195655|GENSCAN_predicted_CDS_1|1161_bp atggtccaggaagcttctgaggcaattgggcagcatcagtcttcagacgctaagctgata agatctgggaaggagtcactcagagagccttgggccagagttccagggggtctgggagtg gctgccaggcgtagtgggcggtccgggccagggccagggccagagcggcggcggcgaggc catgccgagggcccgtcggccgcagttccccgtccggcctcgcgggggcgccggcgctgc cgatcaggtgaccgagggcccgtccggggctgcgggaagcggcctcgttctcagccgccg gagacgccgccgccgccgccgccgccgccacacctagtggagcagccggggaaggcggct cgttgggggctggggcggagagcgccggggcccagtcagcagaagctggcggagaagctc accatcctcaacgaccggggcgtcggcatgctcacccgcctctacaacatcaagaaggca tgtggagaccccaaggcaaaaccatcctatcttatcgacaaaaacctggaatctgctgtg aaattcatagtcagaaaattccctgctgtagaaacccgcaacaacaatcaacagcttgca caactacagaaagaaaaatcagagattctgaaaaatctggcattatattacttcacattt gtagatgttatggaatttaagactgtaaactttgatttaacaaagaactacttagattta attataacctatacaacactaatgatactgctgtctcgaattgaagaaaggaaggcaatc attggattatacaactatgcccatgaaatgactcatggagcaagtggttcaatgcacaga gaaagacgcaagtttttaagatctgcactgaaggaattggctactgtcctctctgatcaa cctggattgctaggtcccaaggcactttttgtttttatggcattatcctttgcccgtgat gaaatcatctggctacttcgtcatgcagataacatgccaaagaagagtgcagacgacttt atagataagcacattgctgaattaatattttacatggaagaacttagagcacatgtgagg aaatacggacctgtaatgcagaggtattacgtgcagtacctttctggctttgatgctgtt gtcctcaatgaactcgtgcag >gi568815596f:182978934_183195655|GENSCAN_predicted_peptide_2|866_aa MSIRTTQKSYRVSTSGPRAFSSHFYTSGPGASISSSSFSQVGSSSFRGGLGGGYGGASGM GGITTVTVNQSLLSPLNLEVDPHIQAVHTQKKEQIKTLNTKFATFIYKSYINNLRQQLET LGQEKLKLEAELGNMQGLVEDFKNKYEDEINKRTEMENEFVLIKKDVDEAYLNKVELESR LEGLTDEINFLRQLYEEDIPELQSQISDTSVVLPMDNSHSLDMDSIINEVKRSTRSPTAA SWLQAEIEGLKGQRASLEAAIADAEQRGELAVKDASAKLSELEAALQQTKQDVELMNDKL ALDIEIATYRQLLEGEESWLESGMQSMSIHTKTISSYAGGLSSAYGGLTSPGLSCGLGSS FGSGAGSSSFSCISYTRAVVVKKIETRDGTLVSEFSDVLPKQVVPLNLQPLAEMRPRVGS SYSQTGKENVMPVRVRDHLSRLSVSNKAVYSLGWHRELLTAHSGQTKTIAVILFPTPDPT RPTPKQTNSSYTFLLNTRLPTDLIRANKGKGGSGAFASTFWTWSRQAAPIRGGSNPGRRI TRATVGERGTKLEDTRVLLKQSPWSVRRGARRHQASSATVDLCCTKPVSLLPGEPPQKVP TGVCGPLPAGMIGLLLGTSSLNLKGVQIQTGVNDSDYNGEIQIVISTSVPWKAEPGERIA QLLIVPYVEIGKNEIKLTGGFGSTNKQGKAAYWVNQITDERPTCEITIQGKKFKVLVDTG ADISVISLQHWSSTWPIQPAQFNIVGVGKAPEVYQSSYILQCEGPDGRPGTIQPIITSVP INLWGRDLLQEWGAQVLIPEQLSDGEIPGEIRGPPSCSHVKADTKEDPNCHEQHPSNTAT HLGTDEETVTDGRRKPEESRTTSHNE >gi568815596f:182978934_183195655|GENSCAN_predicted_CDS_2|2601_bp atgtccatcaggacgacccagaagtcctacagggtgtccacttctggcccccgggccttc agcagccatttctacacaagtgggcctggtgcctccatcagctcctcgagcttctcccaa gtgggcagcagcagcttccggggtggcctgggaggaggctacggtggggccagtggcatg ggaggcatcaccaccgtcactgtcaaccagagcctgctgagcccccttaacctggaggtg gacccacacatccaggcagtgcacactcagaagaaggagcagatcaagaccctcaacact aagtttgccaccttcatatacaagagttacatcaacaaccttaggcagcagctggagact ctgggccaggaaaagctgaagctggaggcggagcttggcaacatgcaggggctggtggag gacttcaagaacaagtatgaggatgagatcaataagcgtacagagatggagaatgaattt gtcctcatcaagaaggatgtggatgaagcttacctgaacaaggtagagctggagtctcgc ctggaagggctgactgacgaaatcaacttcctcaggcagctgtatgaagaggatatcccg gagctgcagtcccagatctcagacacgtctgtggtgctgcccatggacaacagccactcc ctggacatggacagcatcatcaatgaggtcaagcgcagtacgagatcaccaaccgcagcc agctggctccaggcagagattgagggtctcaaaggccagagggcttccctggaggccgcc atcgcagatgcggagcagcgcggggagttggccgttaaggatgccagcgccaagctgtct gagctggaggccgccctgcagcaaaccaagcaggacgtggagctgatgaacgacaagctg gccctggacattgagatcgccacctacaggcagctgctggagggcgaggagagctggctg gagtctgggatgcagagcatgagtatccatacaaagaccatcagcagctatgcaggtggt ctgagctcggcctatgggggcctcacaagccccggcctcagctgtggactgggctccagc tttggctctggcgcgggctccagttccttcagctgcatcagctacaccagggccgtggtt gtgaagaagattgagacccgtgatgggacgctggtgtcagagttctctgacgtcctgccc aagcaggttgtcccattgaatctgcagcccttagcagagatgagacccagagtgggtagc tcctactcgcagacagggaaggagaatgttatgcctgtccgtgtaagagaccacctgagc aggcttagtgtgagcaacaaggctgtttattcacttgggtggcatagggaacttcttaca gcacattctggccagaccaagactatcgctgttattcttttcccaaccccagaccccacc agacccacccctaaacagactaactcctcttacaccttcctcctcaatacaaggctgcct actgacctaatccgtgctaacaaagggaaaggggggtctggagcatttgcatctaccttt tggacctggtctcggcaagcagcgcccatacgtgggggctcgaatccaggtcgaaggatc accagagcgacagttggagaacgtggaactaagctggaggacacccgagtactcttaaag cagtccccatggtcagtaagaaggggagctcggaggcatcaagccagtagtgccacagta gatttatgctgcacaaaacctgtgagccttctgcctggggaacccccacaaaaggtccca acaggagtctgtggacccttgccagcggggatgataggattacttctaggaacgtctagt ttaaatttaaaaggggtacaaatacaaacaggagtcaatgattcagattacaatggggaa attcaaattgttatatctacttctgttccctggaaagcagagccaggagaacgtatagca cagctcctgattgtgccatatgtggaaatagggaaaaatgaaattaaactaacaggaggg tttggaagcacaaataaacaaggcaaagcagcttattgggtaaatcaaattactgatgaa cgtcctacctgtgaaataactattcagggaaagaaatttaaagttttggtagatacagga gcagacatttcagtcatttctctacagcactggtcatccacgtggcccattcaacctgct caatttaacatagttggagttggtaaagcccctgaagtatatcaaagtagttatattttg caatgtgaaggacctgatggacgacctgggactattcaaccaattataacttctgtacct ataaatttatgggggagagatttattacaagaatggggagcacaagttctaattccagag caattatccgatggagagattccgggagaaatccgaggaccccccagttgcagccatgta aaggctgacactaaggaggaccccaactgtcatgagcaacacccgtcgaacacagccacc cacctggggacagatgaagaaactgtcacagatggcagaagaaaacctgaggaaagcagg acaaccagtcacaatgagtaa >gi568815596f:182978934_183195655|GENSCAN_predicted_peptide_3|262_aa MDVVYSIRDTTSAKQPSAAAAVPEGIVTSQPIVHPVVETIATAHVMYSLNQEIKAFSRNN LRKQCTRVTTLTGKKIIETWKDARIHVVEEVEPSSGGGCGYVQDLSSDLQVGVIKPWLLL GSQDAAHDLDTLKKNKVTHILNVAYGVENAFLSDFTYKSISILDLPETNILSYFPECFEF IEEAKRKDGVVLVHCNAGVSRAAAIVIGFLMNSEQTSFTSAFSLVKNARPSICPNSGFME QLRTYQEGKESNKCDRIQENSS >gi568815596f:182978934_183195655|GENSCAN_predicted_CDS_3|789_bp atggatgtggtctattccataagggacacaacctcggcaaagcagccctctgcagctgca gcagtccctgaagggattgttaccagtcagccaatagtacacccggtagtggagacaata gccactgctcatgtaatgtactcccttaaccaggaaattaaagcattctcccggaataat ctcaggaagcaatgcaccagggtgacaacgctaactggaaagaaaattatagaaacatgg aaagatgccagaattcatgttgtggaagaagtagagccgagcagtgggggtggttgtggt tatgtgcaggaccttagctcggacctgcaagttggcgttattaagccatggttgctccta gggtcacaagatgctgctcatgatttggatacactgaaaaagaataaggtgactcatatt cttaatgttgcatatggagttgaaaatgctttcctcagtgactttacatataagagcatt tctatattggatctgcctgaaaccaacatcctgtcttattttccagaatgttttgaattt attgaagaagcaaaaagaaaagatggagtggttcttgttcattgtaatgcaggcgtttcc agggctgctgcaattgtaataggtttcctgatgaattctgaacaaacctcatttaccagt gctttttctttggtgaaaaatgcaagaccttccatatgtccaaattctggcttcatggag cagcttcgtacatatcaagagggcaaagaaagcaataagtgtgacagaatacaggagaac agttcatga >gi568815596f:182978934_183195655|GENSCAN_predicted_peptide_4|537_aa MGFHHVGQAGLELLTSVVGIQITQHQNMLSAANPYGSATTILVSSGENLTKGHKAEETKT SFRAGVKYPQIRKHSRMKSIHRTIRWFATFPVQKPFPRKFSAQRLCPIPKWLRRLQLKHS AGPEVTQPHKVARRYPWGAFPPFLKINWEGSEPMMLGSPTSPKPGVNAQFLPGFLMGDLP APVTPQPRSISGPSVGVMEMRSPLLAGGSPPQPVVPAHKDKSGAPPVRSIYDDISSPGLG STPLTSRRQPNISVMQSPLVGVTSTPGTGQSMFSPASIGQPRKTTLSPAQLDPFYTQGDS LTSEDHLDDSWVTVFGVQPPSWLFSRAGIACMGFPGTECKLSVDLPFWGLEDGGPLLTAP LGSALVGTLCGSFDPTVPFRTALAEVLHEGPTPAAQFCLGIQMSNTGNWMHIRYQSKLQA RKALSKDGRIFGESIMIGVKPCIDKSVMESSDRCALSSPSLAFTPPIKTLGTPTQPGSTP RISTMRPLATAYKASTSDYQDNRVLQSDRSGTAGLEALAGIACLAASRRGQEAAATG >gi568815596f:182978934_183195655|GENSCAN_predicted_CDS_4|1614_bp atggggtttcaccatgttggccaggctggtcttgaactcctgacctcagtggtgggaatc caaattacacagcaccaaaatatgttgtcagcggcaaatccatacgggtctgcaacaaca attcttgtgtcctcaggagagaatctgactaaggggcataaggcagaagagaccaagaca agttttagagcaggagtgaagtacccacaaatcaggaagcattcccggatgaaaagcatt cacagaaccatacgctggttcgccaccttcccggtgcaaaaaccctttcctcggaaattc tcagcgcaaagactttgccctattccaaaatggctccggcgcctacaacttaagcatagc gcaggtccggaagttacgcaaccccataaggtagcacgccgttacccgtggggagcgttt ccgccatttttgaaaattaattgggaaggatctgaaccaatgatgctgggttcacccaca tctccaaagccaggagttaatgcccagttcttacctggatttttaatgggggatttgcca gctccggtgactccacaacctcgatcaattagtggcccttcagtaggagtaatggaaatg agatcacctttacttgcaggtgggtcaccaccacaaccagttgtaccagctcataaagat aaaagtggcgctccaccagttagaagtatatatgatgacatttctagcccaggacttgga tcaacacctttaacttcaagaagacagccaaacatttcagtaatgcagagtcctcttgtt ggagttacatctactcctggaacagggcaaagtatgtttagtccagcaagtatcggtcag ccacgaaagacgacattatctcctgcccagttggatcctttttatactcaaggagattct ttgacttcagaagatcacctcgatgactcttgggtgactgtatttggggtacagcctccc tcctggctgttttcccgtgctggcattgcgtgtatgggttttccaggcacagagtgcaag ctgtcagtggatctaccattctggggactggaggatggtggccctcttctcacagcccca ctaggcagtgccctagtagggaccctgtgtgggagctttgaccccacagttcccttccgc actgccctagcggaggttctccatgagggccccacccctgcagcacaattttgcctgggc atccagatgtctaatacaggaaattggatgcatattcgttatcaatctaaactgcaggct cggaaagccttaagcaaagatgggaggatttttggagaatccatcatgattggtgtaaaa ccatgtattgacaaaagtgttatggaaagcagtgacagatgtgctttatcatctccatct ttagcctttacaccaccaatcaaaactctaggtacaccaacacaacctggaagtactcct aggatttctaccatgagacctcttgctacagcatacaaagcctctactagtgattatcag gacaacagggtgctacaatcagacagaagtgggacagctgggctggaagctctagcaggc attgcttgcctggctgccagccgtaggggacaagaggctgcagccactggctga >gi568815596f:182978934_183195655|GENSCAN_predicted_peptide_5|104_aa XLRKQPGSSDCVEFTPAQKSDCGQTSSLDSSSLGRASLKESQQPQSGTYRQTGSGVDLQQ TPADLQKRGPTVKRKTNKQKARTSTSTKRTPTQKPHPKVINIKD >gi568815596f:182978934_183195655|GENSCAN_predicted_CDS_5|315_bp ntgttaaggaagcaaccgggaagttcagactgtgtggaattcaccccagcacagaaaagc gactgtggccagacttcctctttagattcctcctcactgggcagggcatctctgaaagaa agccagcagccccagtcagggacttataggcaaacagggtctggagtggacctccagcaa actccagcagacctgcagaagaggggaccaactgttaaaaggaaaaccaacaaacagaaa gcaaggacatcaacatcaacaaaaaggacacccacacaaaaaccgcatccaaaggtcatc aacatcaaagactaa