GENSCAN 1.0 Date run: 4-Nov-116 Time: 03:03:56 Sequence gi568815584r:21111226_21334193 : 222968 bp : 41.91% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 711 706 6 1.05 1.04 Term - 5389 5304 86 2 2 95 48 73 0.028 0.84 1.03 Intr - 21392 21307 86 1 2 107 74 -11 0.035 -2.06 1.02 Intr - 24690 24565 126 2 0 60 69 146 0.225 8.77 1.01 Init - 27396 27272 125 0 2 58 92 34 0.569 0.49 1.00 Prom - 32356 32317 40 -5.95 2.03 PlyA - 32692 32687 6 1.05 2.02 Term - 37270 37104 167 2 2 74 47 95 0.292 1.10 2.01 Init - 44548 43732 817 1 1 50 41 639 0.348 48.32 2.00 Prom - 47126 47087 40 -8.35 3.00 Prom + 50411 50450 40 -10.15 3.01 Init + 51440 51523 84 1 0 75 47 77 0.496 3.17 3.02 Intr + 51883 52110 228 0 0 40 -6 295 0.438 12.44 3.03 Term + 52119 52271 153 2 0 20 45 207 0.871 6.54 3.04 PlyA + 53377 53382 6 1.05 4.08 PlyA - 53797 53792 6 -1.95 4.07 Term - 54335 54120 216 2 0 73 41 126 0.696 2.56 4.06 Intr - 56185 56104 82 0 1 37 106 121 0.813 7.52 4.05 Intr - 57992 57697 296 2 2 50 31 240 0.928 9.38 4.04 Intr - 58822 58718 105 1 0 107 10 97 0.475 3.29 4.03 Intr - 70536 70451 86 1 2 59 78 1 0.010 -5.18 4.02 Intr - 81314 80673 642 0 0 88 56 264 0.536 14.26 4.01 Init - 82214 82193 22 2 1 90 96 11 0.496 2.01 4.00 Prom - 92683 92644 40 -6.85 5.08 PlyA - 94229 94224 6 1.05 5.07 Term - 100081 99998 84 1 0 109 47 174 0.999 12.07 5.06 Intr - 100341 100181 161 1 2 55 67 319 0.874 25.29 5.05 Intr - 100698 100585 114 1 0 41 99 133 0.994 9.20 5.04 Intr - 101892 101735 158 2 2 73 31 191 0.997 10.63 5.03 Intr - 119141 119094 48 1 0 85 77 50 0.475 0.48 5.02 Intr - 119847 119733 115 1 1 104 57 139 0.944 10.99 5.01 Init - 122968 122728 241 1 1 90 97 204 0.990 19.58 5.00 Prom - 129138 129099 40 -6.25 6.00 Prom + 130289 130328 40 -4.75 6.01 Init + 143264 143301 38 1 2 42 116 26 0.261 0.33 6.02 Intr + 156189 156333 145 0 1 52 77 62 0.467 0.86 6.03 Intr + 157702 157887 186 0 0 32 37 189 0.877 7.16 6.04 Intr + 158132 158273 142 1 1 43 30 154 0.345 4.11 6.05 Intr + 176714 176836 123 0 0 106 88 77 0.848 9.14 6.06 Intr + 183452 183584 133 0 1 56 75 147 0.862 8.98 6.07 Intr + 189741 190012 272 0 2 71 64 197 0.712 11.86 6.08 Intr + 191263 191359 97 2 1 71 74 57 0.709 0.75 6.09 Intr + 192187 192318 132 1 0 58 103 146 0.388 11.94 6.10 Intr + 196506 196611 106 0 1 67 105 87 0.982 7.60 6.11 Intr + 197006 197051 46 1 1 80 115 0 0.627 -1.04 6.12 Intr + 200516 200745 230 0 2 29 84 193 0.611 9.67 6.13 Intr + 201214 201281 68 0 2 26 113 91 0.187 2.18 6.14 Intr + 206471 206625 155 2 2 37 102 226 0.274 17.69 6.15 Intr + 208792 208952 161 2 2 58 94 158 0.999 12.19 6.16 Intr + 210034 210192 159 0 0 86 47 167 0.593 11.66 6.17 Intr + 210629 210746 118 1 1 35 63 128 0.985 4.12 6.18 Intr + 213393 213845 453 1 0 87 111 291 0.941 23.60 6.19 Intr + 214007 214158 152 0 2 74 78 122 0.999 8.76 6.20 Intr + 214606 214948 343 0 1 60 81 317 0.999 22.28 6.21 Intr + 216398 216582 185 0 2 101 93 167 0.962 17.09 6.22 Intr + 217151 217402 252 1 0 103 94 117 0.528 10.51 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 167561 167723 163 1 1 51 100 55 0.840 2.94 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:21111226_21334193|GENSCAN_predicted_peptide_1|140_aa MTTFGERQVTPTGKGQKGSYWVAEIVPLLELGTHCMQGCSLSNNASSNMQELVKNQNTPA TAKMASELRLLAASSEISPAENTNVPASPSPSTMIVSFLRPPQPCFLYSLQNSNILIEQC QMLDIQRCSSTSGFHKYEGG >gi568815584r:21111226_21334193|GENSCAN_predicted_CDS_1|423_bp atgactacttttggggagaggcaggtgactccaactggtaaagggcagaaggggagctac tgggttgctgagattgttccacttctggaactgggtactcattgcatgcaagggtgttca ctaagtaacaatgcatcctccaatatgcaggaacttgtgaagaatcagaacacacctgcg actgctaagatggcaagtgaactcaggttgctggctgcttcttctgaaatcagtcctgca gaaaatacaaacgtgcctgcttccccttcgccttccaccatgattgtaagtttcctgagg cctccccagccatgcttcctgtatagcctgcagaactctaacatcttgattgaacaatgc cagatgctggacatacagcgctgctccagtacctccggatttcataaatacgaaggggga tga >gi568815584r:21111226_21334193|GENSCAN_predicted_peptide_2|327_aa MYTATLLGNLVMFLLIHVSATLHTPMYSLLKSLSFLDFCYSSTVVPQTLVNFLAKRKVIS YFGCMTQMFFYAGFATSECYLIAAMAYDRYAAICNPLLYSTIMSPEVCASLIVGSYSAGF LNSLIHTGCIFSLKFCGAHVVTHFFCDGPPILSLSCVDTSLCEILLFIFAGFNLLSCTLT ILISYFLILNTILKMSSAQGRFKAFSTCASHLTAICLFFGTTLFMYLRPRSSYSLTQDRT VAVIYTVVIPVLNPLMYSLRNKDVKKALIKVWDIFGYSSLLAIPCGFLNQLVNVYSEVSC YFEMDCVESVDQFGEYCHLNNIKSSDP >gi568815584r:21111226_21334193|GENSCAN_predicted_CDS_2|984_bp atgtacacagccactctgctggggaacctggtcatgttcctcctgatccatgtgagtgcc accctgcacacacccatgtactccctcctgaagagcctctccttcttggatttctgctac tcctccacggttgtgccccagaccctggtgaacttcttggccaagaggaaagtgatctct tattttggctgcatgactcagatgttcttctatgcgggttttgccaccagtgagtgctat ctcatcgctgccatggcctatgaccgctatgccgctatttgtaaccccctgctctactca accatcatgtctcctgaggtctgtgcctcgctgattgtgggctcctacagtgcaggattc ctcaattctcttatccacactggctgtatctttagtctgaaattctgcggtgctcatgtc gtcactcacttcttctgtgatgggccacccatcctgtccttgtcttgtgtagacacctca ctgtgtgagatcctgctcttcatttttgctggtttcaaccttttgagctgcaccctcacc atcttgatctcctacttcttaattctcaacaccatcctgaaaatgagctcggcccagggc aggtttaaggcattttccacctgtgcatcccacctcactgccatctgcctcttctttggc acaacactttttatgtacctgcgccccaggtccagctactccttgacccaggaccgcaca gttgctgtcatctacacagtggtgatcccagtgctgaaccccctcatgtactctttgaga aacaaggatgtgaagaaagctttaataaaggtttgggatatttttggctattctagctta ctagcaattccatgtggatttttgaatcagcttgtcaatgtctacagtgaagtcagctgc tattttgaaatggattgtgttgaatctgtagatcaatttggggaatattgccatcttaac aatattaagtcttctgatccatga >gi568815584r:21111226_21334193|GENSCAN_predicted_peptide_3|154_aa MASSGIMIPKPPKPSDKLLMPYMRYSRKRNHSLINEILSDSVVPDIQSAVTTARMQVLKQ QVQSLMVHQQKLEAELLQIEERYPKTRKFLESTESFSNNLKVCVVEVDMEKIAAEIAQAE EQTHKRQEEREREVADQAKRSQSGIVPEQEHAAN >gi568815584r:21111226_21334193|GENSCAN_predicted_CDS_3|465_bp atggcatcctctggtattatgattccaaaaccaccaaaaccatcagataagctgctgatg ccctacatgaggtacagccgaaagagaaaccacagcctcatcaatgaaatccttagtgac agtgtggtgccagacattcagtcagctgtcacaacagctagaatgcaggtcctcaaacaa caggtccagtccttaatggttcatcagcaaaaactagaagctgaacttcttcaaatagag gaacgatacccgaagacgaggaaattcctggaaagcacagaatcatttagcaataactta aaagtttgtgtggtagaagtggatatggagaaaattgcagctgagattgcacaggcagag gaacagacccacaaaaggcaggaggaaagggagagggaggtggcagaccaagctaagcgc agtcagagcggcatcgttcctgagcaagagcacgcggccaactaa >gi568815584r:21111226_21334193|GENSCAN_predicted_peptide_4|482_aa MDPTSAKGICLPPAVIYGILPDFALRWEGGTAGRSQAVRAGAFKPTRAGGHSRVPKSAGM SESPARVWVAAAACGVGGGGTGLLPALLSGRPRSAAAVLVAAAVPWRTGILPAPSPERAQ GGSDLQPQLVQLQLRPRGRGSCFLHEVGGPVCSHRLGSRSGAQGRPAPVQKGQGSRLFPP APLSMQPQVPPCCSWCDGSGRPSGAAAAIIASFKWRFFWVLGIYPEEKKSLYKTDTCTHM FIAAQFTIAKTKGSRDRTAAVVVAEGLLDASGVLSPGKTLGHFQWYLKVLTVKAAKQPRW WPTLPSGSSVPWRFGIAASWKTAEGVDVDLGQEVQPVKRNGIQDLHEKAAWPSLRRAAVL CQGTAPVPTHLGLPRARWQQQLRLSYRRRSPTPGPQADIVEPGYTAGGEWWLAGLVRVLG NQPVGRMVILDMLFPPEDLSVVNHFRKSGLPPDDNLTPEHFFLQSCFRCPSANHIIPAKR LA >gi568815584r:21111226_21334193|GENSCAN_predicted_CDS_4|1449_bp atggatcccacatctgccaagggaatctgtctgcctcctgctgtcatttatggtatcctc ccagactttgctctgagatgggaagggggaacagcagggagaagccaggcagtgagagca ggcgctttcaagcctacgagggcaggggggcattcccgggtccccaagagtgcagggatg tctgagtctccagccagggtttgggtggcggcagctgcgtgtggggttgggggagggggg acaggactcctgcctgctcttctgagtgggaggcccaggtctgcagctgcagttttggtg gctgcagctgtgccctggaggacagggatcctgcctgctcccagccccgaaagagcgcag ggaggctcggatctgcagccacaacttgtgcagctgcagctccgcccaagagggcggggc tcctgtttcctccatgaagtgggaggcccggtctgcagccaccgtttgggtagccgcagc ggcgcccagggacgtcctgccccagttcagaaggggcagggctcccgcttgttccctccg gctccactaagcatgcagccccaggtgcctccctgctgcagctggtgtgatggtagtggc agaccatctggagcagccgctgccatcattgcctcattcaaatggaggtttttctgggtt cttggtatctacccagaggaaaagaaatcattatacaaaacagatacttgcacacacatg tttatagcagcacaattcacaattgcaaaaaccaaaggcagcagggatagaactgctgct gtggtcgtggcagaggggctcttggatgcctctggggttctctccccagggaaaaccctg ggccacttccaatggtatctgaaggtattaacagtgaaggctgcaaaacaaccaagatgg tggcccacgcttccctctgggagctctgtcccatggaggtttggaattgctgccagctgg aaaacagctgagggggtggatgtagacctcggtcaggaggttcagccagtgaagagaaat gggatccaggacctgcatgaaaaagcagcctggccatctctccggagagctgctgtgctg tgccaggggactgctccagtccctactcatcttggactccccagagccagatggcaacaa cagctaaggctgtcctacagaaggcggtccccaacccctgggccacaggctgatattgta gaaccaggctacacagcaggaggtgaatggtggcttgctggacttgtccgtgtcctggga aatcagcctgtgggtaggatggtcatcctggacatgctcttccctccagaagacctatct gtggttaaccatttccggaaatctgggttaccacctgatgataatctaacaccagagcat ttcttccttcaaagttgcttcagatgtccttcagcaaaccatataattccagcaaagcgc ctggcttga >gi568815584r:21111226_21334193|GENSCAN_predicted_peptide_5|306_aa MASNVTNKTDPRSMNSRVFIGNLNTLVVKKSDVEAIFSKYGKIVGCSVHKGFAFVQYVNE RNARAAVAGEDGRMIAGQVLDINLAAEPKVNRGKAGVKRSAAEMYGSVTEHPSPSPLLSS SFDLDYDFQRDYYDRMYSYPARVPPPPPIARAVVPSKRQRVSGNTSRRGKSGFNSKSGQR GSSKSGKLKGDDLQAIKKELTQIKQKVDSLLENLEKIEKEQSKQAVEMKNDKSEEEQSSS SVKKDETNVKMESEGGADDSAEEGDLLDDDDNEDRGDDQLELIKDDEKEAEEGEDDRDSA NGEDDS >gi568815584r:21111226_21334193|GENSCAN_predicted_CDS_5|921_bp atggccagcaacgttaccaacaagacagatcctcgctccatgaactcccgtgtattcatt gggaatctcaacactcttgtggtcaagaaatctgatgtggaggcaatcttttcgaagtat ggcaaaattgtgggctgctctgttcataagggctttgccttcgttcagtatgttaatgag agaaatgcccgggctgctgtagcaggagaggatggcagaatgattgctggccaggtttta gatattaacctggctgcagagccaaaagtgaaccgaggaaaagcaggtgtgaaacgatct gcagcggagatgtacgggtcagtaacagaacacccttctccgtcccctctactcagctcc tcttttgacttggactatgactttcaacgggactattatgataggatgtacagttaccca gcacgtgtacctcctcctcctcctattgctcgggctgtagtgccctcgaaacgtcagcgt gtatcaggaaacacttcacgaaggggcaaaagtggcttcaattctaagagtggacagcgg ggatcttccaagtctggaaagttgaaaggagatgaccttcaggccattaagaaggagctg acccagataaaacaaaaagtggattctctcctggaaaacctggaaaaaattgaaaaggaa cagagcaaacaagcagtagagatgaagaatgataagtcagaagaggagcagagcagcagc tccgtgaagaaagatgagactaatgtgaagatggagtctgaggggggtgcagatgactct gctgaggagggggacctactggatgatgatgataatgaagatcggggggatgaccagctg gagttgatcaaggatgatgaaaaagaggctgaggaaggagaggatgacagagacagcgcc aatggcgaggatgactcttaa >gi568815584r:21111226_21334193|GENSCAN_predicted_peptide_6|1232_aa MFYEHVLLYSDQSHSIQNKVECHVKSSRDLITNVSCLNSLQKEHQPPSIPESGTSMRGKK KKNGGNRRHRRYVNFDYINEHHDLHFHDPVVLAVVENERKAAVNTGRAKKGHFPWRSWPR GHQRSRLLLEVGNAATTAQSSSLHKMAPESPLPPRRRPSPRPIPENLPIGLSSGISYSLG TEIMSHLVDPTSGDLPVRDIDAIPLVLPASKGKNMKTQPPLSRMNREELEDSFFRLREDH MLVKELSWKQQDEIKRLRTTLLRLTAAGRDLRVAEEAAPLSETARRGQKAGWRQRLSMHQ RPQMHRLQGHFHCVGPASPRRAQPRVQVGHRQLHTAGAPVPEKPKRGPRDRLSYTAPPSF KEHATNENRGEVASKPSELAHIMASNTMQVEEPPKSPEKMWPKDENFEQRSSLECAQKAA ELRASIKEKVELIRLKKLLHERNASLVMTKAQLTEVQEIFTVMIFFKPDNSPQENARVAG VWRPLVLSDMTIPRGTFLLTQNQGILSAAHEALLKQVNELRAELKEESKKAVSLKSQLED VSILQMTLKEERVEDLEKERKLLNDNYDKLLESMLDSSDSSSQPHWSNELIAEQLQQQVS QLQDQLDAELEDKRKVLLELSREKAQNEDLKLEVTNILQKHKQEVELLQNAATISQPPDR QSEPATHPAVLQENTQIEPSEPKNQEEKKLSQVLNELQVSHAETTLELEKTRDMLILQRK INVCYQVQGKMEELEAMMTKADNDNRDHKEKLERLTRLLDLKNNRIKQLEEQLKDVAYGT RPLSLCLETLPAHGDEDKVDISLLHQGENLFELHIHQAFLTSAALAQAGDTQPTTFCTYS FYDFETHCTPLSVGPQPLYDFTSQYVMETDSLFLHYLQEASARLDIHQAMASEHSTLAAG WICFDRVLETVEKVHGLATLIGAGGEEFGVLEYWMRLRFPIKPSLQACNKRKKAQVYLST DVLGGRKAQEEEFRSESWEPQNELWIEITKCCGLRSRWLGTQPSPYAVYRFFTFSDHDTA IIPASNNPYFRDQARFPVLVTSDLDHYLRREALSIHVFDDEDLEPGSYLGRARVPLLPLA KNESIKGDFNLTDPAEKPNGSIQVQLDWKFPYIPPESFLKPEAQTKGKDTKDSSKISSEE EKASFPSQVVKLPACNAISDLSIQDQMASPEVPIEAGQYRSKRKPPHGGERKEKEHQVVS YSRRKHGKRIGVQGKNRMEYLSLNILNGNTPE >gi568815584r:21111226_21334193|GENSCAN_predicted_CDS_6|3696_bp atgttctatgagcatgtattactttattcggatcaaagccattccatccaaaacaaggtc gaatgccatgtcaaatcctctagagatttaattacaaacgtaagctgtttaaattctcta cagaaggaacatcaaccaccaagtattcccgagagtggcactagcatgagggggaaaaaa aagaagaacggaggtaatagaaggcatcggcgctacgttaacttcgactatattaatgag catcacgatcttcattttcatgatcccgtagtcctcgcggtcgtagaaaatgaaaggaaa gcagccgttaacacaggaagagcgaaaaaaggacattttccctggcgatcgtggcctaga ggacaccagagaagccgactgctgctggaggtcggcaacgcggccacaaccgctcagtct tcgtctcttcacaaaatggcccccgagtctcctctgccgccgaggcgaaggcctagtccc cgccccattcctgagaatcttcctattggcttgtcctctgggatctcttacagcttggga acagagatcatgtcacatctggtggaccctacatcaggagacttgccagttagagacata gatgctatacctctggtgctaccagcctcaaaaggtaagaatatgaaaactcaaccaccc ttgagcaggatgaaccgggaggaattggaggacagtttctttcgacttcgcgaagatcac atgttggtgaaggagctttcttggaagcaacaggatgagatcaaaaggctgaggaccacc ttgctgcggttgaccgctgctggccgggacctgcgggtcgcggaggaggcggcgccgctc tcggagaccgcaaggcgcgggcagaaggcgggatggcggcagcgcctctccatgcaccag cgcccccagatgcaccgactgcaagggcatttccactgcgtcggccctgccagcccccgc cgcgcccagcctcgcgtccaagtgggacacagacagctccacacagccggtgcaccggtg ccggagaaacccaagagggggccaagggacaggctgagctacacagcccctccatcgttt aaggagcatgcgacaaatgaaaacagaggtgaagtagccagtaaacccagtgaacttgcc cacatcatggccagcaataccatgcaagtggaagagccacccaagtctcctgagaaaatg tggcctaaagatgaaaattttgaacagagaagctcattggagtgtgctcagaaggctgca gagcttcgagcttccattaaagagaaggtagagctgattcgacttaagaagctcttacat gaaagaaatgcttcattggttatgacaaaagcacaattaacagaagttcaagagatattt acagtcatgatcttttttaagcctgacaacagccctcaagagaatgctagggttgctggt gtctggagaccactcgtgctgagtgatatgaccattcccagaggtactttccttttgacc cagaatcagggaatcctgagtgcagcccatgaggccctcctcaagcaagtgaatgagctc agggcagagctgaaggaagaaagcaagaaggctgtgagcttgaagagccaactggaagat gtgtctatcttgcagatgactctgaaggaggagagagttgaagatttggaaaaagaacga aaattgctgaatgacaattatgacaaactcttagaaagcatgctggacagcagtgacagc tccagtcagccccactggagcaacgagctcatagcggaacagctacagcagcaagtctct cagctgcaggatcagctggatgctgagctggaggacaagagaaaagttttacttgagctg tccagggagaaagcccaaaatgaggatctgaagcttgaagtcaccaacatacttcagaag cataaacaggaagtagagctcctccaaaatgcagccacaatttcccaacctcctgacagg caatctgaaccagccactcacccagctgtattgcaagagaacactcagatcgagccaagt gaacccaaaaaccaagaagaaaagaaactgtcccaggtgctaaatgagttgcaagtatca cacgcagagaccacattggaactagaaaagaccagggacatgcttattctgcagcgcaaa atcaacgtgtgttatcaggtgcaaggaaagatggaggaactggaggcaatgatgacaaaa gctgacaatgataatagagatcacaaagaaaagctggagaggttgactcgactactagac ctcaagaataaccgtatcaagcagctggaagaacagctcaaagatgttgcttatggcacc cgaccgttgtcgttatgtttggaaacactgccagcccatggagatgaggataaagtggat atttctctgctgcatcagggtgagaatctttttgaactgcacatccaccaggccttcctg acatctgccgccctagctcaggctggagatacccaacctaccactttctgcacctattcc ttctatgactttgaaacccactgtaccccattatctgtggggccacagcccctctatgac ttcacctcccagtatgtgatggagacagattcgcttttcttacactaccttcaagaggct tcagcccggcttgacatacaccaggccatggccagtgaacacagcactcttgctgcagga tggatttgctttgacagggtgctagagactgtggagaaagtccatggcttggccacactg attggagctggtggagaagagttcggggttctagagtactggatgaggctgcgtttcccc ataaaacccagcctacaggcgtgcaataaacgaaagaaagcccaggtctacctgtcaacc gatgtgcttggaggccggaaggcccaggaagaggagttcagatcggagtcttgggaacct cagaacgagctgtggattgaaatcaccaagtgctgtggcctccggagtcgatggctggga actcaacccagtccatatgctgtgtaccgcttcttcaccttttctgaccatgacactgcc atcattccagccagtaacaacccctactttagagaccaggctcgattcccagtgcttgtg acctctgacctggaccattatctgagacgggaggccttgtctatacatgtttttgatgat gaagacttagagcctggctcgtatcttggccgagcccgagtgcctttactgcctcttgca aaaaatgaatctatcaaaggtgattttaacctcactgaccctgcagagaaacccaacgga tctattcaagtgcaactggattggaagtttccctacataccccctgagagcttcctgaaa ccagaagctcagactaaggggaaggataccaaggacagttcaaagatctcatctgaagag gaaaaggcttcatttccttcccaggttgttaaactaccagcttgtaatgctatctctgat ctttctattcaggatcagatggcatctcctgaggttcccattgaagctggccagtatcga tctaagagaaaacctcctcatgggggagaaagaaaggagaaggagcaccaggttgtgagc tactcaagaagaaaacatggcaaaagaataggtgttcaaggaaagaatagaatggagtat cttagccttaacatcttaaatggaaatacaccagag