GENSCAN 1.0 Date run: 3-Nov-116 Time: 20:07:09 Sequence gi568815575r:77868877_78069638 : 200762 bp : 39.45% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.09 Intr - 2049 1932 118 2 1 56 95 153 0.276 12.35 1.08 Intr - 6721 6567 155 1 2 99 26 101 0.161 2.95 1.07 Intr - 26580 26289 292 2 1 67 32 160 0.034 4.51 1.06 Intr - 26962 26828 135 0 0 55 -6 164 0.178 2.46 1.05 Intr - 30710 30606 105 1 0 -5 55 163 0.319 2.11 1.04 Intr - 39506 39438 69 1 0 29 97 84 0.304 0.78 1.03 Intr - 41887 41593 295 2 1 6 63 190 0.279 3.44 1.02 Intr - 49726 49564 163 1 1 76 61 80 0.101 2.73 1.01 Init - 52398 52294 105 0 0 79 115 34 0.547 5.47 1.00 Prom - 54491 54452 40 -4.95 2.00 Prom + 55622 55661 40 -5.35 2.01 Init + 62333 62453 121 1 1 56 74 71 0.053 2.90 2.02 Term + 93747 93970 224 1 2 132 49 251 0.342 21.70 2.03 PlyA + 95993 95998 6 1.05 3.02 PlyA - 96318 96313 6 1.05 3.01 Sngl - 100762 99998 765 1 0 111 49 1053 0.993 99.54 3.00 Prom - 101010 100971 40 -8.45 4.00 Prom + 101928 101967 40 -12.23 4.01 Init + 102269 102316 48 1 0 71 97 21 0.736 2.30 4.02 Intr + 102745 102885 141 0 0 80 115 129 0.946 14.43 4.03 Intr + 119366 119855 490 1 1 76 90 452 0.737 35.85 4.04 Intr + 120357 121082 726 1 0 70 91 492 0.994 38.17 4.05 Intr + 129602 129808 207 0 0 59 100 218 0.981 18.23 4.06 Intr + 134197 134360 164 2 2 47 81 228 0.822 16.77 4.07 Intr + 140226 140387 162 2 0 64 100 42 0.724 2.25 4.08 Intr + 144003 144236 234 2 0 75 98 191 0.993 15.76 4.09 Intr + 145786 145877 92 0 2 96 68 76 0.995 4.27 4.10 Intr + 146878 147005 128 1 2 19 91 204 0.358 13.20 4.11 Intr + 151368 151522 155 1 2 88 88 148 0.356 13.67 4.12 Intr + 152069 152203 135 1 0 82 95 93 0.989 9.24 4.13 Intr + 160374 160568 195 2 0 70 116 111 0.984 10.79 4.14 Intr + 162524 162706 183 1 0 91 88 158 0.999 15.16 4.15 Intr + 164729 164945 217 1 1 24 82 191 0.991 9.25 4.16 Intr + 169960 170106 147 2 0 85 91 179 0.998 17.19 4.17 Intr + 171715 171857 143 2 2 40 91 139 0.722 8.55 4.18 Intr + 173709 173912 204 2 0 54 110 301 0.973 27.37 4.19 Intr + 174441 174558 118 2 1 68 84 121 0.980 8.82 4.20 Intr + 176594 176696 103 0 1 69 93 33 0.967 0.21 4.21 Term + 177418 177694 277 1 1 19 42 313 0.388 13.75 4.22 PlyA + 179540 179545 6 1.05 5.00 Prom + 186220 186259 40 -5.95 5.01 Init + 188693 188780 88 1 1 86 78 90 0.770 8.65 5.02 Intr + 193529 193632 104 0 2 52 62 108 0.305 3.57 5.03 Term + 197247 197414 168 2 0 29 54 146 0.589 2.20 5.04 PlyA + 198424 198429 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:77868877_78069638|GENSCAN_predicted_peptide_1|479_aa MNKVTELSNSMPYLRTTTGMAEAYGMNGNNKEIVKLRFPLGKEEQHSQQLLLRNYTVSIC PDLMTLRLKEAPPYNKVGHHHNKRLGGDKGSNKAHTIGEVTVAAAAAAAATTPPSASLFP PPAAAASRLYPSLPRLPPLPGDAPAGSRAATGKRATGHVKRNAWSRASFSCPLLPPGRHA ASSGSLLGDIKPGGWELNICTSTTTTVPVSRRFSALLTKGNIVKVKLELRQYSCCFGANS LQVPLKILHEGGEFQGLLNESLKDSVLATSSHFGLRKLRNISQLRMRAWKAPSRSKVSLI EGRGANMAARWRFWCVSVTMVVALLIVCDVPSASAQRKKEVRTRFPAAWAFPNDWGLRGS VRLFPAPFPCRFYAFGGFNYAVFGVVEFMVLSEKVSQLMEWTNKRPVIRMNGDKFRRLVK APPRNYSVIVMFTALQLHRQQADEEFQILANSWRYSSAFTNRIFFAMVDFDEGSDVFQM >gi568815575r:77868877_78069638|GENSCAN_predicted_CDS_1|1437_bp atgaataaagttacagagttgtcaaacagcatgccatatttaaggaccaccactggtatg gctgaggcatatggcatgaatggtaacaacaaggagattgtaaagctaagattcccattg ggaaaggaagaacagcacagccagcaactacttctcaggaactatactgtcagtatctgc cctgatctcatgactctgagattaaaagaagctcctccttataacaaggttggacatcac cataataaaagattgggaggagacaaaggctccaacaaagctcacacaatcggagaagtc acagtagctgcggctgcggcggcggcagcaacaactcctccctctgcttctcttttccca ccccccgccgcggcggccagccgcctttacccctccctccctcgactacccccacttccg ggtgatgcccctgccggaagcagggccgccacgggaaagagagcgactggtcacgtgaaa aggaacgcgtggtctagagccagcttctcctgccccctgctgcccccgggccgacacgca gcctcgagtgggtctttgttaggtgacattaagccaggaggatgggaactcaacatctgt acctctacaactaccacagtacccgtgtctagacgatttagtgcgcttttgaccaaggga aacatcgtgaaggtgaagctagaactgcggcaatacagctgctgctttggtgccaattcg cttcaggtacccttgaaaatattacatgaaggcggcgaatttcaaggacttctaaatgag tctttgaaagattctgttctagcaacttcaagccattttggactccgaaaactccgcaat atttcacaactgcgcatgcgtgcttggaaagcacctagccggagcaaagtttcacttata gaagggagaggagcgaacatggcagcgcgttggcggttttggtgtgtctctgtgaccatg gtggtggcgctgctcatcgtttgcgacgttccctcagcctctgcccaaagaaagaaggag gtgagaacgcggtttccagcagcatgggcttttcccaatgactggggcttaagagggtct gttcgcctcttcccagccccctttccctgccgcttctatgcctttggtggcttcaattac gcggttttcggagttgtggaattcatggtgttatctgaaaaggttagtcagctgatggaa tggactaacaaaagacctgtaataagaatgaatggagacaagttccgtcgccttgtgaaa gccccaccgagaaattactccgttatcgtcatgttcactgctctccaactgcatagacag caagctgatgaagaattccagatcctggcaaactcctggcgatactccagtgcattcacc aacaggatattttttgccatggtggattttgatgaaggctctgatgtatttcagatg >gi568815575r:77868877_78069638|GENSCAN_predicted_peptide_2|114_aa MIVNEHAAFKHLFNKAHLAPPLIHSTLSGHSTCFREHRVGDPSSASSLGIAVSLGRPVLS RSSSGTVDLLEEVGLQIRDTAFSSTKLLEAISTVSAQVEELAVKCTENARFLKT >gi568815575r:77868877_78069638|GENSCAN_predicted_CDS_2|345_bp atgattgttaacgagcatgctgccttcaagcatctgtttaacaaagcacatcttgcaccg cccttaatccattcaactctgagtggacacagcacatgtttcagagagcacagggttggg gacccttcttcagcatcttccctgggcattgctgtgagtttaggccggcccgttttgagc aggagcagcagcggaacagtagacctgctggaggaagtggggctgcagatcagagacaca gcattttcgtcaaccaaacttcttgaagccatatctacagtatcagctcaagtggaagag cttgccgtcaaatgtacggaaaatgcacgtttccttaaaacatga >gi568815575r:77868877_78069638|GENSCAN_predicted_peptide_3|254_aa MAAYKLVLIRHGESTWNLENRFSCWYDADLSPAGHEEAKRGGQALRDAGYEFDICLTSVQ KRVIRTLWTVLDAIDQMWLPVVRTWRLNERHYGGLTGLNKAETAAKHGEAQVKIWRRSYD VPPPPMEPDHPFYSNISKDRRYADLTEDQLPSYESPKDTIARALPFWNEEIVPQIKEGKR VLIAAHGNSLQGIAKHVEGLSEEAIMELNLPTGIPIVYELDKNLKPIKPMQFLGDEETVC KAIEAVAAQGKAKK >gi568815575r:77868877_78069638|GENSCAN_predicted_CDS_3|765_bp atggccgcctacaaactggtgctgatccggcacggcgagagcacatggaacctggagaac cgcttcagctgctggtacgacgccgatctgagcccggcgggccacgaggaggcgaagcgc ggcgggcaggcgctacgagatgctggctatgagtttgacatctgcctcacctcagtgcag aagagagtgatccggaccctctggacagtgctagatgccattgatcagatgtggctgcca gtggtgaggacttggcgcctcaatgagcggcactatgggggtctaaccggtctcaataaa gcagaaactgctgcaaagcatggtgaggcccaggtgaagatctggaggcgctcctatgat gtcccaccacctccgatggagcccgaccatcctttctacagcaacatcagtaaggatcgc aggtatgcagacctcacagaagatcagctaccctcctatgagagtccgaaggatactatt gccagagctctgcccttctggaatgaagaaatagttccccagatcaaggaggggaaacgt gtactgattgcagcccatggcaacagcctccagggcattgccaagcatgtggagggtctc tctgaagaggctatcatggagctgaacctgccgactggtattcccatcgtctatgaattg gacaagaacttgaagcctatcaagcccatgcagtttctgggggatgaagagacggtgtgc aaagccatagaagctgtggctgcccagggcaaggccaagaagtga >gi568815575r:77868877_78069638|GENSCAN_predicted_peptide_4|1422_aa MTGTEKGLVSGRRGGQECNEEIKMDPSMGVNSVTISVEGMTCNSCVWTIEQQIGKVNGVH HIKVSLEEKNATIIYDPKLQTPKTLQEAIDDMGFDAVIHNPDPLPVLTDTLFLTVTASLT LPWDHIQSTLLKTKGVTDIKIYPQKRTVAVTIIPSIVNANQIKELVPELSLDTGTLEKKS GACEDHSMAQAGEVVLKMKVEGMTCHSCTSTIEGKIGKLQGVQRIKVSLDNQEATIVYQP HLISVEEMKKQIEAMGFPAFVKKQPKYLKLGAIDVERLKNTPVKSSEGSQQRSPSYTNDS TATFIIDGMHCKSCVSNIESTLSALQYVSSIVVSLENRSAIVKYNASSVTPESLRKAIEA VSPGLYRVSITSEVESTSNSPSSSSLQKIPLNVVSQPLTQETVINIDGMTCNSCVQSIEG VISKKPGVKSIRVSLANSNGTVEYDPLLTSPETLRGAIEDMGFDATLSDTNEPLVVIAQP SSEMPLLTSTNEFYTKGMTPVQDKEEGKNSSKCYIQVTGMTCASCVANIERNLRREEGIY SILVALMAGKAEVRYNPAVIQPPMIAEFIRELGFGATVIENADEGDGVLELVVRGMTCAS CVHKIESSLTKHRGILYCSVALATNKAHIKYDPEIIGPRDIIHTIEFFGGWYFYIQAYKA LKHKTANMDVLIVLATTIAFAYSLIILLVAMYERAKVNPITFFDTPPMLFVFIALGRWLE HIAKGKTSEALAKLISLQATEATIVTLDSDNILLSEEQVDVELVQRGDIIKVVPGGKFPV DGRVIEGHSMVDESLITGEAMPVAKKPGSTVIAGSINQNGSLLICATHVGADTTLSQIVK LVEEAQTSKAPIQQFADKLSGYFVPFIVFVSIATLLVWIVIGFLNFEIVETYFPGYNRSI SRTETIIRFAFQASITVLCIACPCSLGLATPTAVMVGTGVGAQNGILIKGGEPLEMAHKV KVVVFDKTGTITHGTPVVNQVKVLTESNRISHHKILAIVGTAESNSEHPLGTAITKYCKQ ELDTETLGTCIDFQVVPGCGISCKVTNIEGLLHKNNWNIEDNNIKNASLVQIDASNEQSS TSSSMIIDAQISNALNAQQYKVLIGNREWMIRNGLVINNDVNDFMTEHERKGRTAVLVAV DDELCGLIAIADTVKPEAELAIHILKSMGLEVVLMTGDNSKTARSIASQVGITKVFAEVL PSHKVAKVKQLQEEGKRVAMVGDGINDSPALAMANVGIAIGTGTDVAIEAADVVLIRNDL LDVVASIDLSRKTVKRIRINFVFALIYNLVGIPIAAGVFMPIGLVLQPWMGSAAMAASSV SVVLSSLFLKLYRKPTYESYELPARSQIGQKSPSEISVHVGIDDTSRNSPKLGLLDRIVN YSRASINSLLSDKRSLNSVVTSEPDKHSLLVGDFREDDDTAL >gi568815575r:77868877_78069638|GENSCAN_predicted_CDS_4|4269_bp atgactggcacagagaagggtttggtgagtggtagaagaggaggccaggaatgtaatgag gaaatcaaaatggatccaagtatgggtgtgaattctgttaccatttctgttgagggtatg acttgcaattcctgtgtttggaccattgagcagcagattggaaaagtgaatggtgtgcat cacattaaggtatcactggaagaaaaaaatgcaactattatttatgaccctaaactacag actccaaagaccctacaggaagctattgatgacatgggctttgatgctgttatccataat cctgaccctctccctgttttaactgacaccttgtttctgactgttacggcgtcactgact ttgccatgggaccatatccaaagcacattgctgaagaccaagggtgtgacagacattaaa atttaccctcagaaaagaactgtagcagtgacaataatcccttctatagtgaatgccaat cagataaaagagctggttccagaactcagtttagatactgggacactggagaaaaagtca ggagcttgtgaagatcatagtatggctcaagctggtgaagtcgtgctgaagatgaaagtg gaagggatgacctgccattcatgtactagcactattgaaggaaaaattgggaaactgcaa ggtgttcagcgaattaaagtctccctggacaatcaagaagctactattgtttatcaacct catcttatctcagtagaggaaatgaaaaagcagattgaagctatgggctttccagcattt gtcaaaaagcagcccaagtacctcaaattgggagctattgatgtagaacgtctaaagaac acaccagttaaatcctcagaagggtcacagcaaaggagtccatcatataccaatgattca acagccactttcatcattgatggcatgcattgtaaatcatgtgtgtcaaatattgaaagt actttatctgcactccaatatgtaagcagcatagtagtttctttagagaataggtctgcc attgtgaagtataatgcaagctcagtcactccagaatccctgagaaaagcaatagaggct gtatcaccggggctatatagagttagtatcacaagtgaagttgagagtacctcaaactct ccctccagctcatctcttcagaagattcctttgaatgtagttagccagcctctgacacaa gaaactgtgataaacattgatggcatgacttgtaattcctgtgtgcagtctattgagggt gtcatatcaaaaaagccaggtgtaaaatccatacgagtctcccttgcaaatagcaatggg actgttgagtatgatcctctactaacctctccagaaacgttgagaggagcaatagaagac atgggatttgatgctaccttgtcagacacgaatgagccgttggtagtaatagctcagcct tcatcggaaatgccgcttttgacttcaactaatgaattttatactaaagggatgacacca gttcaagacaaggaggaaggaaagaattcatctaagtgttacatacaggtcactggcatg acttgcgcttcctgtgtagcaaacattgaacggaatttaaggcgggaagaaggaatatat tctatacttgtggccctgatggctggcaaggcagaagtaaggtataatcctgctgttata caacccccaatgatagcagagttcatccgagaacttggatttggagccactgtgatagaa aatgctgatgaaggagatggtgttttggaacttgttgtgaggggaatgacgtgtgcctcc tgcgtacataaaatagagtctagtctcacaaaacacagagggatcctatactgctccgtg gccctggcaaccaacaaagcacatattaaatatgacccagaaattattggtcctagagat attatccatacaattgaatttttcggaggctggtacttctacattcaggcttataaagca ctgaagcataagacagcaaatatggacgtactgattgtgctggcaaccaccattgcattt gcctactctttgattattcttctagttgcaatgtatgagagagccaaagtgaaccctatt actttctttgacacaccccctatgctgtttgtgtttattgcactaggccgatggctggaa catatagcaaagggcaaaacatcagaggctcttgcaaagttaatttcactacaagctaca gaagcaactattgtaactcttgattctgataatatcctcctcagtgaagaacaagtggat gtggaacttgtacaacgtggagatatcattaaagtagttccaggaggcaaatttccagtg gatggtcgtgttattgaaggacattctatggtagatgagtccctcatcacaggggaggca atgcctgtggctaagaaacctggcagcacagtgattgctggttccattaaccagaacggg tcactgcttatctgcgcaacacatgttggagcagacacaaccctttctcaaattgtcaaa cttgtggaagaggcacaaacatcaaaggctcctatccagcagtttgcagacaaactcagt ggctattttgttccttttattgtttttgtttccattgccaccctcttggtatggattgta attggatttctgaattttgaaattgtggaaacctactttcctggctacaatagaagtatc tcccgaacagaaacgataatacgatttgctttccaagcctctatcacagttctgtgtatt gcatgtccctgttcactgggactggccactccaactgctgtgatggtgggtacaggagta ggtgctcaaaatggcatactaataaaaggtggagagccattggagatggctcataaggta aaggtagtggtatttgataagactggaaccattactcacggaaccccagtggtgaatcaa gtaaaggttctaactgaaagtaacagaatatcacaccataaaatcttggccattgtggga actgctgaaagtaacagtgaacaccctctaggaacagccataaccaaatattgcaaacag gagctggacactgaaaccttgggtacctgcatagatttccaggttgtgccaggctgtggt attagctgtaaagtcaccaatattgaaggcttgctacataagaataactggaatatagag gacaataatattaaaaatgcatccctggttcaaattgatgccagtaatgaacagtcatca acttcgtcttccatgattattgatgcccagatctcaaatgctcttaatgctcagcagtat aaagtcctcattggtaaccgggagtggatgattagaaatggtcttgtcattaataacgat gtaaatgatttcatgactgaacatgagagaaaaggtcggactgctgtattagtagcagtt gatgatgagctgtgtggcttgatagccattgcagacacagtgaagcctgaagcagaactg gctatccatattctgaaatctatgggcttagaagtagttctgatgactggagacaacagt aaaacagctagatctattgcttctcaggttggcattactaaggtgtttgctgaagttcta ccttctcacaaggttgctaaagtgaagcaacttcaagaggaggggaaacgggtagcaatg gtgggagatggaatcaatgactccccagctctggcaatggctaatgtgggaattgctatt ggcacaggcacagatgtagccattgaagcagctgatgtggttttgataaggaatgatctt ctggatgtagtggcaagtattgacttatcaagaaagacagtcaagaggattcggataaat tttgtctttgctctaatttataatctggttggaattcccatagctgctggagtttttatg cccattggtttggttttgcagccctggatgggatctgcagcaatggctgcttcatctgtt tctgtagtactttcttctctcttccttaaactttacaggaaaccaacttacgagagttat gaactgcctgcccggagccagataggacagaagagtccttcagaaatcagcgttcatgtt ggaatagatgatacctcaaggaattctcctaaactgggtttgctggaccggattgttaat tatagcagagcctctataaactcactactgtctgataaacgctccctaaacagtgttgtt accagtgaacctgacaagcactcactcctggtgggagacttcagggaagatgatgacact gcattataa >gi568815575r:77868877_78069638|GENSCAN_predicted_peptide_5|119_aa MVEGEGEAGTSYMAEGEGEARYLLHKVKDGTFAGTAPKTGPEPAEQPQELPGSKKSGGIC NGQKSVVSGTSNINITWEIDENSSSWAALQTSESTASTVEPSNSQFNKPSSMILIHAIV >gi568815575r:77868877_78069638|GENSCAN_predicted_CDS_5|360_bp atggtggaaggtgaaggggaagcaggtacgtcttacatggcagaaggcgaaggagaagca agatatcttcttcacaaggtgaaggatggcacatttgctggaactgctcccaaaacaggg cctgaaccagctgagcaaccccaagagcttccagggagcaagaaatctggaggaatctgt aatgggcagaagagtgtggtctctggaaccagtaacatcaacatcacctgggaaattgat gaaaattcaagttcttgggctgctctccagacaagtgaatctacagcttctacggtggag cccagcaattcacagttcaacaagccttccagcatgattctgattcatgctattgtttga