GENSCAN 1.0 Date run: 7-Nov-116 Time: 04:39:59 Sequence gi568815595r:126003794_126260979 : 257186 bp : 45.72% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.11 PlyA - 1887 1882 6 1.05 1.10 Term - 3432 3223 210 0 0 92 36 128 0.972 5.29 1.09 Intr - 5087 4939 149 0 2 120 115 189 0.977 24.75 1.08 Intr - 8956 8822 135 2 0 65 94 69 0.981 5.74 1.07 Intr - 11780 11701 80 1 2 102 76 47 0.717 4.09 1.06 Intr - 13082 12938 145 0 1 125 99 212 0.999 25.34 1.05 Intr - 19139 18993 147 0 0 127 98 158 0.997 20.91 1.04 Intr - 19583 19383 201 0 0 56 42 102 0.742 1.66 1.03 Intr - 22686 22542 145 0 1 119 80 164 0.923 18.56 1.02 Intr - 29885 29814 72 2 0 119 109 128 0.983 17.60 1.01 Init - 35313 35230 84 0 0 90 81 49 0.911 5.22 1.00 Prom - 36075 36036 40 -4.46 2.05 PlyA - 40297 40292 6 1.05 2.04 Term - 46988 46953 36 2 0 112 42 35 0.773 -1.36 2.03 Intr - 47257 47114 144 1 0 95 64 193 0.780 18.08 2.02 Intr - 52720 52565 156 1 0 74 51 153 0.096 10.41 2.01 Init - 64426 64154 273 1 0 72 110 141 0.202 11.68 2.00 Prom - 64685 64646 40 -8.46 3.00 Prom + 64730 64769 40 -5.26 3.01 Init + 64896 64960 65 2 2 69 36 24 0.030 -4.02 3.02 Intr + 66311 66468 158 2 2 68 105 66 0.734 5.95 3.03 Intr + 70426 70821 396 2 0 74 49 164 0.748 5.65 3.04 Intr + 74521 74742 222 2 0 66 35 158 0.474 6.40 3.05 Intr + 78289 78683 395 2 2 111 29 144 0.308 5.17 3.06 Intr + 79341 79410 70 2 1 44 27 87 0.456 -3.05 3.07 Intr + 80570 80651 82 0 1 53 70 135 0.323 6.90 3.08 Intr + 87284 87351 68 2 2 90 83 54 0.114 3.65 3.09 Term + 91396 91559 164 2 2 52 47 79 0.082 -1.70 3.10 PlyA + 91831 91836 6 1.05 4.31 PlyA - 92858 92853 6 1.05 4.30 Term - 100053 99998 56 1 2 93 49 122 0.442 6.52 4.29 Intr - 102132 101933 200 2 2 93 59 383 0.547 34.89 4.28 Intr - 103453 103348 106 2 1 98 45 148 0.982 10.77 4.27 Intr - 103669 103624 46 1 1 68 75 31 0.742 -2.22 4.26 Intr - 104985 104757 229 2 1 83 74 72 0.298 3.17 4.25 Intr - 105721 105586 136 2 1 80 36 60 0.538 -0.37 4.24 Intr - 106316 106151 166 2 1 33 109 202 0.902 16.33 4.23 Intr - 109087 108989 99 1 0 38 103 66 0.903 3.41 4.22 Intr - 110863 110764 100 0 1 101 77 155 0.947 15.81 4.21 Intr - 114305 114212 94 0 1 129 100 98 0.996 14.12 4.20 Intr - 120658 120571 88 1 1 73 99 54 0.950 4.54 4.19 Intr - 121928 121823 106 1 1 41 99 140 0.797 10.62 4.18 Intr - 126500 126430 71 2 2 148 92 75 0.998 11.88 4.17 Intr - 127741 127591 151 0 1 86 75 358 0.985 34.46 4.16 Intr - 131869 131742 128 1 2 69 75 167 0.927 12.98 4.15 Intr - 133090 132971 120 1 0 77 65 203 0.999 17.59 4.14 Intr - 134167 134020 148 0 1 116 83 264 0.996 28.94 4.13 Intr - 143133 143042 92 0 2 76 94 69 0.914 5.09 4.12 Intr - 146738 146613 126 2 0 14 94 100 0.701 4.08 4.11 Intr - 149788 149651 138 1 0 87 91 134 0.998 14.26 4.10 Intr - 150850 150761 90 1 0 95 89 99 0.990 10.89 4.09 Intr - 151710 151609 102 0 0 53 113 76 0.875 6.97 4.08 Intr - 153715 153550 166 0 1 92 77 322 0.999 31.46 4.07 Intr - 154846 154612 235 2 1 84 85 227 0.993 18.75 4.06 Intr - 157209 157060 150 1 0 120 92 124 0.930 16.13 4.05 Intr - 181523 181444 80 1 2 111 66 22 0.015 1.49 4.04 Intr - 208181 208028 154 0 1 76 78 107 0.179 7.63 4.03 Intr - 211883 211793 91 2 1 60 97 12 0.027 -1.13 4.02 Intr - 228695 228552 144 2 0 25 91 63 0.021 0.78 4.01 Init - 234604 234413 192 1 0 82 58 140 0.092 9.37 4.00 Prom - 250665 250626 40 -1.36 5.02 PlyA - 250962 250957 6 1.05 5.01 Term - 255941 255795 147 2 0 51 48 191 0.974 9.30 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 234604 234317 288 1 0 82 29 197 0.884 8.80 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:126003794_126260979|GENSCAN_predicted_peptide_1|455_aa MVPASASGEGFRKLPIIVEAEGEPVCHMANTGQIDDPQEQHRVISSNLALIQVQATVVGL LAAVAALLLGVVSREEVDVAKVELLCASSVLTAFLAAFALAVSIWMSASQRMPVNTGGHE LTPRGSIAPSIFGAALMQHGAVGCLSLWFLVIWELGSHGSLLSIMCTGVLMVCIVIGARK LGVNPDNIATPIAASLGDLITLSILALVSSFFYRHKDSRYLTPLVCLSFAALTPVWVLIA KQSPPIVKILKFGWFPIILAMVISSFGGLILSKTVSKQQYKGMAIFTPVICGVGGNLVAI QTSRISTYLHMWSAPGVLPLQMKKFWPNPCSTFCTSEINSMSARVLLLLVVPGHLIFFYI IYLVEGQSVINSQTFVVLYLLAGLIQVTILLYLAEVMVRLTWHQALDPDNHCIPYLTGLG DLLGTGLLALCFFTDWLLKSKAELGGISELASGPP >gi568815595r:126003794_126260979|GENSCAN_predicted_CDS_1|1368_bp atggtgccagcatctgcttctggtgagggctttaggaagcttccaattattgtggaagct gaaggggagccagtgtgtcacatggccaacactggacaaattgatgacccccaggagcag cacagagtcatcagcagcaacctggccctcatccaggtgcaggccactgtcgtggggctc ttggctgctgtggctgcgctgctgttgggcgtggtgtctcgagaggaagtggatgtcgcc aaggtggagttgctgtgtgccagcagtgtcctcactgccttccttgcagcctttgccctg gctgtttccatctggatgtctgcttctcagaggatgccagtaaacacaggaggccatgag ctcaccccgaggggttccattgctccgagcatctttggagcagccctcatgcagcatggg gctgtgggctgcctgtctctgtggttcctggtgatttgggagctcgggagccacgggtca ctccttagcatcatgtgcacaggggtgctgatggtctgtatagtgattggtgctcgaaag ctcggggtcaacccagacaacattgccacgcccattgcagccagcctgggagacctcatc acactgtccattctggctttggttagcagcttcttctacagacacaaagatagtcggtat ctgacgccgctggtctgcctcagctttgcggctctgaccccagtgtgggtcctcattgcc aagcagagcccacccatcgtgaagatcctgaagtttggctggttcccaatcatcctggcc atggtcatcagcagtttcggaggactcatcttgagcaaaaccgtttctaaacagcagtac aaaggcatggcgatatttacccccgtcatatgtggtgttggtggcaatctggtggccatt cagaccagccgaatctcaacctacctgcacatgtggagtgcacctggcgtcctgcccctc cagatgaagaaattctggcccaacccgtgttctactttctgcacgtcagaaatcaattcc atgtcagctcgagtcctgctcttgctggtggtcccaggccatctgattttcttctacatc atctacctggtggagggtcagtcagtcataaacagccagacctttgtggtgctctacctg ctggcaggcctgatccaggtgacaatcctgctgtacctcgcagaagtgatggttcggctg acttggcaccaggccctggatcctgacaaccactgcatcccctaccttacagggctgggg gacctgctcggtactggcctcctggcactctgctttttcactgactggctactgaagagc aaggcagagctgggtggcatctcagaactggcatctggacctccctaa >gi568815595r:126003794_126260979|GENSCAN_predicted_peptide_2|202_aa MDGTETRQRRLDSCGKPGELGLPHPLSTGGLPVASEDGALRAPESQSVTPKPLETEPSRE TTWSIGLQVTVPFMFAGLGLSWAGMLLDYFQGKKLRGFSCELTRSPHGVLPESFFTIMCQ VVVPILLSGLCMMTAGLVMNTIQHWPVFVEVKDLLTLVPPLVGLKGNLEMTLASRLSTAV SGHLDIRGGHNPNALAFHYQTG >gi568815595r:126003794_126260979|GENSCAN_predicted_CDS_2|609_bp atggatgggacagagacccggcagcggaggctggacagctgtggcaagccaggggagctg gggcttcctcaccccctcagcacaggaggactccctgtagcctcagaagatggagctctc agggcccctgagagccaaagcgtgacccccaagccactggagactgagcctagcagggag accacctggtccataggccttcaggtgaccgtgcccttcatgtttgcaggcctgggactg tcctgggccggcatgcttctggactatttccagggcaagaagctgcgaggcttcagctgt gagcttacccggtccccacatggggtcttgcctgaatctttcttcaccatcatgtgccag gtcgtggtgcccattctgctgtccggcttgtgcatgatgacagccggcctggtgatgaac accattcaacactggcctgtgtttgtggaggtgaaagaccttttgacattggtgccgccc ctggtgggcctgaaggggaacctggagatgacactggcatccagactctccacagctgta agtggacacctggacattcgaggcggccacaaccccaatgccttggctttccactaccaa actggatga >gi568815595r:126003794_126260979|GENSCAN_predicted_peptide_3|539_aa MPGPTTLSSSALALKIDSLQLQAPRIPSRVNSERAHQDLPRILEAGREQQLMMYKRSFRA GVPNPRWELGHTAEDIKTRILTTDKEGYLIMIRGIEGNVLILIKGICEKPKANKLSESLK VFPLRLGTRQESLFLPILLNTVPEVLASTIQQGKEMRRIQLEKEEVKLSLFTDDIILYVK NHKGSTYTQTITASKQVQQGVGLEDAAMLVKLSQAHTGSLEIMERQLVEKKGEEEDLDPA HTGRNPYHTELSYRPALQAPLCFSGWKPQNHEEEGCLAEQGWGISQLWTWPKAPQLHLNV GVFARRTCSNVPVSLTARADEQPSTGWRRLLWESGLWARKPLIPEPQGGAKMPEDASGGI ASKVPTEDIGQCLHMVLVVPTGGDTGQPPKTKNCPTPNVSSVEAEKPCYGAVFRLLKALR FQINGQNHEEDLLKTTGAPLAPPRERYGRGAYAAAGFIMELSRATWNSKESKKSTFELGL TDLYEGQNQQPIWIPSRHLKPYHEPDAKEEIPGGSQGPPVAAVLRLMLRRTPTVTSNTC >gi568815595r:126003794_126260979|GENSCAN_predicted_CDS_3|1620_bp atgcctggtcccactaccctctccagctctgccctagccttgaaaattgacagcctgcaa ctgcaagctccacgaattccaagtagagtaaactccgagagagcacaccaagacctgcca agaatcttagaagcaggaagagagcagcaactcatgatgtacaagagatcctttagagca ggggtccccaacccccgttgggaactgggccacacagcagaagacattaagacaagaatt cttaccacagacaaagaaggatatttgataatgataagaggaatagaagggaatgttctc atcctgattaagggcatctgtgagaaacccaaagctaacaaacttagcgagagtctgaaa gttttcccactaagattaggaacaagacaagaatccctgtttttgccaattctattaaac actgtacctgaggttctagccagcacaattcagcaaggaaaagaaatgagacgtatccag cttgaaaaggaagaagtaaaattatccctgtttacagacgacataatcttatacgtaaaa aatcataagggatccacatacacacaaactattacagccagtaaacaagttcagcaaggt gtaggactggaggacgcagcaatgctggtaaaactgagccaggctcacactggaagcctg gagatcatggagagacagctagtggagaagaaaggagaggaggaggacttggaccctgcc cacactgggaggaacccgtaccacacagagctgagctatcggccggctctccaggcccca ctctgcttctctgggtggaagccccagaaccatgaggaggaaggatgcttggcagaacag gggtgggggatctcccagctgtggacatggccaaaggcccctcagctccacctgaatgtt ggagtgtttgcaaggagaacatgctccaatgtccctgtgtcactgacagctcgtgcagat gaacaaccaagcacaggctggaggaggctcctgtgggaaagtgggctgtgggccaggaaa cctctaatcccagagccccagggaggagccaagatgccagaagatgccagtggaggcatt gcctccaaggtccccacagaggacattgggcaatgtctgcacatggttttggttgtcccg acagggggtgatacaggacagccccccaaaacaaagaactgccccaccccaaatgtcagt agtgtggaggccgagaaaccctgctatggggcagtgttcagactcctcaaagccctaagg tttcagatcaatggtcagaaccacgaggaggacctgctaaagacaactggggccccactg gcgcccccgcgggagcgctacgggagaggcgcctatgcggctgcaggttttattatggag ctgtcaagggccacatggaactccaaggagtccaagaagtctacatttgaacttggcctt acagatctctatgaaggccaaaatcaacagccgatttggataccatcaagacacctgaaa ccttatcatgagccagatgccaaggaagagattccaggaggatcccaaggaccccctgtt gcagccgtgttgaggctgatgctcaggaggaccccaactgtcacaagcaacacctgttga >gi568815595r:126003794_126260979|GENSCAN_predicted_peptide_4|1267_aa MGPPSSRPQNGRSTHSLHCAPGKATDTQCQPMKAAKRGAVPHKDTGAELLKAMEAHLLHQ HNLDTKTFLTRKENYKPIFLMNMDVKLLNKILANQIRQCTELYTTCKWNLFQSPLQVQAT LHARIQLSALDLGTSEGLPQWKGVSIGSISPVFDILTYRPLTLQPHHGSVSSVNYLETCQ GYKSCGPDFLAQQPEFLGAMCQPGAGTSQFAWGCKHATPPGPSNPPATMKIAVIGQSLFG QEVYCHLRKEGHEVVGVFTVPDKDGKADPLGLEAEKDGVPVFKYSRWRAKGQALPDVVAK YQALGAELNVLPFCSQFIPMEIISAPRHGSIIYHPSLLPRHRGASAINWTLIHGDKKGGF SIFWADDGLDTGDLLLQKECEVLPDDTVSTLYNRFLFPEGIKGMVQAVRLIAEGKAPRLP QPEEGATYEGIQKKETAKINWDQPAEAIHNWIRGNDKVPGAWTEACEQKLTFFNSTLNTS GLVPEGDALPIPGAHRPGVVTKAGLILFGNDDKMLLVKNIQLEDGKMILASNFFKGAASS VLELTEAELVTAEAVRSVWQRILPKVLEVEDSTDFFKSGAASVDVVRLVEEVKELCDGLE LENEDVYMASTFGDFIQLLVRKLRGDDEEGECSIDYVEMAVNKRTVRMPHQLFIGGEFVD AEGAKTSETINPTDGSVICQVSLAQVTDVDKAVAAAKDAFENGRWGKISARDRGRLMYRL ADLMEQHQEELATIEALDAGAVYTLALKTHVGMSIQTFRYFAGWCDKIQGSTIPINQARP NRNLTLTRKEPVGVCGIIIPWNYPLMMLSWKTAACLAAGNTVVIKPAQVTPLTALKFAEL TLKAGIPKGVVNVLPGSGSLVGQRLSDHPDVRKIGFTGSTEVGKHIMKSCAISNVKKVSL ELGGKSPLIIFADCDLNKAVQMGMSSVFFNKGENCIAAGRLFVEDSIHDEFVRRVVEEVR KMKVGNPLDRDTDHGPQNHHAHLVKLMEYCQHGVKEGATLVCGGNQVPRPGSLPCSSSDP TRYRPPGSTPPRAPDRTASGTGHFHLDVPQALQTQRGFSCFLGLHAFAAARLSGGFPVSP PGELLLLQLTGHPSPGMPVSQGIGHRTPQQSSRDHCTAAVTVHLPSSHSRWTHSVKSTVR PTSKNKPGFFFEPTVFTDVEDHMFIAKEESFGPVMIISRFADGDLDAVLSRANATEFGLA SGVFTRDINKALYVSDKLQAGTVFVNTYNKTDVAAPFGGFKQSGFGKDLGEAALNEYLRV KTVTFEY >gi568815595r:126003794_126260979|GENSCAN_predicted_CDS_4|3804_bp atggggccaccatcgtccagaccccagaatggtagatccactcacagcttgcattgtgca cctggaaaagccacagacactcaatgccagcccatgaaagcagccaagaggggagctgta ccccacaaagacacaggggcagagctgctcaaggccatggaagcccacctcttgcatcag cacaacttggatacaaagactttccttacaagaaaagaaaactacaaaccaatatttctc atgaatatggatgtaaagctcctcaacaaaatattagcaaatcaaatccgacagtgcaca gaactatacaccacctgcaagtggaatttattccagagtccactgcaggtccaggcgact ctccatgcaaggattcagctgtcagctcttgatcttgggacttcagagggactgccacag tggaaaggtgtatctatagggtcaatctcccctgtttttgacatcctgacctaccgcccg ctgaccctccagccccaccatggcagtgtatccagtgtcaactacctggagacttgccag ggctataagtcttgtggccctgacttcctggctcagcagcctgagtttctgggggccatg tgccagccaggtgctggcaccagccagtttgcatgggggtgcaaacacgcaacaccccca ggtccttccaaccctcctgctaccatgaagattgcagtgattggacagagcctgtttggc caggaagtttactgccacctgaggaaggagggccacgaagtggtgggtgtgttcactgtt ccagacaaggatggaaaggccgaccccctgggtctggaagctgagaaggatggagtgccg gtattcaagtactcccggtggcgtgcaaaaggacaggctttgcctgatgtggtggcaaaa taccaggctttgggggccgagctcaacgtcctgcccttctgcagccaattcatccccatg gagataatcagtgccccccggcatggctccatcatctatcacccgtcactgctccctagg caccgaggggcctcggccatcaactggaccctcattcacggagataagaaaggggggttt tccatcttctgggcggatgatggtctggacaccggagacctgctgctgcagaaggagtgt gaggtgctcccggacgacaccgtgagcacgctgtacaaccgcttcctcttccctgaaggc atcaaagggatggtgcaggccgtgaggctgatcgctgagggcaaagcccccagactccct cagcctgaggaaggagccacctatgaggggattcagaagaaggagacagccaagatcaac tgggaccagccggcagaggccattcacaactggatccgcgggaacgacaaggtgccggga gcctggacagaggcctgtgaacagaaactgacatttttcaactcaacgctgaacacttca ggcctggtgcccgagggagacgctttgcccatcccaggagcccatcggccaggggtggtc accaaagcaggactcatcctctttgggaatgatgacaaaatgctgctggtgaagaatatt cagctggaggatggcaaaatgatcctggcctcgaacttctttaagggggcagccagcagt gtccttgagctgacagaggcagagctggttactgcggaggctgtgcggagtgtttggcag cggatcctccccaaagtcctggaggttgaagactccactgatttcttcaagtcaggggcc gcgtctgtggacgttgtgaggctggtggaggaagtgaaggagctgtgtgatggcctggag ttagaaaatgaagatgtgtacatggcatccacctttggggacttcatccagctgttagtg aggaagctgcgaggggacgatgaggagggcgagtgcagcattgactacgtggaaatggca gtgaacaagcgcactgtccgcatgccccaccagctcttcattgggggggagttcgtggat gccgagggcgccaagacctctgagaccatcaatcccaccgatggaagtgtcatctgccag gtatccctggcccaagtcaccgacgtcgacaaggcagtggccgcagccaaggatgccttt gagaatggacggtgggggaagatcagtgcgcgggaccggggccggctgatgtacaggttg gcagatctcatggagcagcaccaggaggagctggccaccattgaggccctggatgcgggt gccgtctacacgctggccctgaagacccacgtgggcatgtccatccagaccttccgctac tttgctggctggtgtgacaagatccagggctccaccatccccatcaaccaggccagaccc aaccgcaacctgaccttgaccaggaaggagcctgttggggtttgtggcatcatcatcccc tggaactatcccctgatgatgctgtcctggaagacagctgcctgcctggctgccgggaac acagtggtgatcaagcctgctcaggtgaccccactcacagccttgaagtttgcagagctg acattaaaggccggcattcccaaaggtgtggttaacgtcctcccaggatctggctccctg gtcggccagagactctcagaccatcctgatgtgaggaaaatcgggttcacaggctccaca gaggtgggcaagcacatcatgaaaagctgtgccataagtaacgtgaagaaggtgtccctg gaactgggcgggaagtcacccctcatcatctttgctgactgtgacctcaacaaggctgtg cagatggggatgagttctgttttcttcaacaaaggagagaattgcattgcagcaggccga ctctttgtggaggactccattcatgatgagttcgtgcggagagtggtagaagaggtgcgg aagatgaaggtgggcaacccgctggacagggacaccgaccacgggccgcagaatcaccat gcccaccttgtgaagctgatggagtactgccagcatggcgtgaaggaaggggccacactg gtctgcggcgggaatcaggtccctcggccaggatctctcccctgcagcagctcagaccct accagatacaggcctccaggctccaccccacccagagctcctgaccgcacagccagcggc actggacacttccacctggatgtcccccaggcccttcaaactcaaagaggcttctcctgc tttcttgggctccatgcatttgctgctgctcggcttagtggaggctttccagtgtcccca cctggagagctcctgcttcttcagctcacaggacatccaagcccgggcatgcccgtgtca cagggcatcgggcaccgcacacctcaacagagctctcgtgaccattgcactgctgctgtc actgtccacttaccctcttcccactcccgctggactcactccgttaaaagcacggtgaga cccacatcaaagaacaagccagggttcttctttgagccaactgttttcacagacgtggaa gaccacatgttcatagccaaggaggagtccttcgggcctgtcatgatcatctctcggttt gctgatggggacttggatgccgtgctgtctcgggccaatgccacggaatttggcctggct tctggtgtcttcaccagggacatcaacaaggccctgtatgtcagtgacaagctccaggca ggcactgtgtttgtcaacacgtacaacaagaccgacgtggccgctcccttcggaggattc aaacagtctggatttggcaaagatctaggagaggcggctctgaacgagtacctgcgggtc aagacagtgaccttcgaatactga >gi568815595r:126003794_126260979|GENSCAN_predicted_peptide_5|48_aa NLSHGGTAVSPDPICKSFQKTGIGAISSPPLPPAGQAVGPVQDLKAIR >gi568815595r:126003794_126260979|GENSCAN_predicted_CDS_5|147_bp aacctgagccatgggggcacggccgtatcccccgaccccatctgcaagagtttccagaag accgggattggggccatcagcagccctccactgccccccgctggtcaggctgtggggccc gtgcaggacctgaaggccattagatag