GENSCAN 1.0	Date run: 4-Nov-116	Time: 13:41:56

Sequence gi568815597r:200544257_200766740 : 222484 bp : 39.93% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Sngl +    512    775  264  1  0   69   55   139 0.511   3.65
 1.02 PlyA +   1009   1014    6                               1.05

 2.34 PlyA -   1510   1505    6                               1.05
 2.33 Term -   9511   9132  380  2  2   96   42   304 0.975  20.57
 2.32 Intr -  10350  10212  139  0  1   57   86   101 0.997   5.92
 2.31 Intr -  15196  15074  123  1  0   82   94    73 0.992   7.16
 2.30 Intr -  16624  16466  159  1  0  106   91   102 0.999  11.56
 2.29 Intr -  20997  20813  185  1  2   83   99    67 0.993   5.89
 2.28 Intr -  21413  21189  225  0  0   39   67   116 0.282   1.63
 2.27 Intr -  25749  25655   95  2  2   93   68   -10 0.397  -3.81
 2.26 Intr -  31435  31335  101  1  2   45   88   129 0.660   6.59
 2.25 Intr -  36127  35998  130  0  1  111   20    91 0.899   4.38
 2.24 Intr -  37038  36945   94  1  1   45  109   102 0.971   6.10
 2.23 Intr -  41971  41845  127  1  1  -12  111   124 0.988   4.03
 2.22 Intr -  45113  44961  153  2  0   49   91    96 0.976   5.35
 2.21 Intr -  46016  45869  148  1  1   61   58   175 0.999  11.12
 2.20 Intr -  47984  47824  161  2  2   78   98   120 0.999   9.86
 2.19 Intr -  49513  49411  103  0  1   93  103    38 0.995   5.06
 2.18 Intr -  54165  53981  185  0  2   69   76   136 0.921   8.16
 2.17 Intr -  55857  55794   64  2  1   52   67    58 0.920  -2.10
 2.16 Intr -  56247  56100  148  1  1   78   75   213 0.995  17.37
 2.15 Intr -  57812  57640  173  1  2   70   53   193 0.588  12.66
 2.14 Intr -  62542  62490   53  1  2  113  101    47 0.968   5.29
 2.13 Intr -  64672  64574   99  1  0   97   63    69 0.958   4.69
 2.12 Intr -  70149  70062   88  2  1   71   87    34 0.948   0.65
 2.11 Intr -  71353  71099  255  0  0   73  102   185 0.994  14.04
 2.10 Intr -  74396  73356 1041  1  0   67   87   772 0.018  63.65
 2.09 Intr -  76600  76155  446  1  2  -81   76   368 0.007   9.97
 2.08 Intr -  80791  80681  111  1  0   66   43    88 0.001   1.76
 2.07 Intr - 104311 104183  129  1  0   55  110   102 0.994   9.07
 2.06 Intr - 104970 104818  153  0  0   95   42    62 0.755   1.65
 2.05 Intr - 106329 106169  161  1  2   73   82    90 0.307   5.69
 2.04 Intr - 109869 109830   40  0  1   73   94    23 0.372  -1.72
 2.03 Intr - 114860 114771   90  2  0  107  100    39 0.851   6.27
 2.02 Intr - 119830 119663  168  1  0   92  121   107 0.999  13.62
 2.01 Init - 122484 121681  804  0  0   33  107   701 0.959  61.25
 2.00 Prom - 133048 133009   40                              -4.35

 3.00 Prom + 139378 139417   40                              -4.95
 3.01 Init + 166367 166424   58  1  1   78   68    92 0.320   7.72
 3.02 Intr + 193713 193897  185  1  2   33   65   175 0.133   8.29
 3.03 Intr + 194664 194931  268  2  1   59   50   169 0.012   6.38
 3.04 Intr + 195624 195710   87  1  0   32   92    99 0.017   3.72
 3.05 Intr + 197843 197974  132  0  0   85   64    36 0.319   0.60
 3.06 Intr + 209523 209663  141  1  0  104   65    51 0.236   3.80
 3.07 Intr + 216583 216842  260  2  2   22  101   261 0.924  17.06

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  74467  73356 1112  1  2   71   87   777 0.968  69.74
S.002 Sngl +  76140  76493  354  2  0   68   42   246 0.834  13.80
S.003 Init + 195572 195710  139  1  1   78   92   169 0.901  16.65

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:200544257_200766740|GENSCAN_predicted_peptide_1|87_aa
MTLELKPIGSEMEPRGFLEEERSPQRQQQVCCLISFQQELILEERDIEHGLWPRNSGSEI
LYCKAKRPHFPHLRNGDHNGTHLLGLL

>gi568815597r:200544257_200766740|GENSCAN_predicted_CDS_1|264_bp
atgacacttgagttaaaacctataggaagtgagatggagccacggggattcctggaggaa
gagcgttccccgcagaggcagcagcaagtgtgctgcttaattagcttccaacaggaactt
attttagaagaaagggatatagagcatggactttggcccaggaattctggttctgagatt
ctgtactgcaaagccaagaggcctcattttcctcatctgcgaaatggggatcataatggc
actcaccttctggggttattgtga

>gi568815597r:200544257_200766740|GENSCAN_predicted_peptide_2|2176_aa
MFVPRSLKIKRNANDDGKSCVAKIIKPDPEDLQLDKSRDVPVDAVATEAATIDRHISESC
PFPSPGGQLAEVHSVSPEQGAKDSHPSEEPVKSFSKTQRWAEPGEPICVVCGRYGEYICD
KTDEDVCSLECKAKHLLQVKEKEEKSKLSNPQKADSEPESPLNASYVYKEHPFILNLQED
QIENLKQQLGILVQGQEVTRPIIDFEHCSLPEVLNHNLKKSGYEVPTPIQMQMIPVGLLG
RDILASADTGSGKTAAFLLPVIMRALFESKTPSALILTPTRELAIQIERQAKELMSGLPR
MKTVLLVGGLPLPPQLYRLQQHVKVIIATPGRLLDIIKQSSVELCGVKIVVVDEVILIFR
EVWATLTTTIPTSIEQLASQLLHNPVRIITGEKNLPCANVRQIILWVEDPAKKKKLFEIL
NDKKLFKPPVLVFVDCKLGADLLSEAVQKITGLKSISIHSEKSQIERKNILKGLLEGDYE
VVVSTGVLGRGLDLISVRLVVNFDMPSSMDEYVHQENTYKSTWRNPQHFQQDVRMTLGYV
GKAQWEEDNQLKRRLPEPNHRHFRGAGGSKGAAPDATPSSQSEGGAPIQIAAFWGAGAGG
GEWRGDCVEGVPKVPDPAEGRSFGTGEGVRTLSLQATTALRAGGLSSSPPFSSSAFQVGG
EGVPERRAQGSVSAGTGKPDFRASRPVGFSPSLPESVGIKSSSLNALTHSSRLKLHLKSD
MSECENDDPLLRSAGKVRDINRTYVISASRKTADMPLTPNPVGRLALQRRTTRNKESSLL
VSELEDTTEKTAETRLTLQRRAKTDSAEKWKTAEIDSVKMTLNVGGETENNGVSKESRTN
VRIVNNAKNSFVASSVPLDEDPQVIEMMADKKYKETFSAPSRANENVALKYSSNRPPIAS
LSQTEVVRSGHLTTKPTQSKLDIKVLGTGNLYHRSIGKEIAKTSNKFGSLEKRTPTKCTT
EHKLTTKCSLPQLKSPAPSILKNRMSNLQVKQRPKSSFLANKQERSAENTILPEEETVVQ
NTSAGKDPLKVENSQVTVAVRVRPFTKREKIEKASQVVFMSGKEITVEHPDTKQVYNFIY
DVSFWSFDECHPHYASQTTVYEKLAAPLLERAFEGFNTCLFAYGQTGSGKSYTMMGFSEE
PGIIPRFCEDLFSQVARKQTQEVSYHIEMSFFEVYNEKIHDLLVCKDENGQRKQPLRVRE
HPVYGPYVEALSMLLKESLGGNSKTAMIATISPAASNIEETLSTLRYANQARLIVNIAKV
NEDMNAKLIRELKAEIAKLKAAQRNSRNIDPERYRLCRQEITSLRMKLHQQERDMAEMQR
VWKEKFEQAEKRKLQETKELQKAGIMFQMDNHLPNLVNLNEDPQLSEMLLYMIKEGTTTV
GKYKPNSSHDIQLSGVLIADDHCTIKNFGGTVSIIPVGEAKTYVNGKHILEITVLRHGDR
VILGGDHYFRFNHPVEVQKGKRPSGRDTPISEGPKDFEFAKNELLMAQRSQLEAEIKEAQ
LKAKEEMMQGIQIAKEMAQQELSSQKAAYESKIKALEAELREESQRKKMQEINNQKANHK
IEELEKAKQHLEQEIYVNKKRLEMETLATKQALEDHSIRHARILEALETEKQKIAKEVQI
LQQNRNNRDKTFTVQTTWSSMKLSMMIQEANAISSKLKTYYVFGRHDISDKSSSDTSIRV
RNLKLGISTFWSLEKFESKLAAMKELYESNGSNRGEDAFCDPEDEWEPDITDAPVSSLSR
RRSRSLMKNRRISGCLHDIQVHPIKNLHSSHSSGLMDKSSTIYSNSAESFLPGICKELIG
SSLDFFGQSYDEERTIADSLINSFLKIYNGLFAISKAHEEQDEESQDNLFSSDRAIQSLT
IQTACAFEQLVVLMKHWLSDLLPCTNIARLEDELRQEVKKLGGYLQLFLQGCCLDISSMI
KEAQKNAIQIVQQAVKYVGQLAVLKGSKLHFLENGNNKAASVQEEFMDAVCDGVGLGMKI
LLDSGLEKAKELQHELFRQCTKNEKSFRRQVQEENFEYQDFKRMVNRAPEFLKLKHCLEK
AIEIIISALKGCHSDINLLQTCVESIRNLASDFYSDFSVPSTSVGSYESRVTHIVHQELE
SLAKSLLFCFESEESPDLLKPWETYNQNTKEEHQQSKSSGIDGSKNKGVPKRVYELHGSS
PAVSSEECTPSRIQWV

>gi568815597r:200544257_200766740|GENSCAN_predicted_CDS_2|6531_bp
atgtttgttccaagatctctaaaaatcaagaggaatgctaatgatgatggcaaaagttgt
gtggctaagataattaaaccagacccagaagaccttcagttggacaaaagcagagatgtt
cccgttgatgctgtagctacagaagcagccacaatagacaggcacatcagcgaatcatgc
cctttccccagcccaggtggccagttggcagaggttcattcagtaagtcccgagcagggt
gcgaaggacagccatccttctgaagagcccgttaagtcattttccaaaacacagcgctgg
gcagaaccaggggaacccatctgtgttgtctgtggtcgttatggagagtatatctgtgat
aagacagatgaagatgtgtgtagtttggagtgtaaagcgaaacatcttctacaagttaag
gaaaaggaagagaaatcaaaactcagcaatccacagaaggctgattctgagccagagtct
ccactgaatgcttcctatgtctacaaagagcacccctttattttgaaccttcaggaagac
cagattgaaaatcttaaacagcagctgggaattttagttcaagggcaagaagtcaccagg
cccattattgactttgaacattgtagtctccctgaggtcttaaatcacaacttgaagaaa
tcaggctatgaggtgccaactcccattcaaatgcagatgattcctgtgggacttctggga
agagacattctggccagtgcagatactggctcaggaaaaacagctgcttttcttcttcct
gttatcatgcgagctttattcgagagcaaaactccatctgcgctcattcttacaccaacc
agagagttagccattcagatagagagacaagctaaagaattgatgagtggcctgccacgc
atgaaaactgtgcttcttgtagggggcttacccttacccccacagctttatcgtctgcaa
caacatgttaaggttatcatagcaacccctgggcgacttctggatataataaagcagagc
tctgtagaactctgtggtgtaaagattgtggtagtagatgaagtgattcttatcttcaga
gaggtttgggccacactgactaccacaattccaactagcatagaacagctagcaagccag
cttctgcataatcctgtgagaattatcactggagaaaagaacctaccttgtgccaatgta
cgtcagattattttgtgggtagaagacccagccaaaaagaaaaaattatttgaaatttta
aatgataagaaactctttaagcctccagtgttagtatttgtggactgcaaactaggagca
gatcttttgagtgaagccgttcagaaaatcacagggctgaaaagcatatctatacattca
gagaagtcgcaaatagaaaggaaaaacatattgaagggattacttgaaggagactatgaa
gttgtagtgagcacaggagtcttgggacgaggcctagacttgatcagtgtcaggctggtt
gtcaattttgatatgccttcaagtatggatgagtatgtccatcaggaaaatacctacaag
tctacttggaggaatccccagcattttcaacaggatgtcagaatgaccttgggctatgtt
ggcaaagcacaatgggaagaagacaaccaattgaagagacggctaccggagcccaaccac
cggcacttccggggggcgggtggaagcaaaggcgcggcacccgacgcgacgcccagcagc
caatcagaaggcggggcgcccattcaaattgcggccttctggggagccggcgctggaggt
ggtgagtggcgtggggactgtgtcgagggggtccccaaggtgccggaccctgcggagggg
cgaagtttcggcactggggagggcgtgcggacgctttccctacaggcgaccactgctctg
cgggcgggtggtcttagctccagtcccccattcagttcctcagcattccaggtcggcggc
gaaggggtccccgaacgaagggcgcaaggcagcgtctctgctgggaccgggaagccggac
ttcagggcctctcggcccgtgggcttctccccgagtctccccgagtcggttggcattaag
agttcatcactgaatgccctcacccacagtagccgacttaagctgcatttgaagtcggat
atgtcagaatgtgaaaatgatgatccattattgagatctgcaggtaaagtcagagacata
aatagaacttatgttatttctgccagtagaaaaacagcagacatgccccttacccctaat
cctgtaggtagattggcacttcagaggagaactacaaggaacaaagaatcatctttgctt
gttagtgagttggaagacacaactgaaaaaacagcagaaacacgtcttacattacaacgt
cgtgctaaaacagattctgcagaaaagtggaaaacagctgaaatagattctgtcaaaatg
acactgaatgtgggaggtgaaacagaaaataatggtgtttctaaggaaagtagaacaaat
gtaaggattgtaaataatgctaaaaactcttttgttgcctcttctgtacctttagatgaa
gatccacaggtcattgaaatgatggctgataagaaatacaaagaaacattttctgccccc
agtagagcaaatgaaaatgttgcacttaagtactcaagtaatagaccacccattgcttcc
ctgagtcagactgaagttgttagatcaggacacttgacaacgaaacctactcagagcaag
ttggatatcaaagtgttgggaacaggaaacttgtatcatagaagtattgggaaggaaatt
gcaaaaacttcaaataaatttgggagcttagaaaaaagaacacctacaaaatgtacaaca
gaacacaaactgacaacaaagtgcagcctgcctcagcttaagagcccagctccatcaata
ctgaagaatagaatgtctaaccttcaagttaaacaaagaccaaaaagttcctttcttgca
aataaacaggaaagatccgcagaaaatacaattcttcccgaagaagaaactgtagttcag
aacacctctgcaggaaaagaccccttaaaagtagagaatagtcaagtgacagtggcagta
cgcgtaagacctttcaccaagagagagaagattgaaaaagcatcccaggtagtcttcatg
agtgggaaagaaataactgtggaacaccctgacacgaaacaagtttataattttatttat
gatgtttcattctggtcttttgatgaatgtcatcctcactacgctagccagacaactgtc
tatgagaagctagcagcaccactcctagaaagagccttcgaaggcttcaatacctgtctt
tttgcttatggtcagactggctctggaaaatcatatacgatgatgggatttagtgaagaa
ccaggaataattccaagattttgtgaagatcttttttctcaagtagccagaaaacaaacc
caagaggtcagctatcacattgaaatgagcttctttgaagtatataatgaaaaaattcac
gaccttctggtttgtaaagatgaaaatgggcagagaaagcaaccactgagagtgagggaa
catcctgtttatggaccatatgttgaagcactgtcaatgctgttaaaagaaagtctgggt
ggaaattcaaaaactgcaatgattgctacgattagtcccgctgccagcaacatagaagaa
acattaagcacacttagatatgctaaccaagcccgtttaatagtcaacattgctaaagta
aatgaagatatgaacgctaagttaattagagaattgaaggcagaaattgcaaagctaaaa
gctgctcagagaaacagtcggaatattgaccctgaacgatacaggctctgtcggcaagaa
ataacatccttaagaatgaaactgcatcaacaggagagagacatggcagaaatgcaaaga
gtgtggaaagaaaagtttgaacaagctgaaaaaagaaaacttcaagaaacaaaagagtta
cagaaagcaggaattatgtttcaaatggacaatcatttaccaaaccttgttaatctgaat
gaagatccacaactatctgagatgctgctatatatgataaaagaaggaacaactacagtt
ggaaagtataaaccaaactcaagccatgatattcagttatctggggtgctgattgctgat
gatcattgtactatcaaaaattttggtgggacagtgagtattatcccagttggggaagca
aagacatatgtaaatggaaaacatattttggaaatcacagtattacgtcatggtgatcga
gtgattcttggtggagatcattattttagatttaatcatccagtagaagtccagaaagga
aaaaggccatctggaagagatactcctataagtgagggtccaaaagactttgaatttgca
aaaaatgagttgctcatggcacagagatcacaacttgaagcagaaataaaagaggctcag
ttgaaggcaaaggaagaaatgatgcaaggaatccagattgcaaaagaaatggctcagcaa
gagctttcttctcaaaaagctgcatatgaaagcaaaataaaagcactggaagcagaactg
agagaagagtctcaaaggaaaaaaatgcaggaaataaataaccagaaggctaatcacaaa
attgaggaattagaaaaggcaaagcagcatcttgaacaggaaatatatgtcaacaaaaag
cgattagaaatggaaacattggctacaaaacaggctttagaagaccatagcatccgccat
gcaagaattctggaagctttagaaactgaaaagcaaaaaattgctaaagaagtacaaatt
ctacagcagaatcggaataatagggataaaacttttacagtgcagacaacttggagctct
atgaaactctcaatgatgattcaggaagccaatgctatcagcagcaaattgaaaacatac
tatgtttttggcagacatgatatatcagataaaagtagttctgacacttctattcgggtt
cgtaacctgaaactaggaatctcaacattctggagtctggaaaagtttgaatctaaactt
gcagcaatgaaagaactttatgagagtaatggtagtaacaggggtgaagatgccttttgt
gatcctgaagatgaatgggaacccgacattacagatgcaccagtttcttcactttctaga
aggaggagtaggagtttgatgaagaacagaagaatttctggttgtttacatgacatacaa
gtccatccaattaagaatttgcattcttcacattcatcaggtttaatggacaaatcaagc
actatttactcaaattcagcagagtcctttcttcctggaatttgcaaagaattgattggt
tcttcgttagatttttttggacagagttatgatgaagaaagaactatagcagacagccta
attaatagttttcttaaaatttataatgggctatttgccatttccaaggctcatgaagaa
caagatgaagaaagtcaagataacttgttttcttctgatcgagcaatccagtcacttact
attcagactgcatgtgcttttgagcagctagtagtgctaatgaaacactggctgagtgat
ttactgccttgtaccaacatagcaagacttgaggatgagttgagacaagaagttaaaaaa
ctgggaggctacttacagttatttttgcagggatgctgtttggatatttcatcaatgata
aaagaggctcaaaagaatgcaatccaaattgtacaacaagctgtaaagtatgtggggcag
ttagcagttctgaaagggagcaagctacattttctagaaaacggtaacaataaagctgcc
agtgtccaggaggaattcatggatgctgtttgtgatggtgtaggcttaggaatgaagatt
ttattagattctggactggaaaaagcaaaagaacttcagcatgaactctttaggcagtgt
acaaaaaatgagaaaagtttcagaaggcaagtacaagaagaaaactttgaataccaagat
ttcaagaggatggttaatcgtgctccagaattcttaaagttaaaacattgcttagagaaa
gctattgaaattattatttctgcactgaaaggatgccatagtgatataaatcttctccag
acttgtgttgaaagtattcgcaacttggccagtgatttttacagtgacttcagtgtgcct
tctacttctgttggcagctatgagagtagagtaactcacattgtccaccaggaactagaa
tctctagctaagtctctcctcttttgttttgaatctgaagaaagccctgatttgttgaaa
ccctgggaaacttataatcaaaataccaaagaagaacaccaacaatctaaatcaagcggg
attgacggcagtaagaataaaggtgtaccaaagcgtgtctatgagctccatggctcatcc
ccagcagtgagctcagaggaatgcacacccagtaggattcagtgggtgtga

>gi568815597r:200544257_200766740|GENSCAN_predicted_peptide_3|377_aa
MTSGVNVAVGFSRQQLGTLDATTEIRTGCSGNNRGGTCVTALTGKLSAAVREGFMEEVTT
VEFEGAVGVSWVDRGTTLSKECRSRKTAAAAAAAEGERSSELQKGGQEKITRTLLIRQEL
RAWDSRPLPGHHLRRQQGRKTRETAAATAAGEGGDGDGREHRGGDGPAAASIKPFDHYDF
SRAKIACNLAWLVAKAFGTDKVTGSQINLHKIKHLASDVFCMSDSEAFAYYSIPFCRVTV
LSADPLCTPLACHCHYHDHQLSLCQQMCRVASLHSTIPAFHRRLGQGCLRENVPEELQEP
FYTDQYDQEHIKPPVVNLLLSAELYCRAGSLILKSDAAKPLLGHDAVIQALAQKGLYVTD
QEKLVTERDLHKKPIQM

>gi568815597r:200544257_200766740|GENSCAN_predicted_CDS_3|1131_bp
atgacctctggtgtaaatgtggccgtgggtttcagcaggcagcagctggggaccctagat
gctacgacagaaataagaacagggtgctccgggaacaacaggggaggcacctgtgtgaca
gccctgactggaaagttgtcagcagctgtcagggaaggcttcatggaggaagtgactact
gtggagtttgaaggtgcagtaggagtaagctgggtggacagaggaaccaccctgagcaag
gagtgccgcagccggaaaaccgcagcggcggcggcggcggctgagggggaacgatccagc
gagctgcagaaaggggggcaggaaaaaattacaaggacattactgataagacaggaactg
agagcttgggattcgcggcctctccccggccaccaccttcgccgtcagcaagggaggaag
acccgggagacggcagcggcgacggcggcaggggaaggaggagacggcgacgggagggag
cacagaggaggggacgggccggcggcggcctccatcaagccttttgaccactatgatttc
tccagggccaaaatcgcctgcaatctggcctggctggtggccaaagcctttgggacagat
aaggtaactggctcacagattaatttgcacaagattaaacatctggcaagtgatgtgttc
tgcatgtctgactctgaagcctttgcttattactctataccgttctgcagagtaaccgtt
ttatctgcagacccactctgtactcctcttgcttgtcactgccactaccacgaccaccag
ttgtctctctgtcagcagatgtgccgtgttgcctctctgcacagtaccatccctgcattc
cacagaagacttgggcaagggtgcctgagagaaaatgtgccagaggaacttcaagaacca
ttttacacagatcagtatgaccaggaacacatcaaaccacctgttgttaatttgcttcta
tcggctgaactatactgtcgtgctgggagtctcattctcaagagtgatgctgcaaaaccc
cttttgggccatgatgctgtaatccaggctttagcacagaaaggtctttatgtcactgac
caggaaaaattggtaactgaacgagatctccacaagaaacccatacagatg