GENSCAN 1.0 Date run: 6-Nov-116 Time: 20:13:18 Sequence gi568815593f:69067907_69277628 : 209722 bp : 42.29% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 1310 1516 207 1 0 62 41 154 0.491 4.46 1.02 PlyA + 3334 3339 6 1.05 2.00 Prom + 5781 5820 40 -4.65 2.01 Init + 6428 6527 100 1 1 55 59 99 0.284 4.18 2.02 Term + 6878 7461 584 0 2 20 47 377 0.990 20.47 2.03 PlyA + 8521 8526 6 1.05 3.00 Prom + 14609 14648 40 -4.95 3.01 Init + 26350 26432 83 0 2 86 99 150 0.589 16.41 3.02 Intr + 26574 26702 129 0 0 95 57 44 0.143 0.89 3.03 Intr + 40443 40530 88 0 1 88 70 64 0.812 3.65 3.04 Intr + 47331 47501 171 2 0 80 94 96 0.982 8.62 3.05 Intr + 48020 48283 264 1 0 95 67 127 0.926 8.09 3.06 Intr + 49333 49490 158 0 2 57 69 97 0.684 2.59 3.07 Intr + 50593 50722 130 1 1 95 98 5 0.680 1.98 3.08 Intr + 53788 53989 202 0 1 61 84 51 0.672 -0.06 3.09 Intr + 55293 55519 227 1 2 94 79 123 0.972 8.78 3.10 Intr + 60098 60226 129 1 0 100 89 65 0.976 7.77 3.11 Intr + 61541 61707 167 1 2 73 47 100 0.034 2.24 3.12 Intr + 84235 84361 127 1 1 56 64 61 0.011 0.26 3.13 Intr + 99005 99282 278 1 2 71 28 156 0.080 3.29 3.14 Intr + 99287 99377 91 2 1 47 84 110 0.941 5.58 3.15 Intr + 100002 100172 171 2 0 86 88 189 0.993 17.92 3.16 Intr + 100267 100437 171 0 0 103 25 172 0.998 11.62 3.17 Intr + 103364 103546 183 1 0 73 107 181 0.896 17.56 3.18 Intr + 106345 106503 159 0 0 73 99 105 0.932 9.36 3.19 Intr + 106971 107207 237 2 0 69 77 156 0.930 9.49 3.20 Intr + 107491 107631 141 0 0 36 94 97 0.810 4.73 3.21 Intr + 109333 109443 111 0 0 42 101 66 0.871 2.96 3.22 Intr + 109618 109719 102 0 0 127 39 66 0.932 5.05 3.23 Intr + 113711 113795 85 1 1 47 101 104 0.954 6.07 3.24 Intr + 113906 114036 131 0 2 49 117 77 0.549 6.19 3.25 Intr + 121519 121603 85 0 1 28 35 113 0.182 -1.53 3.26 Intr + 121661 121862 202 0 1 59 7 265 0.534 12.92 3.27 Intr + 123889 123944 56 1 2 136 108 36 0.770 8.10 3.28 Intr + 140290 140453 164 2 2 59 92 105 0.949 6.77 3.29 Term + 141801 141893 93 2 0 111 38 81 0.981 2.25 3.30 PlyA + 142432 142437 6 1.05 4.03 PlyA - 142581 142576 6 1.05 4.02 Term - 150496 149962 535 0 1 61 47 495 0.553 35.43 4.01 Init - 153097 152979 119 1 2 68 74 68 0.869 3.12 4.00 Prom - 160507 160468 40 -7.45 5.00 Prom + 161600 161639 40 -6.45 5.01 Init + 167070 167298 229 2 1 53 20 163 0.002 4.58 5.02 Intr + 182839 182975 137 2 2 92 47 108 0.599 6.47 5.03 Intr + 187554 187622 69 2 0 68 94 35 0.346 0.56 5.04 Intr + 190137 190247 111 2 0 89 86 19 0.675 1.46 5.05 Intr + 191912 192030 119 1 2 40 110 69 0.928 2.64 5.06 Intr + 194299 194398 100 1 1 114 110 128 0.999 16.79 5.07 Intr + 195795 196115 321 2 0 53 70 134 0.565 3.23 5.08 Intr + 201301 201387 87 0 0 100 119 43 0.765 7.85 5.09 Intr + 204986 205135 150 1 0 67 68 58 0.602 1.14 5.10 Intr + 208637 208784 148 1 1 61 116 111 0.892 10.09 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 61541 61711 171 1 0 73 50 118 0.952 3.34 S.002 Init + 98011 98089 79 0 1 53 86 39 0.843 1.47 S.003 Intr + 98973 99282 310 1 1 66 28 175 0.827 3.75 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:69067907_69277628|GENSCAN_predicted_peptide_1|68_aa QAPVSSAAASQKRHQVHTKRAHKQPQEQQKKEAAVVDESSICILESAYQHSPNTYSLTKL LEFYLSFH >gi568815593f:69067907_69277628|GENSCAN_predicted_CDS_1|207_bp caagcaccagtctcctcagctgctgcttcacaaaagaggcaccaagtccacaccaaacga gcacacaagcagccccaagagcagcagaagaaggaagcagcagtggtggatgagagctcc atatgcattttagagtcagcttatcaacattcaccaaacacatattctcttacaaaactg ctggaattttaccttagctttcattga >gi568815593f:69067907_69277628|GENSCAN_predicted_peptide_2|227_aa MFSKEKDKGSPLTMEESATGHARKAKDSRGMDGAGNFQQPGANRTCEGGSGSPEPSSREA TFRERAAAQEKALPPTGVLLHSGTLPICGHPPTSSLSPPIPPAGDRHRGTVPSPRAPPQG CHVAEEAGELTLGCSQAGLSSVAPPHPSYTHRPGCQPAGVSARGAFQHPPVLEELHLSPN IFTRLLGAAFSALACMVHHLHLSEKLAWVPVEAFVGLQIQVNQSANP >gi568815593f:69067907_69277628|GENSCAN_predicted_CDS_2|684_bp atgttttccaaggaaaaggacaagggaagtccactcaccatggaagaaagtgccacaggg catgctagaaaggccaaggacagcagagggatggatggagccggcaacttccagcagcca ggagcaaacaggacttgtgaaggaggcagtggatctccagagcccagcagcagggaggct actttccgggagagggcagcagcacaggagaaggccctgccgcctacaggggtcctactc cactccgggaccctgcccatctgtggccatcctcccaccagcagcctgtcacctcccatc cctcccgctggtgacaggcaccggggtaccgtgcccagcccccgggcacctccacagggc tgccatgtggcagaggaagccggtgaactgacgcttggctgcagccaggcaggcctaagc tctgttgccccgccgcatcccagctacacccacaggcctggatgccaaccagctggcgtc agtgcccgcggtgccttccagcacccgcctgtcctggaggagctgcacctgtcccctaat atcttcacccgcctcttaggggcagccttctcggccctggcgtgcatggtgcaccacctc cacctctctgaaaagctggcctgggtgcccgtggaggccttcgtggggctgcagatccaa gtgaaccaatccgccaacccgtga >gi568815593f:69067907_69277628|GENSCAN_predicted_peptide_3|1444_aa MEEKYGGDVLAGPGGGGGLGPVDVPSARLPPGSPGSSGACCRRKALKPKSFCLNFWVTRQ TVSLRTARQFTTLLLFEHSDIVVISLLSVLFTSSGGGPAKGGVLLLVLALCCKVGFHTAS RKLSVDVGGAKRLQALSHLVSVLLLCPWVIVLSVTTESKVESWFSLIMPFATVIFFVMIL DFYVDSICSVKMEVSKCARYGSFPIFISALLFGNFWTHPITDQLRAMNKAAHQESTEHVL SGGVVLFTFVELFYGVLTNSLGLISDGFHMLFDCSALVMGLFAALMSRWKATRIFSYGYG RIEILSGFINGLFLIVIAFFVFMESVARLIDPPELDTHMLTPVSVGGLIVNLIGICAFSH AHSHAHGASQGSCHSSDHSHSHHMHGHSDHGHGHSHGSAGGGMNANMRGVFLHVLADTLG SIGVIVSTVLIEQFGWFIADPLCSLFIAILIFLSVVPLIKDACQVLLLRLPPEYEKELHI ALEKIQKIEGLISYRDPHFWRHSASIVAGTIHIQVTSDVLEQRIVQQVTGILKDAGVNNL TIQVEKEAYFQHMSGLSTGFHDVLAMTKQMESMKYCKDGTYIITFSVGLIHSPETILLLH LEGYKSSCQPFGDWVGEGDWRSYHLVAAAAARERRRRGRPRESLARPSGLASLWPRPSRT PSRDRPGNAFSATGSRQWEGSECHEQANKEGAVRGLNLRLGWLFSACCGGTAVGFCWVSL AGRASGVLLLPAELLPGEEEAMALRVTRNSKINAENKAKINMAGAKRVPTAPAATSKPGL RPRTALGDIGNKVSEQLQAKMPMKKEAKPSATGKVIDKKLPKPLEKVPMLVPVPVSEPVP EPEPEPEPEPVKEEKLSPEPILVDTASPSPMETSGCAPAEEDLCQAFSDVILAVNDVDAE DGADPNLCSEYVKDIYAYLRQLEEEQAVRPKYLLGREVTGNMRAILIDWLVQVQMKFRLL QETMYMTVSIIDRFMQNNCVPKKMLQLVGVTAMFIASKYEEMYPPEIGDFAFVTDNTYTK HQIRQMEMKILRALNFGLGRPLPLHFLRRASKIGEVDVEQHTLAKYLMELTMLDYDMVHF PPSQIAAGAFCLALKILDNGEWTPTLQHYLSYTEESLLPVMQHLAKNVVMVNQGLTKHMT VKNKYATSKHAKISTLPQLNSALVQDLAKAVAKRFPIVSEICSFRWVLGLVDFKNEATDP RGVKPQTFDLCSECYSVTAHKDSTVLFSRWVHGLAGFRSEAADLPPSAVIFTSRARMKEE PVVTTATEKQGGKVAGKATFSERVCLLSGSLSPQPAMEEQPQMQDADEPADSGGEGRAGG PPQVAGAQAACSEDRMTLLLRLRAQTKQQLLEYKSMVDAKLKQASESKLLEIQTEKNKQK IDLDSMENSERIKIIRQNLQMEIKITTVIQHVFQNLILGSKVNWAEDPALKEIVLQLEKN VDMM >gi568815593f:69067907_69277628|GENSCAN_predicted_CDS_3|4335_bp atggaggagaaatacggcggggacgtgctggccggccccggcggcggcggcggccttggg ccggtggacgtacccagcgctcggctgcctccgggctccccgggctcttcgggtgcctgc tgcagaaggaaggcgctaaagccaaaaagcttctgcttgaatttctgggttacaaggcag acagtcagtcttagaactgcccgccagttcacgactttgctgctatttgagcacagtgat attgttgtcatttcactactcagtgttttgttcaccagttctggaggaggaccagcaaag ggtggagtattattgctagtactggctttgtgttgtaaagttggttttcatacagcttcc agaaagctctctgtcgacgttggtggagctaaacgtcttcaagctttatctcatcttgtt tctgtgcttctcttgtgcccatgggtcattgttctttctgtgacaactgagagtaaagtg gagtcttggttttctctcattatgccttttgcaacggttatcttttttgtcatgatcctg gatttctacgtggattccatttgttcagtcaaaatggaagtttccaaatgtgctcgttat ggatcctttcccatttttattagtgctctcctttttggaaatttttggacacatccaata acagaccagcttcgggctatgaacaaagcagcacaccaggagagcactgaacacgtcctg tctggaggagtggtactttttacctttgtggaattattctatggcgtgctgaccaatagt ctgggcctgatctcggatggattccacatgctttttgactgctctgctttagtcatggga ctttttgctgccctgatgagtaggtggaaagccactcggattttctcctatgggtacggc cgaatagaaattctgtctggatttattaatggactttttctaatagtaatagcgtttttt gtgtttatggagtcagtggctagattgattgatcctccagaattagacactcacatgtta acaccagtctcagttggagggctgatagtaaaccttattggtatctgtgcctttagccat gcccatagccatgcccatggagcttctcaaggaagctgtcactcatctgatcacagccat tcacaccatatgcatggacacagtgaccatgggcatggtcacagccacggatctgcgggt ggaggcatgaatgctaacatgaggggtgtatttctacatgttttggcagatacacttggc agcattggtgtgatcgtatccacagttcttatagagcagtttggatggttcatcgctgac ccactctgttctctttttattgctatattaatatttctcagtgttgttccactgattaaa gatgcctgccaggttctactcctgagattgccaccagaatatgaaaaagaactacatatt gctttagaaaagatacagaaaattgaaggattaatatcataccgagaccctcatttttgg cgtcattctgctagtattgtggcaggaacaattcatatacaggtgacatctgatgtgcta gaacaaagaatagtacagcaggttacaggaatacttaaagatgctggagtaaacaattta acaattcaagtggaaaaggaggcatactttcaacatatgtctggcctaagtactggattt catgatgttctggctatgacaaaacaaatggaatccatgaaatactgcaaagatggtact tacatcataactttttctgttgggctgatccattccccagaaaccattctcttacttcac ttagaagggtataagtccagctgccagccttttggggactgggtaggagaaggcgactgg aggtcttaccatttggtggccgctgcagctgcccgagagcgcaggcgcagaggcagacca cgtgagagcctggccaggccttccggcctagcctcactgtggccccgcccctctcgaacg ccttcgcgcgatcgccctggaaacgcattctctgcgaccggcagccgccaatgggaaggg agtgagtgccacgaacaggccaataaggagggagcagtgcggggtttaaatctgaggcta ggctggctcttctcggcgtgctgcggcggaacggctgttggtttctgctgggtgtccttg gctggtcgggcctccggtgttctgcttctccccgctgagctgctgcctggtgaagaggaa gccatggcgctccgagtcaccaggaactcgaaaattaatgctgaaaataaggcgaagatc aacatggcaggcgcaaagcgcgttcctacggcccctgctgcaacctccaagcccggactg aggccaagaacagctcttggggacattggtaacaaagtcagtgaacaactgcaggccaaa atgcctatgaagaaggaagcaaaaccttcagctactggaaaagtcattgataaaaaacta ccaaaacctcttgaaaaggtacctatgctggtgccagtgccagtgtctgagccagtgcca gagccagaacctgagccagaacctgagcctgttaaagaagaaaaactttcgcctgagcct attttggttgatactgcctctccaagcccaatggaaacatctggatgtgcccctgcagaa gaagacctgtgtcaggctttctctgatgtaattcttgcagtaaatgatgtggatgcagaa gatggagctgatccaaacctttgtagtgaatatgtgaaagatatttatgcttatctgaga caacttgaggaagagcaagcagtcagaccaaaatacctactgggtcgggaagtcactgga aacatgagagccatcctaattgactggctagtacaggttcaaatgaaattcaggttgttg caggagaccatgtacatgactgtctccattattgatcggttcatgcagaataattgtgtg cccaagaagatgctgcagctggttggtgtcactgccatgtttattgcaagcaaatatgaa gaaatgtaccctccagaaattggtgactttgcttttgtgactgacaacacttatactaag caccaaatcagacagatggaaatgaagattctaagagctttaaactttggtctgggtcgg cctctacctttgcacttccttcggagagcatctaagattggagaggttgatgtcgagcaa catactttggccaaatacctgatggaactaactatgttggactatgacatggtgcacttt cctccttctcaaattgcagcaggagctttttgcttagcactgaaaattctggataatggt gaatggacaccaactctacaacattacctgtcatatactgaagaatctcttcttccagtt atgcagcacctggctaagaatgtagtcatggtaaatcaaggacttacaaagcacatgact gtcaagaacaagtatgccacatcgaagcatgctaagatcagcactctaccacagctgaat tctgcactagttcaagatttagccaaggctgtggcaaagagatttcctattgtgtccgaa atttgttccttccggtgggttcttggtctcgttgacttcaagaatgaagccacggaccct cgcggagtgaagccgcagaccttcgacctttgcagtgagtgttacagtgttacagctcat aaagacagcactgttctcttctcccggtgggttcatggtctcgctggcttcaggagtgaa gctgcagaccttccgccgtctgcagttattttcaccagtagagcccggatgaaagaggag cccgtagtaaccacggcaaccgaaaaacaaggcggaaaggtggcgggaaaagcgaccttt tctgagcgcgtttgcctgttgagtggtagcctttcccctcaaccagcaatggaggagcag ccccagatgcaagacgccgacgagcccgcggactccggaggggaaggccgggcaggcggg ccaccgcaggtcgccggcgcccaggcggcgtgcagcgaggaccgcatgaccctgctcctc aggctgagagcacagacaaaacaacaactcttagaatataaatcaatggttgatgcaaaa ttaaaacaagcttcagaaagtaagcttttagaaatacagactgaaaagaacaaacagaag attgatttggacagtatggaaaactcagagaggataaagatcatacgacaaaacctacag atggagataaaaattactactgttattcaacatgtgttccagaaccttattttggggagt aaagtcaattgggcagaggatcctgcccttaaggaaattgttctgcagcttgagaagaat gttgacatgatgtaa >gi568815593f:69067907_69277628|GENSCAN_predicted_peptide_4|217_aa MTLNEHAAFKHLFNKAHLAPPLIHLTLSGHSTFQRARGWGQTLGEVGENPLRRGRSTRGQ LPDHSRDLEGTVGQIQGSLPSTPRYRLPSDWLIGGTTANPRRRLARLPPGSPHPHTPKTN KVLAEERGIPAVGTGPTCQDAARLPQGSNPRRRTHLTARSNRLKVPSSHTPAGAVTGPPQ RRGHRSASATAFTPRRRRHSQHRPYRPRCGNATNCFT >gi568815593f:69067907_69277628|GENSCAN_predicted_CDS_4|654_bp atgactcttaacgagcatgctgccttcaagcatctgtttaacaaagcacatcttgcaccg cccttaatccatttaaccctgagtggacacagcacgtttcagagagcacggggttggggc cagacgcttggagaggttggggagaacccgctccggcgaggacgcagtacgcggggccag ttgcccgaccattcccgagaccttgaagggacagtgggtcagatccagggttcgcttcct tccacgccccgctaccgccttccttccgattggctgatcgggggcaccacagccaatcct agacggcgattggccaggctaccgccagggtcgcctcacccgcacacgcctaagactaac aaggtcctggcggaggagcgggggatcccggcagtaggtacaggacctacctgccaagat gctgcacggttgccgcaaggctcgaaccctcggcgccggacccacctcactgcccggtcc aacaggctcaaagtcccatcctcacacacacccgcgggcgccgtcactggccctccgcag aggcgcggtcaccgctctgcctctgccaccgcctttactccccgacgccgtcgccatagt cagcaccgtccctaccgcccaagatgcggaaacgcgacaaattgctttacctga >gi568815593f:69067907_69277628|GENSCAN_predicted_peptide_5|491_aa MALDVKSRAKRYEKLDFLGEGQVRLSGRTGRAPSGQPRAASTFAGFPVEARGLAWLLVLV GGNRPDALAAHSLHPGGVSAGSHYSCECAVSYLKPICLGVSPEAHGEALVTTADYSGPRG SILLDAFGHKSNISLVFDFMETDLEVIIKDNSLVLTPSHIKAYMLMTLQGLEYLHQHWIL HRDLKPNNLLLDENGVLKLADFGLAKSFGSPNRAYTHQVVTRWYRAPELLFGARMYGVGV DMWAVGCILAELLLRSIPEIHTENLSTTTIHSWPRRKQNSIAPNVMTDHSIEKANEERQQ LRVDRLEIVDLLTIILGNLLAIGIVAGIIFKVIRKMTCRLKNCTLLPYLLSYFKLFLCSI EKVPFLPGDSDLDQLTRIFETLGTPTEEQWPDMCSLPDYVTFKSFPGIPLHHIFSAAGDD LLDLIQGLFLFNPCARITATQALKMKYFSNRPGPTPGCQLPRPNCPVETLKEQSNPALAI KRKRTEALEQX >gi568815593f:69067907_69277628|GENSCAN_predicted_CDS_5|1473_bp atggctctggacgtgaagtctcgggcaaagcgttatgagaagctggacttccttggggag ggacaggtgaggctctctggaaggacggggagggccccaagcggacagccccgcgccgcc tccacctttgcgggttttcccgtggaggccagaggtctggcttggctgctcgttctcgtt gggggaaaccgtccagacgcacttgctgcccattctttacatcctgggggagtctctgct ggtagccactacagctgtgaatgtgctgtgtcatacctgaagccaatatgtctcggagtt tcacctgaggcccatggtgaggccctggttaccactgctgattattcagggcccaggggc tctatactccttgatgcttttggacataaatctaatattagccttgtctttgattttatg gaaactgatctagaggttataataaaggataatagtcttgtgctgacaccatcacacatc aaagcctacatgttgatgactcttcaaggattagaatatttacatcaacattggatccta catagggatctgaaaccaaacaacttgttgctagatgaaaatggagttctaaaactggca gattttggcctggccaaatcttttgggagccccaatagagcttatacacatcaggttgta accaggtggtatcgggcccccgagttactatttggagctaggatgtatggtgtaggtgtg gacatgtgggctgttggctgtatattagcagagttacttctaaggagtataccagaaatt cacactgagaatttatcaaccaccacaatccattcatggccaagaagaaaacagaattcc atagcccctaatgttatgactgatcactccattgagaaagcaaatgaagagagacagcag ttgagagtagacaggttagagattgtggacctgttaacaatcatacttggaaatttatta gccattggcattgttgctggaatcatctttaaagttataaggaaaatgacatgcagacta aagaattgtactcttctgccataccttttgtcctattttaagctgttcttgtgctcaatt gaaaaggttccttttttgccaggagattcagaccttgatcagctaacaagaatatttgaa actttgggcacaccaactgaggaacagtggccggacatgtgtagtcttccagattatgtg acatttaagagtttccctggaatacctttgcatcacatcttcagtgcagcaggagacgac ttactagatctcatacaaggcttattcttatttaatccatgtgctcgaattacggccaca caggcactgaaaatgaagtatttcagtaatcggccagggccaacacctggatgtcagctg ccaagaccaaactgtccagtggaaaccttaaaggagcaatcaaatccagctttggcaata aaaaggaaaagaacagaggccttagaacaagnn