GENSCAN 1.0 Date run: 7-Nov-116 Time: 01:22:17 Sequence gi568815593f:69089635_69309796 : 220162 bp : 42.29% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4622 4704 83 1 2 86 99 150 0.806 16.41 1.02 Intr + 4846 4974 129 1 0 95 57 44 0.196 0.89 1.03 Intr + 18715 18802 88 1 1 88 70 64 0.818 3.65 1.04 Intr + 25603 25773 171 0 0 80 94 96 0.982 8.62 1.05 Intr + 26292 26555 264 2 0 95 67 127 0.926 8.09 1.06 Intr + 27605 27762 158 1 2 57 69 97 0.684 2.59 1.07 Intr + 28865 28994 130 2 1 95 98 5 0.680 1.98 1.08 Intr + 32060 32261 202 1 1 61 84 51 0.672 -0.06 1.09 Intr + 33565 33791 227 2 2 94 79 123 0.972 8.78 1.10 Intr + 38370 38498 129 2 0 100 89 65 0.976 7.77 1.11 Intr + 39813 39979 167 2 2 73 47 100 0.034 2.24 1.12 Intr + 62507 62633 127 2 1 56 64 61 0.011 0.26 1.13 Intr + 77277 77554 278 2 2 71 28 156 0.080 3.29 1.14 Intr + 77559 77649 91 0 1 47 84 110 0.941 5.58 1.15 Intr + 78274 78444 171 0 0 86 88 189 0.993 17.92 1.16 Intr + 78539 78709 171 1 0 103 25 172 0.998 11.62 1.17 Intr + 81636 81818 183 2 0 73 107 181 0.896 17.56 1.18 Intr + 84617 84775 159 1 0 73 99 105 0.932 9.36 1.19 Intr + 85243 85479 237 0 0 69 77 156 0.930 9.49 1.20 Intr + 85763 85903 141 1 0 36 94 97 0.810 4.73 1.21 Intr + 87605 87715 111 1 0 42 101 66 0.871 2.96 1.22 Intr + 87890 87991 102 1 0 127 39 66 0.932 5.05 1.23 Intr + 91983 92067 85 2 1 47 101 104 0.954 6.07 1.24 Intr + 92178 92308 131 1 2 49 117 77 0.549 6.19 1.25 Intr + 99791 99875 85 1 1 28 35 113 0.182 -1.53 1.26 Intr + 99933 100134 202 1 1 59 7 265 0.534 12.92 1.27 Intr + 102161 102216 56 2 2 136 108 36 0.770 8.10 1.28 Intr + 118562 118725 164 0 2 59 92 105 0.949 6.77 1.29 Term + 120073 120165 93 0 0 111 38 81 0.981 2.25 1.30 PlyA + 120704 120709 6 1.05 2.03 PlyA - 120853 120848 6 1.05 2.02 Term - 128768 128234 535 1 1 61 47 495 0.553 35.43 2.01 Init - 131369 131251 119 2 2 68 74 68 0.869 3.12 2.00 Prom - 138779 138740 40 -7.45 3.00 Prom + 139872 139911 40 -6.45 3.01 Init + 145342 145570 229 0 1 53 20 163 0.002 4.58 3.02 Intr + 161111 161247 137 0 2 92 47 108 0.599 6.47 3.03 Intr + 165826 165894 69 0 0 68 94 35 0.346 0.56 3.04 Intr + 168409 168519 111 0 0 89 86 19 0.675 1.46 3.05 Intr + 170184 170302 119 2 2 40 110 69 0.928 2.64 3.06 Intr + 172571 172670 100 2 1 114 110 128 0.999 16.79 3.07 Intr + 174067 174387 321 0 0 53 70 134 0.571 3.23 3.08 Intr + 179573 179659 87 1 0 100 119 43 0.783 7.85 3.09 Intr + 183258 183407 150 2 0 67 68 58 0.621 1.14 3.10 Intr + 186909 187056 148 2 1 61 116 111 0.930 10.09 3.11 Intr + 189992 190154 163 0 1 69 66 155 0.387 9.51 3.12 Intr + 195895 195991 97 1 1 49 94 11 0.078 -3.11 3.13 Intr + 199176 199275 100 2 1 81 103 57 0.241 5.26 3.14 Intr + 208929 209072 144 1 0 68 94 70 0.245 4.93 3.15 Term + 219304 219869 566 2 2 44 43 175 0.003 2.17 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 39813 39983 171 2 0 73 50 118 0.952 3.34 S.002 Init + 76283 76361 79 1 1 53 86 39 0.843 1.47 S.003 Intr + 77245 77554 310 2 1 66 28 175 0.827 3.75 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:69089635_69309796|GENSCAN_predicted_peptide_1|1444_aa MEEKYGGDVLAGPGGGGGLGPVDVPSARLPPGSPGSSGACCRRKALKPKSFCLNFWVTRQ TVSLRTARQFTTLLLFEHSDIVVISLLSVLFTSSGGGPAKGGVLLLVLALCCKVGFHTAS RKLSVDVGGAKRLQALSHLVSVLLLCPWVIVLSVTTESKVESWFSLIMPFATVIFFVMIL DFYVDSICSVKMEVSKCARYGSFPIFISALLFGNFWTHPITDQLRAMNKAAHQESTEHVL SGGVVLFTFVELFYGVLTNSLGLISDGFHMLFDCSALVMGLFAALMSRWKATRIFSYGYG RIEILSGFINGLFLIVIAFFVFMESVARLIDPPELDTHMLTPVSVGGLIVNLIGICAFSH AHSHAHGASQGSCHSSDHSHSHHMHGHSDHGHGHSHGSAGGGMNANMRGVFLHVLADTLG SIGVIVSTVLIEQFGWFIADPLCSLFIAILIFLSVVPLIKDACQVLLLRLPPEYEKELHI ALEKIQKIEGLISYRDPHFWRHSASIVAGTIHIQVTSDVLEQRIVQQVTGILKDAGVNNL TIQVEKEAYFQHMSGLSTGFHDVLAMTKQMESMKYCKDGTYIITFSVGLIHSPETILLLH LEGYKSSCQPFGDWVGEGDWRSYHLVAAAAARERRRRGRPRESLARPSGLASLWPRPSRT PSRDRPGNAFSATGSRQWEGSECHEQANKEGAVRGLNLRLGWLFSACCGGTAVGFCWVSL AGRASGVLLLPAELLPGEEEAMALRVTRNSKINAENKAKINMAGAKRVPTAPAATSKPGL RPRTALGDIGNKVSEQLQAKMPMKKEAKPSATGKVIDKKLPKPLEKVPMLVPVPVSEPVP EPEPEPEPEPVKEEKLSPEPILVDTASPSPMETSGCAPAEEDLCQAFSDVILAVNDVDAE DGADPNLCSEYVKDIYAYLRQLEEEQAVRPKYLLGREVTGNMRAILIDWLVQVQMKFRLL QETMYMTVSIIDRFMQNNCVPKKMLQLVGVTAMFIASKYEEMYPPEIGDFAFVTDNTYTK HQIRQMEMKILRALNFGLGRPLPLHFLRRASKIGEVDVEQHTLAKYLMELTMLDYDMVHF PPSQIAAGAFCLALKILDNGEWTPTLQHYLSYTEESLLPVMQHLAKNVVMVNQGLTKHMT VKNKYATSKHAKISTLPQLNSALVQDLAKAVAKRFPIVSEICSFRWVLGLVDFKNEATDP RGVKPQTFDLCSECYSVTAHKDSTVLFSRWVHGLAGFRSEAADLPPSAVIFTSRARMKEE PVVTTATEKQGGKVAGKATFSERVCLLSGSLSPQPAMEEQPQMQDADEPADSGGEGRAGG PPQVAGAQAACSEDRMTLLLRLRAQTKQQLLEYKSMVDAKLKQASESKLLEIQTEKNKQK IDLDSMENSERIKIIRQNLQMEIKITTVIQHVFQNLILGSKVNWAEDPALKEIVLQLEKN VDMM >gi568815593f:69089635_69309796|GENSCAN_predicted_CDS_1|4335_bp atggaggagaaatacggcggggacgtgctggccggccccggcggcggcggcggccttggg ccggtggacgtacccagcgctcggctgcctccgggctccccgggctcttcgggtgcctgc tgcagaaggaaggcgctaaagccaaaaagcttctgcttgaatttctgggttacaaggcag acagtcagtcttagaactgcccgccagttcacgactttgctgctatttgagcacagtgat attgttgtcatttcactactcagtgttttgttcaccagttctggaggaggaccagcaaag ggtggagtattattgctagtactggctttgtgttgtaaagttggttttcatacagcttcc agaaagctctctgtcgacgttggtggagctaaacgtcttcaagctttatctcatcttgtt tctgtgcttctcttgtgcccatgggtcattgttctttctgtgacaactgagagtaaagtg gagtcttggttttctctcattatgccttttgcaacggttatcttttttgtcatgatcctg gatttctacgtggattccatttgttcagtcaaaatggaagtttccaaatgtgctcgttat ggatcctttcccatttttattagtgctctcctttttggaaatttttggacacatccaata acagaccagcttcgggctatgaacaaagcagcacaccaggagagcactgaacacgtcctg tctggaggagtggtactttttacctttgtggaattattctatggcgtgctgaccaatagt ctgggcctgatctcggatggattccacatgctttttgactgctctgctttagtcatggga ctttttgctgccctgatgagtaggtggaaagccactcggattttctcctatgggtacggc cgaatagaaattctgtctggatttattaatggactttttctaatagtaatagcgtttttt gtgtttatggagtcagtggctagattgattgatcctccagaattagacactcacatgtta acaccagtctcagttggagggctgatagtaaaccttattggtatctgtgcctttagccat gcccatagccatgcccatggagcttctcaaggaagctgtcactcatctgatcacagccat tcacaccatatgcatggacacagtgaccatgggcatggtcacagccacggatctgcgggt ggaggcatgaatgctaacatgaggggtgtatttctacatgttttggcagatacacttggc agcattggtgtgatcgtatccacagttcttatagagcagtttggatggttcatcgctgac ccactctgttctctttttattgctatattaatatttctcagtgttgttccactgattaaa gatgcctgccaggttctactcctgagattgccaccagaatatgaaaaagaactacatatt gctttagaaaagatacagaaaattgaaggattaatatcataccgagaccctcatttttgg cgtcattctgctagtattgtggcaggaacaattcatatacaggtgacatctgatgtgcta gaacaaagaatagtacagcaggttacaggaatacttaaagatgctggagtaaacaattta acaattcaagtggaaaaggaggcatactttcaacatatgtctggcctaagtactggattt catgatgttctggctatgacaaaacaaatggaatccatgaaatactgcaaagatggtact tacatcataactttttctgttgggctgatccattccccagaaaccattctcttacttcac ttagaagggtataagtccagctgccagccttttggggactgggtaggagaaggcgactgg aggtcttaccatttggtggccgctgcagctgcccgagagcgcaggcgcagaggcagacca cgtgagagcctggccaggccttccggcctagcctcactgtggccccgcccctctcgaacg ccttcgcgcgatcgccctggaaacgcattctctgcgaccggcagccgccaatgggaaggg agtgagtgccacgaacaggccaataaggagggagcagtgcggggtttaaatctgaggcta ggctggctcttctcggcgtgctgcggcggaacggctgttggtttctgctgggtgtccttg gctggtcgggcctccggtgttctgcttctccccgctgagctgctgcctggtgaagaggaa gccatggcgctccgagtcaccaggaactcgaaaattaatgctgaaaataaggcgaagatc aacatggcaggcgcaaagcgcgttcctacggcccctgctgcaacctccaagcccggactg aggccaagaacagctcttggggacattggtaacaaagtcagtgaacaactgcaggccaaa atgcctatgaagaaggaagcaaaaccttcagctactggaaaagtcattgataaaaaacta ccaaaacctcttgaaaaggtacctatgctggtgccagtgccagtgtctgagccagtgcca gagccagaacctgagccagaacctgagcctgttaaagaagaaaaactttcgcctgagcct attttggttgatactgcctctccaagcccaatggaaacatctggatgtgcccctgcagaa gaagacctgtgtcaggctttctctgatgtaattcttgcagtaaatgatgtggatgcagaa gatggagctgatccaaacctttgtagtgaatatgtgaaagatatttatgcttatctgaga caacttgaggaagagcaagcagtcagaccaaaatacctactgggtcgggaagtcactgga aacatgagagccatcctaattgactggctagtacaggttcaaatgaaattcaggttgttg caggagaccatgtacatgactgtctccattattgatcggttcatgcagaataattgtgtg cccaagaagatgctgcagctggttggtgtcactgccatgtttattgcaagcaaatatgaa gaaatgtaccctccagaaattggtgactttgcttttgtgactgacaacacttatactaag caccaaatcagacagatggaaatgaagattctaagagctttaaactttggtctgggtcgg cctctacctttgcacttccttcggagagcatctaagattggagaggttgatgtcgagcaa catactttggccaaatacctgatggaactaactatgttggactatgacatggtgcacttt cctccttctcaaattgcagcaggagctttttgcttagcactgaaaattctggataatggt gaatggacaccaactctacaacattacctgtcatatactgaagaatctcttcttccagtt atgcagcacctggctaagaatgtagtcatggtaaatcaaggacttacaaagcacatgact gtcaagaacaagtatgccacatcgaagcatgctaagatcagcactctaccacagctgaat tctgcactagttcaagatttagccaaggctgtggcaaagagatttcctattgtgtccgaa atttgttccttccggtgggttcttggtctcgttgacttcaagaatgaagccacggaccct cgcggagtgaagccgcagaccttcgacctttgcagtgagtgttacagtgttacagctcat aaagacagcactgttctcttctcccggtgggttcatggtctcgctggcttcaggagtgaa gctgcagaccttccgccgtctgcagttattttcaccagtagagcccggatgaaagaggag cccgtagtaaccacggcaaccgaaaaacaaggcggaaaggtggcgggaaaagcgaccttt tctgagcgcgtttgcctgttgagtggtagcctttcccctcaaccagcaatggaggagcag ccccagatgcaagacgccgacgagcccgcggactccggaggggaaggccgggcaggcggg ccaccgcaggtcgccggcgcccaggcggcgtgcagcgaggaccgcatgaccctgctcctc aggctgagagcacagacaaaacaacaactcttagaatataaatcaatggttgatgcaaaa ttaaaacaagcttcagaaagtaagcttttagaaatacagactgaaaagaacaaacagaag attgatttggacagtatggaaaactcagagaggataaagatcatacgacaaaacctacag atggagataaaaattactactgttattcaacatgtgttccagaaccttattttggggagt aaagtcaattgggcagaggatcctgcccttaaggaaattgttctgcagcttgagaagaat gttgacatgatgtaa >gi568815593f:69089635_69309796|GENSCAN_predicted_peptide_2|217_aa MTLNEHAAFKHLFNKAHLAPPLIHLTLSGHSTFQRARGWGQTLGEVGENPLRRGRSTRGQ LPDHSRDLEGTVGQIQGSLPSTPRYRLPSDWLIGGTTANPRRRLARLPPGSPHPHTPKTN KVLAEERGIPAVGTGPTCQDAARLPQGSNPRRRTHLTARSNRLKVPSSHTPAGAVTGPPQ RRGHRSASATAFTPRRRRHSQHRPYRPRCGNATNCFT >gi568815593f:69089635_69309796|GENSCAN_predicted_CDS_2|654_bp atgactcttaacgagcatgctgccttcaagcatctgtttaacaaagcacatcttgcaccg cccttaatccatttaaccctgagtggacacagcacgtttcagagagcacggggttggggc cagacgcttggagaggttggggagaacccgctccggcgaggacgcagtacgcggggccag ttgcccgaccattcccgagaccttgaagggacagtgggtcagatccagggttcgcttcct tccacgccccgctaccgccttccttccgattggctgatcgggggcaccacagccaatcct agacggcgattggccaggctaccgccagggtcgcctcacccgcacacgcctaagactaac aaggtcctggcggaggagcgggggatcccggcagtaggtacaggacctacctgccaagat gctgcacggttgccgcaaggctcgaaccctcggcgccggacccacctcactgcccggtcc aacaggctcaaagtcccatcctcacacacacccgcgggcgccgtcactggccctccgcag aggcgcggtcaccgctctgcctctgccaccgcctttactccccgacgccgtcgccatagt cagcaccgtccctaccgcccaagatgcggaaacgcgacaaattgctttacctga >gi568815593f:69089635_69309796|GENSCAN_predicted_peptide_3|846_aa MALDVKSRAKRYEKLDFLGEGQVRLSGRTGRAPSGQPRAASTFAGFPVEARGLAWLLVLV GGNRPDALAAHSLHPGGVSAGSHYSCECAVSYLKPICLGVSPEAHGEALVTTADYSGPRG SILLDAFGHKSNISLVFDFMETDLEVIIKDNSLVLTPSHIKAYMLMTLQGLEYLHQHWIL HRDLKPNNLLLDENGVLKLADFGLAKSFGSPNRAYTHQVVTRWYRAPELLFGARMYGVGV DMWAVGCILAELLLRSIPEIHTENLSTTTIHSWPRRKQNSIAPNVMTDHSIEKANEERQQ LRVDRLEIVDLLTIILGNLLAIGIVAGIIFKVIRKMTCRLKNCTLLPYLLSYFKLFLCSI EKVPFLPGDSDLDQLTRIFETLGTPTEEQWPDMCSLPDYVTFKSFPGIPLHHIFSAAGDD LLDLIQGLFLFNPCARITATQALKMKYFSNRPGPTPGCQLPRPNCPVETLKEQSNPALAI KRKRTEALEQGQQQCIKPTRASNLSDFLVCVCRQGAAKGEAAKQLRVQEIIAPKGSEDTQ NVSNTKRGNLISRTSESRSTQWNVVSRAQPSEPGAMQVDSVRIEGNDRIPSWCVGNPPPL ASGVRSAELSGCQKQLRPGKGQYQSQRVAARTPDSRCYFLTSVLPHQLCRGPCSCLAGAK RFVAFCPCPRDLWNFELERDDLEYLVEEISKQQSIQEMAWVLLKAFHFKREAQHKSSENL QPDNAIEKKIPFSKDKFKLAADICISNEEPNVNPQDNGENVSRACQRPLQQPLSSQSQKS RRKKWFRGQGPGSPCCVQSKDLVPCVPATPAVAERGQGTAPSVNIEGASPKPWQLPHDVE PAGAQK >gi568815593f:69089635_69309796|GENSCAN_predicted_CDS_3|2541_bp atggctctggacgtgaagtctcgggcaaagcgttatgagaagctggacttccttggggag ggacaggtgaggctctctggaaggacggggagggccccaagcggacagccccgcgccgcc tccacctttgcgggttttcccgtggaggccagaggtctggcttggctgctcgttctcgtt gggggaaaccgtccagacgcacttgctgcccattctttacatcctgggggagtctctgct ggtagccactacagctgtgaatgtgctgtgtcatacctgaagccaatatgtctcggagtt tcacctgaggcccatggtgaggccctggttaccactgctgattattcagggcccaggggc tctatactccttgatgcttttggacataaatctaatattagccttgtctttgattttatg gaaactgatctagaggttataataaaggataatagtcttgtgctgacaccatcacacatc aaagcctacatgttgatgactcttcaaggattagaatatttacatcaacattggatccta catagggatctgaaaccaaacaacttgttgctagatgaaaatggagttctaaaactggca gattttggcctggccaaatcttttgggagccccaatagagcttatacacatcaggttgta accaggtggtatcgggcccccgagttactatttggagctaggatgtatggtgtaggtgtg gacatgtgggctgttggctgtatattagcagagttacttctaaggagtataccagaaatt cacactgagaatttatcaaccaccacaatccattcatggccaagaagaaaacagaattcc atagcccctaatgttatgactgatcactccattgagaaagcaaatgaagagagacagcag ttgagagtagacaggttagagattgtggacctgttaacaatcatacttggaaatttatta gccattggcattgttgctggaatcatctttaaagttataaggaaaatgacatgcagacta aagaattgtactcttctgccataccttttgtcctattttaagctgttcttgtgctcaatt gaaaaggttccttttttgccaggagattcagaccttgatcagctaacaagaatatttgaa actttgggcacaccaactgaggaacagtggccggacatgtgtagtcttccagattatgtg acatttaagagtttccctggaatacctttgcatcacatcttcagtgcagcaggagacgac ttactagatctcatacaaggcttattcttatttaatccatgtgctcgaattacggccaca caggcactgaaaatgaagtatttcagtaatcggccagggccaacacctggatgtcagctg ccaagaccaaactgtccagtggaaaccttaaaggagcaatcaaatccagctttggcaata aaaaggaaaagaacagaggccttagaacaaggacagcagcagtgcatcaaacccactcgt gcttcaaatctgtctgacttcctcgtgtgtgtgtgtagacagggagctgccaagggagaa gcagccaagcagctaagagtccaggagataatagcgccaaagggatctgaggacactcaa aatgtgtctaacaccaaaagaggtaacttgatctctaggacaagtgagtccaggagcaca cagtggaatgtagtgagcagagcacagccttcagagccaggagcaatgcaggtagacagt gtgagaattgagggaaatgataggatacccagctggtgtgtggggaatcccccaccccta gcatctggtgtcagaagcgctgagctgagtggttgccaaaagcaactaagacctgggaag ggccaatatcaatcacaacgggttgcggccaggacccctgactcccggtgctacttcctc acctcagtgctccctcaccagctgtgtcggggcccttgttcctgcctggctggagcaaag agatttgtggcattttgcccctgccctagagatctgtggaactttgaacttgagagagat gatttagagtatctggtggaagaaatttctaagcagcaaagcattcaagagatggcttgg gtactgttaaaggcattccattttaaaagggaagcacagcataaaagttcagaaaatttg cagcctgacaatgcaatagaaaagaaaatcccattttctaaggataaattcaagctagct gcagatatttgcataagtaatgaggagccaaatgttaatccccaagataatggggaaaat gtctccagggcatgtcagagacctttgcagcagcctctctcatcacagagccagaagtct cggaggaaaaaatggtttcgtgggcagggcccagggtccccgtgctgtgtgcagtctaag gacttggtgccctgcgtcccagccactccagcggtggctgaaaggggccaaggtacagct ccgtctgtgaatatagagggtgcaagccccaagccttggcagcttccacatgatgttgag cctgcaggtgcacagaagtaa