GENSCAN 1.0 Date run: 7-Nov-116 Time: 14:29:58 Sequence gi568815586r:31182715_31398204 : 215490 bp : 42.90% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.07 PlyA - 163 158 6 1.05 1.06 Term - 2986 2933 54 1 0 98 44 48 0.322 -2.02 1.05 Intr - 5972 5855 118 1 1 72 110 40 0.367 4.15 1.04 Intr - 11382 11298 85 1 1 70 89 91 0.660 5.36 1.03 Intr - 17865 17709 157 0 1 79 113 124 0.951 12.66 1.02 Intr - 18521 18338 184 1 1 61 84 153 0.919 11.07 1.01 Init - 24809 24760 50 2 2 95 32 61 0.029 1.67 1.00 Prom - 29003 28964 40 -5.25 2.00 Prom + 32027 32066 40 -3.75 2.01 Init + 34841 35201 361 1 1 61 5 257 0.388 11.99 2.02 Intr + 37808 37907 100 0 1 53 64 100 0.291 2.35 2.03 Intr + 39993 40101 109 0 1 88 75 57 0.221 3.77 2.04 Intr + 42077 42284 208 1 1 64 55 164 0.523 8.43 2.05 Intr + 44619 45068 450 1 0 63 76 244 0.656 13.05 2.06 Intr + 47019 47128 110 1 2 77 25 92 0.162 0.88 2.07 Intr + 50516 50909 394 1 1 55 60 185 0.258 5.70 2.08 Term + 53494 53906 413 2 2 47 38 243 0.124 9.62 2.09 PlyA + 55272 55277 6 1.05 3.14 PlyA - 56142 56137 6 1.05 3.13 Term - 69808 69275 534 1 0 58 52 655 0.886 52.16 3.12 Intr - 69918 69817 102 0 0 -5 42 185 0.893 4.05 3.11 Intr - 71162 70981 182 0 2 23 33 117 0.784 -1.63 3.10 Intr - 71643 71564 80 2 2 46 44 99 0.181 -0.42 3.09 Intr - 97071 96850 222 2 0 61 64 164 0.029 7.62 3.08 Intr - 100157 100002 156 1 0 98 7 153 0.045 6.40 3.07 Intr - 105070 104920 151 2 1 81 93 22 0.567 0.30 3.06 Intr - 111217 111091 127 1 1 79 92 57 0.748 4.53 3.05 Intr - 112619 112520 100 1 1 72 27 64 0.680 -2.11 3.04 Intr - 115548 115363 186 2 0 33 86 110 0.387 3.18 3.03 Intr - 118419 118283 137 0 2 62 77 131 0.907 7.85 3.02 Intr - 122754 122485 270 0 0 83 74 91 0.812 4.02 3.01 Init - 125962 125819 144 1 0 46 59 101 0.854 3.17 3.00 Prom - 127597 127558 40 -7.45 4.08 PlyA - 128030 128025 6 1.05 4.07 Term - 128769 128614 156 0 0 57 43 199 0.478 9.25 4.06 Intr - 138450 138361 90 0 0 98 97 8 0.033 1.97 4.05 Intr - 141308 141170 139 1 1 5 76 88 0.006 -1.15 4.04 Intr - 142495 142278 218 1 2 38 51 171 0.278 4.78 4.03 Intr - 142700 142512 189 2 0 6 90 269 0.871 17.76 4.02 Intr - 142962 142880 83 1 2 81 82 91 0.769 6.24 4.01 Init - 154783 154774 10 1 1 114 97 7 0.676 5.05 4.00 Prom - 157537 157498 40 -9.65 5.00 Prom + 158268 158307 40 -4.25 5.01 Init + 159709 159782 74 0 2 84 69 45 0.475 2.79 5.02 Intr + 164228 164402 175 2 1 24 81 113 0.579 3.22 5.03 Term + 169244 169351 108 1 0 80 49 67 0.224 -0.47 5.04 PlyA + 170122 170127 6 1.05 6.00 Prom + 173214 173253 40 -5.35 6.01 Init + 178019 178186 168 1 0 54 110 103 0.813 8.68 6.02 Intr + 181013 181068 56 1 2 113 52 36 0.016 -0.64 6.03 Term + 187369 187543 175 1 1 56 41 155 0.645 3.85 6.04 PlyA + 188177 188182 6 1.05 7.05 PlyA - 190823 190818 6 1.05 7.04 Term - 205072 204889 184 0 1 58 40 212 0.780 9.43 7.03 Intr - 206784 206610 175 1 1 82 82 94 0.744 6.28 7.02 Intr - 209679 209553 127 0 1 60 67 55 0.597 -0.07 7.01 Intr - 209982 209900 83 1 2 78 53 77 0.507 1.64 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 55095 55210 116 1 2 90 47 153 0.855 9.15 S.002 Term - 100157 99998 160 1 1 98 42 143 0.937 7.13 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:31182715_31398204|GENSCAN_predicted_peptide_1|215_aa MAHEDTAPGDPENMRSRQYVLLIPSVLQEGSLDKACAQLFNLTESVVLTVSLNYGEVQTK IFEENVTGENFFKCISFEVPQARSDPLAFITFSAKGATLNLEERRSVAIRSRENVVFVQT DKPTYKPGQKVLPKFQMTVDAPENILVVDSEFKVNVCALYTYGEPVDGKVQLSVCRESTA YHSCAHLISSLCKNFTIQLGKDGCVSKFINTDAFE >gi568815586r:31182715_31398204|GENSCAN_predicted_CDS_1|648_bp atggcccatgaagacacagccccaggagatcctgagaacatgcgctcaaggcagtatgtt ctgctgattccttctgttctacaagaaggctctttggataaagcttgtgcccagcttttt aatctcactgaatctgttgttttgacggtctccctcaactatggtgaggtccagaccaaa atatttgaagaaaatgttactggagaaaatttcttcaaatgcatcagctttgaggttcct caggccagatctgacccactggcatttattacattttctgctaaaggagccactctcaac ctggaagagaggagatctgtggcaatcagatccagagagaatgtggtctttgtacagact gataaacccacctacaagcctggacagaaagtgttacccaaatttcaaatgactgtggat gcaccagaaaatatcttagttgtggactctgaattcaaagtgaatgtctgtgccttatat acctatggtgaacctgtggacgggaaggtccaacttagtgtgtgcagagaatctacggct tatcattcatgtgctcatcttatcagttcactctgtaaaaattttaccattcagttgggg aaagatggctgtgtctccaagtttattaacacagatgcttttgagtga >gi568815586r:31182715_31398204|GENSCAN_predicted_peptide_2|714_aa MVTSASLNYLSLNYLEGEKELVMSGAPVLADTARSSALAIIACKTGSSEGLQALRKPRTP TSGQGKVGKISGRRRSRADHGEGSSRLRRQTLCKRTPRRTKQQSNSKQRLWLELCLSRSR TPKDTVLATSLVSLRVQHNGEHTVGDGCGDLELREDSWVFQDQESPKGLLPEGVLLGLER GLPRTTPSDQTHPSSVKISHSEPSTLPPPPSDPMVHSHQGGCHSAGISSPGQLIAGKATA PSSLCCQIPGPVDSMCHTADCWGQKQHHVSTLHLGVHCLGQGAPRQALLLTLTPSSSDGQ AVRTASPQVPGTRELTTDSQGQLVSPWFPAREPVSTVEPQDSTIRAGFSKAVPLSSVTSR PFAAHCWHLAWALRLPNSSLQRKQETNGAFAALLSQTHARPAHGKGAREAPLGTLVSLGP LPTVRAYRGLEPTPTTLTTAGLRALMLLAYRLLLHPPCHDPVRSSDNPPRRPSPREQTRN QDSWVSAPATPLSAPMQLTPTQFTSHESPAEPPKRCSEVTAAATSTRQTKESPHPSEPVC WDPPQAAEQGHGPGCLAGHHGGLGKWEYRCHDDHINEEQAVSFSIITCPPPPTEPGEEST ACYQKSPTARTRMIRLQRLGSTVPGLLISSLPHPHFFQSPESPNQSLSKFHRICQIQIQS TAGRVLFASHLIYCRKQPEGSKEEDSWYERALKPTSCAAQRKLPNLSELQLLQL >gi568815586r:31182715_31398204|GENSCAN_predicted_CDS_2|2145_bp atggtcacttctgcgtcactgaattacctctcactgaattacctcgaaggtgaaaaggag ctggtgatgtctggtgctcctgtcctggctgacacagctaggtcctcagcattggctatt attgcctgcaagacggggagtagtgaggggcttcaggcactcaggaaacccagaactccc acaagtgggcagggcaaagtaggcaaaatttcagggaggaggcggtctagagcagatcac ggagagggaagctccaggcttaggagacagacactctgcaagagaacaccgcggaggacg aaacagcagagcaacagcaagcagcgcctctggctggagctgtgcctttcaaggagccgc actcctaaggacaccgtgttggccacatctttggtgtctctcagggtccagcacaatggg gagcacacggtgggagatggatgtggtgacctggaactcagggaagacagctgggtcttc caggaccaggaatccccaaaggggctgctcccagagggtgtattgctgggactggagaga ggactgcccagaaccaccccctctgatcagacacatccatcatctgtcaagatcagccac tcagagccctcaaccttgccccctcctccttcagaccccatggtacactcacaccaagga gggtgtcacagtgcagggatttcatccccagggcagctcattgctgggaaggccacagca cccagctccctctgctgccagattccaggccctgtggacagcatgtgtcacacagcagac tgctggggtcaaaaacagcatcacgtaagcaccctccaccttggagtgcactgtttgggg caaggtgcccccagacaggccctgttgctaacacttacacccagcagttccgatggacaa gctgtaaggacagcctcgccccaggtgccaggcaccagagaacttacaactgacagccaa ggccagctggtaagcccttggttcccagcgagagagcctgtgagcacagtggagcctcag gacagcacgatcagagctggcttctcaaaggcggtgcccttgagctcagtcacctctcgc ccctttgccgcccattgctggcacctggcttgggctctccgtctgcccaacagctcactt cagaggaaacaggaaaccaacggggccttcgctgccctcctcagccaaacccatgcaagg ccagcccatggaaagggagccagggaggccccactgggcacactggtaagtctggggcct ctgcccacagtcagggcctatcgtggcctggagcccacacctaccacactgactacagca ggactgagggcactcatgctccttgcttatcgccttcttcttcaccctccttgtcacgac cctgtcagaagctctgataaccctccaagaagaccaagtccaagagagcagaccaggaac caggactcctgggtttcagccccagcaacgccactgtctgcacctatgcaactaacccct acccagttcacatcacatgaaagcccagcagagcccccaaagagatgttcagaagtcact gctgctgctacttctacaaggcaaacaaaggagagcccccatccctcagaacctgtttgc tgggatccacctcaggcagcagaacagggacatggaccaggctgcctggcagggcaccac ggagggctggggaaatgggaatatagatgccatgatgatcatattaatgaggaacaagct gtgtccttcagcatcattacctgccctccccctccgactgaacctggagaagagtcaaca gcctgctaccaaaaatcccccacagcccggactcggatgattcgcctacaaaggctgggg agcacagtcccaggcttactaatctcctctctaccccatccccattttttccagtcccca gaaagtcccaatcaatctctcagcaaattccacaggatctgtcagattcagatccagtca actgcaggccgagtgctcttcgcatctcatcttatctactgcaggaagcaacctgagggc agcaaagaggaagacagctggtatgaacgggctctgaaacctaccagctgtgcggcccag agaaagctacctaacctctctgagcttcagctgcttcagctgtaa >gi568815586r:31182715_31398204|GENSCAN_predicted_peptide_3|796_aa MNLADSDFSQGILIPHPVYAYLLLCSDTLSSSEEENLYLLQQLTITLRLSSHEIVTGRPR HMGIKMTDTNLLNGDILHCCKGLVHQLMKSQTLVQDTSHSALLGDEDSGHDLQPGNFVCW KRHLIKDSLQPWWRGPYQLSSVVTGTNYGVLVKSNLSEGSSHPRDELNADSFEDLFIWGK GKRCPQQRYVNSLGFTDYCPEEKMFGFHKPKMYRSIEGCCICRAKSSSSRFTDSKRYEKD FQSCFGLHETRSGDICNACVLLVKRWKKLPAGSKKNWNHVVDARAGPSLKTTLKPKKVKT LSGNRIKSNQISKLQKEFKRHNSDAHSTTSSASPAQSPCYSNQSDDGSDTEMASGSNRTP VFSFLDLTYWKRQKICCGIIYKGRFGEVLIDTHLFKPCCSNKKAAAEKPEEQGPEPLPIS TQEWTECPDLSLVVSWRPPLSSLPAGCSLHGLAADWTTDMITVTTSKCREPSCQQEGSFQ VLPISKGRGSYEGLDTKREAQWQNGNSSMRTRQSTKVEQLHVEEKPVEPQRVSRHHFEYQ LGLFGSIDVSDDRNDKQYCPEGSFKEDDPQTELPAAAFKELCSCVRRKAFPKRLAKTAEV QVLVLDGRGHLLGRLAAIVVLLGRKVVVVRCEGINISGNFYRNKLKYLAFLRKRMNTNSY RGSYHFRAPSRIFWRTVRGMLPHKTKRGQAALDRLKVFDGIPPPYDKKKRMVVPVALKVV RLKPTRKFAYLGRLAHEFGWKYQAGTATLEEKRKEKAKIHYGKKKQLMRLRKQAEKNVEK KIDKYTEVLKTHGLLI >gi568815586r:31182715_31398204|GENSCAN_predicted_CDS_3|2391_bp atgaatttagcagactcggacttctctcaggggatcctcatccctcatccagtgtatgca tacctcctcctttgctctgacacactttcttcctctgaagaagaaaacttgtacttactt caacaattgaccattacattaaggctctcttctcatgaaattgtaactggaagacccagg cacatgggaatcaagatgactgatacaaatttactgaatggggacatcttgcactgttgt aagggacttgttcaccagctcatgaaaagccagaccttagtgcaagacacgtcccacagt gcgctcctgggagatgaagattctggtcatgacctccaacctggaaattttgtctgttgg aaaagacatctcataaaggactccctccagccttggtggaggggcccttaccagcttagc agtgtggtaactggcacaaactatggcgtcttggtcaaatccaacctcagtgaaggtagt tcacatcctagagatgagctgaatgctgatagctttgaggacttgtttatttgggggaag ggaaagagatgccctcaacaaagatatgtcaattcccttggctttacagactattgccca gaagaaaagatgtttggttttcacaagccaaagatgtaccgaagtatagagggctgctgt atttgcagagctaagtcctccagttctcgattcactgacagtaaacgctatgaaaaggac ttccagagctgttttggattgcatgagactcgttcaggagacatctgcaatgcctgtgtc ctgcttgtgaaaagatggaagaagttgccagcaggatcaaaaaaaaactggaatcatgtg gtagatgcaagggctggacccagtctaaagactacattgaaaccaaagaaagtgaaaact ctatctgggaacaggataaaaagcaaccagatcagtaaactgcagaaggaatttaaacgt cataattctgatgctcacagtaccacctcaagtgcctccccagctcaatctccttgttac agtaaccagtcagatgacggctcagatacagagatggcttctggttctaacagaacacca gttttttcctttttagatctcacttactggaaaagacagaagatatgttgtgggatcatc tataaaggccgttttggggaagtcctcattgacacacatctcttcaagccttgctgcagc aataagaaagcagctgctgagaagccagaggagcaggggccagagcctctgcccatctcc actcaggagtggactgagtgtccagacctctcactggttgtcagctggaggcccccgctt agttccttgccagctgggtgctccctgcatggcctggctgctgactggaccactgacatg attactgtcactaccagcaagtgtcgagagccaagctgccagcaagaggggtcttttcag gtcctgcccatatccaaggggagaggctcatacgaaggattggataccaagagagaggct cagtggcagaatgggaattcttccatgaggactcgacagtcaaccaaagttgaacaatta catgtagaagagaagcctgtagaacctcagagggtgtcaaggcatcactttgagtatcag ttaggattatttggaagtatagatgtatcagatgatagaaatgacaagcagtactgccca gaagggagtttcaaagaggatgacccccaaacggaacttcctgcagctgcatttaaagaa ttgtgttcatgtgtgagaagaaaagcctttcccaagcggctggcgaagacggcagaggtg caggtcctggtgcttgatggtcgaggccatctcctgggccgcctggcggccatcgtggta ctgctgggccggaaggtggtggtcgtacgctgcgaaggcatcaacatttctggcaatttc tacagaaacaagttgaagtacctcgctttcctccgcaagcggatgaacaccaactcttac cgaggctcctaccacttccgggcccccagccgcatcttctggcggaccgtgcgaggtatg ctgccccacaagaccaagcgaggccaggccgctctggaccgcctcaaggtgtttgatggc atcccaccgccctacgacaagaaaaagcggatggtggttcctgttgccctcaaggttgtg cgtctgaagcctacaagaaagtttgcctatctggggcgcctggctcacgagtttggctgg aagtaccaggcagggacagccaccctggaggagaagaggaaagagaaagccaagatccac tacgggaagaagaaacagctcatgaggctacggaaacaggccgagaagaacgtggagaag aaaattgacaaatacacagaggtcctcaagacccacggcctcctgatctga >gi568815586r:31182715_31398204|GENSCAN_predicted_peptide_4|294_aa MPGLFSRGKEPGGRKAREESDSLATAQRPGSCWALPERTRRGVVCSTPEGCGCDNESNTR SLKNSPSSTRLPSSDRAQEPLVSGVLSEPEVLLPRRCGRHREGFPFMGGRPRCKMAREAN LAGDGGGGVVYAAGVSELRRALLFVLYVTDHVLRAATLVLVRMSSPAEHSYLLARRLSRQ SPECPRGKHDLRKVGTRGPWEERGGILLLSEKRFFIESATRIFLYYGNNLEMSSSPKREF VFKSKAKLWAIVKEFPDPYEDRICFTKEFELTIKIYDPGHSDIYRLVYMLVSAA >gi568815586r:31182715_31398204|GENSCAN_predicted_CDS_4|885_bp atgcccggcctgttctctcgcggaaaggagcccggcggccgaaaggcccgagaggaaagc gactcgttggccaccgcgcagcggccggggtcgtgctgggctctgccggagcgcacgcgg cgtggcgttgtttgctcaactccggagggatgcggatgcgacaatgagagcaacacgcgg tcgcttaaaaacagcccgtcttccactcgcctgccctcttcggaccgcgcccaagagcct ctggtgtccggggtgctttcggagcccgaagttttgctcccgcgtaggtgtgggcgacac cgcgagggcttcccgtttatgggcggccgtccgcgctgtaagatggccagggaagccaac ttggctggagatggaggcggcggcgttgtgtacgccgcgggagtcagtgaactccgcagg gcactgctttttgtgctctacgtgacagaccacgtgcttcgggctgcgaccctggtcctt gtccgaatgtccagccccgcggagcattcttacttgctggctcggcgtctctctcggcag tcaccggagtgcccgagggggaagcacgatttacggaaggtggggacccgagggccctgg gaggagcggggaggcattttactgctttcagagaaacggttttttattgaatctgcaact aggatttttctgtattatggaaataatctggaaatgtcatctagtccaaaaagggagttt gttttcaagtcgaaggccaaattatgggccatagttaaggaatttcctgacccttatgaa gatcggatttgctttaccaaagaatttgaactcacaattaaaatctatgacccaggtcat tctgacatttatcggcttgtctacatgctggtttcagctgcttaa >gi568815586r:31182715_31398204|GENSCAN_predicted_peptide_5|118_aa MQIYYSVKCMIGFLEDESGQVMQGLPLSIAPGAPSLRDAHSSLALQIQIGGTSSSPKILP LDSSYLKGSLTHFPTCSCRPTAKVCPRTVFVPKTQFKLVDHKEDLVNSHNPGCARAKP >gi568815586r:31182715_31398204|GENSCAN_predicted_CDS_5|357_bp atgcagatttattacagtgtcaagtgcatgattggatttttggaggatgagtctggccag gtcatgcagggcctaccactctccatagcccctggggcaccaagcctcagggacgcccac agctccctggctctccagatccagattggtggcacttcatcctctcccaaaatacttccc ttggattcttcatacctcaaggggtctctaacccactttcctacatgcagctgcagacct acagctaaggtttgtcctaggaccgtttttgttccaaaaactcaattcaagctagttgac cacaaagaagacctggttaattctcataaccctggatgcgccagggccaaaccttaa >gi568815586r:31182715_31398204|GENSCAN_predicted_peptide_6|132_aa MWALPFHRGEAKAKAQRPEHKHEKHLTTWESGAGSGTSIPLAFEQLFSTALTQHGRVLLT GHFVTWQLTFWGPSNKSYVWEVTRSPRKERRHGNLRIGFSFQEVALLWANHVTSGTVSNQ NKERAGPIFQIL >gi568815586r:31182715_31398204|GENSCAN_predicted_CDS_6|399_bp atgtgggcacttcccttccatcgtggtgaggccaaagcaaaggcacagagacctgaacac aaacatgaaaaacaccttaccacctgggagtcaggggcaggttcaggcacatccattcct ctggcatttgagcagctgttttccacagcattaactcagcacggcagggtccttctcaca gggcacttcgtgacatggcagctgactttctggggaccatccaataaatcctatgtctgg gaggtaacccgatcaccgcggaaagaacgccgacatggaaatcttaggattggtttctcg ttccaggaagttgcgctactttgggcaaatcacgtcacctctgggactgtttctaatcag aataaagaaagggctggaccaatctttcagatcctttga >gi568815586r:31182715_31398204|GENSCAN_predicted_peptide_7|189_aa XPNAGQIQEGIGEAVNNIVKHFHKPEKERGSLTVLLCGENGLVAALEQVFHHGFKSARIF HKNVFIWDFIEKVVAYFETTDQILDNEDDVLIQKSSCKTFCHYVNAINTAPRNIGKDGKF QILVCLGTRDRLLPQWIPLLAECPAITRMYEESALLRDRMTVNSLIRILQTIQDFTIVLE GSLIKGVDV >gi568815586r:31182715_31398204|GENSCAN_predicted_CDS_7|570_bp naacccaatgctgggcagatacaagaaggaattggagaagctgtgaacaatattgtgaaa cattttcataaacctgaaaaagagagaggaagcctcaccgtgttgctgtgtggagaaaat ggcctggttgcagcccttgagcaagttttccaccatgggttcaaatctgcccgcatcttt cacaagaatgtcttcatctgggacttcatagagaaagtggttgcttattttgaaacaact gaccagattctagataatgaagatgatgtccttattcagaaatcatcctgcaaaaccttc tgccactacgtaaatgctattaatactgcacccaggaacattgggaaggatggcaaattc cagattttagtttgccttggaacaagggatcgcctgctcccacagtggattccattgtta gctgagtgtcctgccatcactcgaatgtatgaagagagcgctctcctgcgagaccgcatg actgtcaactcccttatccgaattctgcagaccattcaggacttcaccatagtcctagaa ggatcactcatcaaaggagtggatgtgtaa