GENSCAN 1.0 Date run: 4-Nov-116 Time: 15:44:46

Sequence gi568815584r:61180095_61381147 : 201053 bp : 42.27% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   5595   5805  211  2  1   45   30   276 0.188  14.53
 1.02 Intr +   6585   6763  179  1  2   47   20   124 0.088   0.22
 1.03 Intr +   7467   7574  108  2  0   36   94    64 0.033   1.36
 1.04 Intr +  25538  25627   90  1  0  125   98    47 0.624   8.67
 1.05 Intr +  29258  29324   67  1  1    8  107    84 0.348  -0.14
 1.06 Intr +  31021  31157  137  2  2   61   99   114 0.647   9.17
 1.07 Term +  35624  35692   69  1  0   99   42    24 0.099  -4.14
 1.08 PlyA +  35955  35960    6                               1.05

 2.07 PlyA -  36013  36008    6                               1.05
 2.06 Term -  37575  37368  208  2  1   60   44   150 0.757   3.63
 2.05 Intr -  40230  40080  151  1  1   88   54    94 0.464   4.30
 2.04 Intr -  64615  64540   76  1  1   77   66    94 0.469   4.27
 2.03 Intr -  67966  67769  198  1  0   74   93    89 0.564   6.63
 2.02 Intr -  76419  76270  150  0  0   39   71   125 0.466   5.34
 2.01 Init -  87325  87173  153  1  0   71   73    75 0.174   4.33
 2.00 Prom -  95147  95108   40                              -3.35

 3.02 PlyA -  95680  95675    6                               1.05
 3.01 Sngl - 101053  99998 1056  1  0  113   37  1511 0.999 145.39
 3.00 Prom - 106069 106030   40                              -6.15

 4.10 PlyA - 106120 106115    6                               1.05
 4.09 Term - 109009 108841  169  0  1  -36   48   150 0.240  -5.03
 4.08 Intr - 109846 109753   94  2  1  104   49   100 0.730   5.80
 4.07 Intr - 116503 116302  202  1  1   99   37   116 0.779   5.54
 4.06 Intr - 118422 118357   66  0  0   67  106    57 0.904   3.58
 4.05 Intr - 120841 120696  146  2  2   12   68    79 0.361  -2.62
 4.04 Intr - 121072 120862  211  1  1   48   64   123 0.483   3.46
 4.03 Intr - 122275 122139  137  2  2   37  115    23 0.558  -0.73
 4.02 Intr - 129260 129203   58  2  1   98   76    62 0.149   3.54
 4.01 Init - 131364 130792  573  0  0   86   55   210 0.113  12.67
 4.00 Prom - 131836 131797   40                              -7.15

 5.00 Prom + 132597 132636   40                              -6.25
 5.01 Init + 133589 133638   50  1  2   86   72    44 0.649   3.07
 5.02 Intr + 141417 141576  160  0  1   54   49   155 0.856   7.27
 5.03 Intr + 141900 142539  640  2  1   53   57   767 0.409  60.80
 5.04 Intr + 142598 142667   70  0  1   73   82    86 0.462   3.82
 5.05 Intr + 160009 160122  114  1  0   78   82    78 0.001   4.84
 5.06 Intr + 164408 164535  128  2  2   34  101   129 0.028   8.20
 5.07 Intr + 187295 187348   54  0  0   39   68    93 0.511   0.43
 5.08 Term + 188294 188520  227  0  2   89   48   164 0.937   8.46
 5.09 PlyA + 189707 189712    6                              -0.45

 6.04 PlyA - 189760 189755    6                               1.05
 6.03 Term - 191453 191297  157  1  1   47   42   103 0.317  -1.98
 6.02 Intr - 194313 194107  207  2  0  -11   62   161 0.181   0.87
 6.01 Init - 197093 196984  110  2  2   78   74   140 0.967  11.34

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +   6585   6796  212  1  2   47   50   147 0.820   3.27
S.002 Init - 170138 170089   50  2  2   53  100    97 0.934   7.87
S.003 Sngl + 178604 178852  249  1  0   47   46   212 0.888   5.83

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815584r:61180095_61381147|GENSCAN_predicted_peptide_1|286_aa
MVVMLLVLVAVVFLLVVLVVVVMLLVVVGVGGGSGEGSDDSGGSGGGGGGSGDSGSSGGG
GGGDIEHTVEDWCFPRTELPKTGRVCFHMEGHIYASEAIYLFLAIHLEQPQFSEEATCTT
LQQQRMIRFRNSACLSGCNACLLLLLQAPPGEGRCFCLVQDRIRVWVSLLSASLPEGFPQ
FKETGSALAQDRAKIQEEGPHQQLNYTGTKILDSSFQNGLILNCPAEEAGALCQGVAMIH
GPGLGEAEGRARGTVGCSDMPLCEGLPCFDRRVRPCYSYVPLSMLQ

>gi568815584r:61180095_61381147|GENSCAN_predicted_CDS_1|861_bp
atggtagtgatgttgctggtgttggtggcagtggtgtttttgttggtagtgctggttgtg
gtggtgatgttattggtagtggttggtgttggtggtggtagtggtgaaggtagtgatgat
agtggtggtagtggtggtggtggtggaggtagtggtgatagtggcagtagtggtggtggt
ggtggtggtgacattgagcatacagtagaggattggtgctttccacggactgagttgccg
aagaccgggagagtgtgttttcatatggaaggtcacatctatgcatcagaggctatttat
ctttttctggccattcatttagagcagccgcagttttctgaagaagccacatgcaccaca
ctccaacagcagagaatgattcggttcagaaactctgcatgcttatcggggtgtaatgcc
tgcctcctgctgctgctccaagctcctccaggagaaggacgctgcttctgcctggtccag
gatcgtatcagggtctgggtgtctctcctctctgcttccctacctgagggctttccccag
ttcaaggaaactggttcggcccttgcacaggacagagctaaaatacaggaagaaggccct
catcagcaactcaactatactggcaccaagatcttggattccagcttccagaacggcttg
atactgaattgccctgcagaagaagctggtgccctttgccagggagtggccatgatacat
ggcccaggacttggagaagctgaaggaagagctcgaggaactgtgggttgcagtgatatg
cccctgtgtgagggcttgccatgtttcgatagaagagtacgaccctgctatagctatgtt
ccactttctatgttgcaataa

>gi568815584r:61180095_61381147|GENSCAN_predicted_peptide_2|311_aa
MHLGLECQKSENIKESFSVPEVSLNREALLDRHTSRLRALHLAVLSQSELYDQKASTISH
RRALSSGKISTDGTKAMTEHTPQAVGFTDGSMLRNQLRGIQKHTPRPYKEVLQLHHCEMT
LDDYISKLMEYPVFESHFRIIENKAHGDMEVSLPMPVFPDDLYCCLQEKVDYQMPVTVAP
ESTKPLSYSRTSAIALAEVIKKVPLLGVHLLGRLCSLTEEMSLDEGDQEPSPLTTISGVW
YIWKVLVAFPLVSEEEEADDVPVEAEVILWRGWEDVETGNRGGRTKQTQSCSSKCEWRLR
RRTSTNLPSTL

>gi568815584r:61180095_61381147|GENSCAN_predicted_CDS_2|936_bp
atgcacttggggctagagtgtcagaagagtgagaatattaaagagtctttctcagtccct
gaggtctccttgaaccgggaagctttgctagataggcatacatcaaggctccgagcatta
catctcgctgtactctctcaatctgaattgtatgatcaaaaagccagtacaatttcgcat
agacgagcattgtcttcaggaaaaataagcacagatggaaccaaagctatgacagaacac
accccacaggcggtgggcttcactgacggttctatgctcagaaaccaactccggggaatt
cagaaacacactcccagaccatacaaggaagtcttacagctgcaccactgtgaaatgaca
ctggatgattacatctccaagttgatggagtatcctgtatttgaaagccattttagaatt
atagaaaataaggcccatggggacatggaggtcagcctaccaatgccagtatttccagat
gatctctactgttgtcttcaggaaaaggttgactaccagatgcccgttactgtggctcca
gagtccactaaacctctgagttattccaggaccagcgctatagctctggccgaggtgatc
aaaaaagttcccctcttgggtgtacacctgctaggaagactgtgctccctgacagaggag
atgtccttggatgaaggcgatcaggagccttcacctctcacaaccatttcaggagtgtgg
tacatttggaaggttctggtggcattcccattagttagtgaggaggaagaagctgatgat
gttcctgttgaggctgaggtgatcctatggagaggttgggaagatgtggaaacaggaaat
agaggaggccggactaagcagacacagagctgctcctctaagtgcgagtggagactcaga
agaagaactagcactaatctgccctctacactatga

>gi568815584r:61180095_61381147|GENSCAN_predicted_peptide_3|351_aa
MTWSATARGAHQPDNTAFTQQRLPAWQPLLSASIALPLFFCAGLAFIGLGLGLYYSSNGI
KELEYDYTGDPGTGNCSVCAAAGQGRALPPPCSCAWYFSLPELFQGPVYLYYELTNFYQN
NRRYGVSRDDAQLSGLPSALRHPVNECAPYQRSAAGLPIAPCGAIANSLFNDSFSLWHQR
QPGGPYVEVPLDRSGIAWWTDYHVKFRNPPLVNGSLALAFQGTAPPPNWRRPVYELSPDP
NNTGFINQDFVVWMRTAALPTFRKLYARIRQGNYSAGLPRGAYRVNITYNYPVRAFGGHK
LLIFSSISWMGGKNPFLGIAYLVVGSLCILTGFVMLVVYIRYQDQDDDDEE

>gi568815584r:61180095_61381147|GENSCAN_predicted_CDS_3|1056_bp
atgacctggagcgccacggcccggggcgcccaccagcccgacaacaccgccttcactcag
cagcgcctccccgcctggcagccgctgctgtcggccagcatcgcgctgccgctcttcttc
tgcgcgggcctggccttcatcggcctgggcctgggcctctactactcctccaacggcatc
aaggagctggagtacgactatacaggcgacccgggcaccggtaactgctcggtgtgcgcc
gcggctggccagggccgggcgctgccgcccccctgctcgtgcgcctggtacttctcgctg
cccgagctcttccagggcccagtgtacctctactacgagctgaccaacttctaccagaac
aaccggcgctacggcgtgtcccgcgacgacgcgcagctgagcggactccccagcgcgctg
cgccaccctgtcaacgagtgcgccccctaccagcgcagcgcggccggcctgcccatcgcg
ccctgcggcgccatcgccaacagcctcttcaacgactccttctcgctttggcaccagcgc
cagcccggcgggccctacgtcgaggtgccgctcgaccgctccggcatcgcctggtggacc
gactaccacgtcaagttccgcaacccgccgctggtcaacggcagcctggcgttggccttc
cagggcacggcgcccccgcccaactggcgccggccagtctacgagctcagccccgacccc
aacaacaccggcttcatcaatcaggacttcgtggtgtggatgcgcacggcggcgctgccc
acgttccgcaaactgtacgcgcgcatccgccagggcaactactcggccgggctgccgcgg
ggcgcctaccgcgtcaacatcacctacaactacccggtgcgcgcgttcggcggccacaag
ctcctcatcttcagcagcatctcgtggatgggtggcaagaaccccttcctgggcatcgcc
tacctggtcgtcggctccctctgcatcctcaccggctttgtcatgctggtcgtctacatt
cgctaccaggaccaggacgacgacgacgaggagtga

>gi568815584r:61180095_61381147|GENSCAN_predicted_peptide_4|551_aa
MLVVIWTMKPRLRWSQMEMRNLLNTGAILLPRDRWHFASEVERDDLKLEFMFKREAEHKS
LKNLQPDDAIEKKNPFSGGKFKLATEICISNKEPNVNCQDNGENVSRACHRDLGSSPSHH
RSGGLGGKLGFMGQAQGPSALCSLGTWHPASHALLLQLQLWLKGANIQLRLLLQKVQAPS
LGSFHLALGLQIHDYAGGSTLPFAENLQIRGLICHKERHVSPASEGLCKTTEVDPSGKGK
NAICFTAYNLLKAGFMPGNRGHGHTRSGFKSRSLIGERKRKENLSPAERAGRLSGSSGPM
VKYTGFIDWLEEVVSDLHRAQRSVGPGEEADHPTLIFYSANGFSTLPVPRFLFFTIHKVD
KREDGTAMLNMPSPQIRDRDVELHFLVPNPGSSPHPWPFKTEEIMGLLNVSWTLEKESEA
ELRSIGSILERCTPWHDLHSSVLGSKEFSIAHTATQLWGHKCGSDAMTYPFPSLATRKPE
AALQTTTSKATGFHGSTGARVKAVATKSHRSQDMEVQKPRALLKMACLSCREKQRQDNAE
RRELPDDVLLL

>gi568815584r:61180095_61381147|GENSCAN_predicted_CDS_4|1656_bp
atgctggtagtgatatggacaatgaagcccaggctgaggtggtctcagatggagatgagg
aacttattgaacactggagctatccttttgccaagagacagatggcattttgcctctgaa
gttgagagagacgatctgaaattggaatttatgtttaaaagggaagcagagcataaaagt
ttgaaaaatttgcagcctgatgatgcaatagaaaagaaaaacccattttctggggggaaa
ttcaagctggccacagaaatttgcataagtaacaaggagccaaatgttaattgccaagac
aatggggaaaatgtctccagggcatgtcacagagatcttggcagcagcccctcccaccac
agatctggaggcctaggaggaaaacttggtttcatgggccaggcccagggcccttctgct
ctgtgcagccttggaacatggcaccctgcatcccatgcactgctgcttcagctccagctg
tggctaaaaggggccaacatacagctcaggctgttgcttcagaaggtgcaagccccaagc
cttggcagcttccacttggcgttgggcctgcagatccatgactatgcagggggctcaacc
ctgccatttgcagagaaccttcaaatcagagggcttatatgccataaagaaaggcatgta
agtcctgcttcagaaggactgtgtaaaacaactgaagtggacccctctgggaaaggcaag
aatgctatatgcttcacagcctataatttgcttaaggcaggatttatgccaggaaatcga
ggacatggacacacgagaagtggatttaagagcagaagtttaataggcgaaagaaagaga
aaagagaatctctctcctgcagagagagcggggcgcctgagtgggtcttccggtcccatg
gtgaagtacacaggttttatagactggcttgaggaggtggtgtctgatttacatagggcc
caaagatcggttggaccaggagaagaagctgaccaccccaccctaatcttttattctgca
aatggattttctaccttgccagtgccacgttttctgttctttaccattcacaaggttgac
aaaagggaagatggaactgccatgctgaacatgcctagcccccagattagggatcgtgat
gtggagctgcacttcctggttccaaatcctggctcttcccctcatccctggcccttcaag
acagaggagattatggggctactgaacgtatcttggactttagagaaagaaagtgaagct
gagctgagatccattggttctattttggaaagatgcaccccgtggcatgaccttcacagc
tcagttctggggtcaaaggaattttccattgctcacactgccacccaactctgggggcat
aagtgtggaagtgatgcaatgacttatcctttcccttctctggccactaggaaacctgag
gcagccctgcagactaccaccagcaaggccacaggatttcatggcagcactggtgccaga
gtcaaagctgtggcaaccaaaagccatagaagccaagatatggaggtgcaaaaaccaaga
gccttgctgaagatggcctgcctgagctgcagagagaagcagaggcaagacaatgcagag
agaagggagctgcctgatgacgttctactcctgtga

>gi568815584r:61180095_61381147|GENSCAN_predicted_peptide_5|480_aa
MVEGKEKQVTSYMDGSRAGPRTAGHAPGVRKARERTRGFCELGTWRCEVLAGDAAAAAAS
WFEARRVGESGCLDSCTCPEGWPETGLPVLPLRSSAAPRGRGSGAGMSSGTMKFNGYLRV
RIGEAVGLQPTRWSLRHSLFKKGHQLLDPYLTVSVDQVRVGQTSTKQKTNKPTYNEEFCA
NVTDGGHLELAVFHETPLGYDHFVANCTLQFQELLRTTGASDTFEGWVSSGDPFPLCPPN
PRSPYVFQNSPGSRFPIALWLREFPPDFGLRPRGRGTGDATAVGQRLGLTGSRAGVQGGH
PFRPVFGLFPDSCLQMPFPALFIFLPGAHPQETALAKPRSDSVVSNLLERKRATSKVVES
AGWLPCDQTCSFKASGASCGTINPAAAEVSPDVIATEVTVDEVIVGKRPCTAMSSPLAAS
CRVFCLIPDTRRLLQTELRGKALQAEQALQSCCSVGQGSADFMLCALGHMSDPPVDISHQ

>gi568815584r:61180095_61381147|GENSCAN_predicted_CDS_5|1443_bp
atggtggaaggcaaggagaagcaagtcacatcttacatggatggcagcagggccgggccg
cggaccgcgggccacgcccctggggtccggaaggcgcgggagcggacgcgggggttctgt
gaacttggaacctggcggtgcgaggttctcgccggggatgcggcggcggcagctgcttcc
tggtttgaagctcgccgagtgggggagagcggctgcctcgactcctgcacctgtcccgag
ggctggcctgagacgggactcccggttctcccgctgcgaagcagcgcggccccccggggc
cggggcagcggcgccggcatgtcgtctggcaccatgaagttcaatggctatttgagggtc
cgcatcggtgaggcagtggggctgcagcccacccgctggtccctgcgccactcgctcttc
aagaagggccaccagctgctggacccctatctgacggtgagcgtggaccaggtgcgcgtg
ggccagaccagcaccaagcagaagaccaacaaacccacgtacaacgaggagttttgcgct
aacgtcaccgacggcggccacctcgagttggccgtcttccacgagacgcccctgggctac
gaccacttcgtggccaactgcaccctgcagttccaggagctgctgcgcacgaccggcgcc
tcggacaccttcgagggttgggtgagtagcggtgaccccttccctttgtgtccacccaac
ccccgttccccttatgtttttcaaaactcacccgggtcccgctttcccatcgctttgtgg
ctgagggaattccctccagactttggtcttcgtccacgtggccgcggcactggtgacgcg
accgcggtgggacagcggcttggcctaacgggaagtcgggcaggggtccagggcggtcac
cctttccgcccagtctttggcctcttccctgacagctgtctgcagatgcccttccctgcc
ctgttcatttttcttcccggtgcccatccccaggagactgcgctggcgaagcccaggagt
gacagtgtagtcagtaacttacttgagaggaaaagagccacaagcaaggtagtggagagt
gctggatggttgccttgtgaccagacctgttcttttaaggcctctggagccagctgtggg
acgatcaatccagcagctgcagaggtcagccctgatgtaattgccaccgaggtcaccgtg
gatgaggtcattgtggggaaaaggccctgtactgccatgtcctcccctctggctgcttca
tgccgggtcttctgcctaatacctgacacaaggagactcctccagactgaactgcgagga
aaggctttacaggcagaacaggctctgcagagttgctgctcagtaggccaggggtcagca
gacttcatgctgtgtgccttaggccacatgagtgatcctcctgtagatatttcccatcag
tga

>gi568815584r:61180095_61381147|GENSCAN_predicted_peptide_6|157_aa
MGHGNKQLQKRGEAKCADMEDEVFKGEMTVDDLGDDRVPIGALPSGAVGRQTLSSRPQNG
RSTDSLHLAPGKATGTQRHESSRRGCTLHRHRVELPKTFGAHPLHRKNIRQTQFEGHSIK
YLTRTCQAVKLTKTKESLRNCHRLEEPKQTYKHDVVS

>gi568815584r:61180095_61381147|GENSCAN_predicted_CDS_6|474_bp
atgggacatggaaacaagcagcttcaaaagagaggagaagccaaatgtgcagatatggaa
gatgaggtctttaaaggggagatgacagtagatgacctgggagatgacagagtccccatt
ggggcactacctagtggagctgtgggaagacagacattgtcctccagaccccagaatggt
agatccactgacagcttgcatcttgcacctggaaaagccacaggcactcaacgccacgaa
agcagtcgcaggggctgtaccctgcataggcacagagtagaactgcccaagacattcgga
gcccaccccttgcatcggaaaaacattagacaaacccaatttgaaggacattctataaaa
tacctgacgaggacttgtcaagctgtcaagctcaccaaaaccaaggaaagtctgagaaac
tgtcacagactagaagaacctaagcagacatataaacatgatgtggtatcttga