GENSCAN 1.0 Date run: 5-Nov-116 Time: 23:29:32 Sequence gi568815593r:156987105_157206962 : 219858 bp : 41.98% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5426 5446 21 1 0 125 91 -4 0.042 0.42 1.02 Term + 20509 20787 279 0 0 71 41 250 0.813 13.06 1.03 PlyA + 22335 22340 6 1.05 2.00 Prom + 26249 26288 40 -3.65 2.01 Init + 28070 28149 80 1 2 83 47 55 0.148 1.48 2.02 Intr + 36612 36720 109 0 1 109 91 25 0.628 4.27 2.03 Term + 38316 38471 156 2 0 65 48 77 0.553 -1.65 2.04 PlyA + 38876 38881 6 1.05 3.11 PlyA - 41511 41506 6 1.05 3.10 Term - 42737 42629 109 1 1 76 47 119 0.913 3.60 3.09 Intr - 45783 45750 34 1 1 82 121 16 0.935 0.66 3.08 Intr - 50257 50143 115 1 1 88 106 84 0.976 9.30 3.07 Intr - 50434 50335 100 0 1 36 106 39 0.149 -0.31 3.06 Intr - 60984 60800 185 0 2 55 110 39 0.065 0.46 3.05 Intr - 61480 61350 131 2 2 89 98 -9 0.059 -0.31 3.04 Intr - 65550 65257 294 1 0 70 98 407 0.073 35.96 3.03 Intr - 68429 68097 333 0 0 130 113 122 0.989 13.52 3.02 Intr - 70851 70794 58 0 1 101 111 2 0.550 1.34 3.01 Init - 73305 73195 111 0 0 58 61 93 0.455 3.86 3.00 Prom - 97360 97321 40 -3.65 4.08 PlyA - 97733 97728 6 -0.45 4.07 Term - 100158 99998 161 1 2 102 36 228 0.969 16.12 4.06 Intr - 108355 108148 208 1 1 88 -6 161 0.000 4.43 4.05 Intr - 111797 111754 44 0 2 102 98 -9 0.242 -1.46 4.04 Intr - 119858 119523 336 0 0 97 109 373 0.752 34.87 4.03 Intr - 146293 146249 45 2 0 63 87 73 0.036 2.16 4.02 Intr - 150685 150602 84 2 0 98 56 92 0.103 5.87 4.01 Init - 152327 151703 625 2 1 81 40 511 0.521 41.15 4.00 Prom - 156018 155979 40 -5.15 5.04 PlyA - 156612 156607 6 1.05 5.03 Term - 165722 165610 113 0 2 36 48 144 0.827 2.94 5.02 Intr - 170286 170238 49 0 1 70 115 8 0.052 -0.97 5.01 Init - 171522 171439 84 0 0 64 73 85 0.615 5.47 5.00 Prom - 172452 172413 40 -7.45 6.03 PlyA - 173311 173306 6 1.05 6.02 Term - 176554 175343 1212 1 0 70 43 995 0.988 83.85 6.01 Init - 178992 178354 639 0 0 75 6 445 0.379 30.38 6.00 Prom - 179416 179377 40 -4.95 7.03 PlyA - 180065 180060 6 1.05 7.02 Term - 202264 202153 112 0 1 106 28 72 0.006 0.05 7.01 Init - 215085 214961 125 0 2 93 99 119 0.999 13.19 7.00 Prom - 216771 216732 40 -3.15 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 65545 65262 284 2 2 10 46 566 0.906 39.20 S.002 Intr - 108355 108202 154 1 1 88 101 221 0.999 22.12 S.003 Init - 121879 121822 58 1 1 35 108 77 0.976 4.10 S.004 Sngl - 155618 155355 264 2 0 91 38 177 0.839 8.12 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:156987105_157206962|GENSCAN_predicted_peptide_1|99_aa VSGLLILGCRLLVVPPCWNLDSGSLIPTTPLGSAQLGTLYEGSNPTFHLGIVTVEALAGK GSTPQQASAWAPRLSHSTSETQVEAAKPSSLLHSVHLQA >gi568815593r:156987105_157206962|GENSCAN_predicted_CDS_1|300_bp gtttctgggctcctaatattgggttgccggctgctggtggtgccaccatgctggaatctg gacagtggcagtcttattccaacaactccactaggcagtgcccaactggggactctgtat gagggctccaaccccacatttcaccttggtattgtgacagtagaggctcttgcagggaag ggctccaccccgcagcaggcttctgcctgggcacccaggctttcccattcaacctctgaa acccaggtagaagctgccaagccttcttcactcttgcattctgtacacctacaggcctaa >gi568815593r:156987105_157206962|GENSCAN_predicted_peptide_2|114_aa MDEAGNHHSQQTITRTENQTLDVLTHREHYEPVTNIHKLDHENQSVNKLPLHFLGLPIFW RLQKQPLTNQLKLHHFLPPNKHPLPQWFSTVPAGTLSVVTTGGCDWHLVGGGQG >gi568815593r:156987105_157206962|GENSCAN_predicted_CDS_2|345_bp atggatgaagctggaaaccatcattctcagcaaactatcacaagaacagaaaaccaaaca ctggatgttctcactcatagagagcactacgaacctgtaacaaacatccacaaactagac catgaaaaccagagtgtaaataagcttcccctacattttttgggtcttcctatattttgg agactccagaagcagccgctgaccaaccagctaaagctacaccacttcctaccccccaac aaacaccctctacctcagtggttctcaactgtgcccgcagggacattgtcagttgtcacc actggaggatgtgactggcatctagtgggtggaggccagggatga >gi568815593r:156987105_157206962|GENSCAN_predicted_peptide_3|489_aa MENTATEKALGLRQVGKKALVAGAESAGERVVEEIYGADPIMHPQVVILSLILHLADSVA GSVKVGGEAGPSVTLPCHYSGAVTSMCWNRGSCSLFTCQNGIVWTNGTHVTYRKDTRYKL LGDLSRRDVSLTIENTAVSDSGVYCCRVEHRGWFNDMKITVSLEIVPPKVTTTPIVTTVP TVTTVRTSTTVPTTTTVPMTTVPTTTVPTTMSIPTTTTVLTTMTVSTTTSVPTTTSIPTT TSVPVTTTVSTFVPPMPLPRQNHEPVLGLQASATVPSCFVICKKDVIPRSCRMLVRIKWD ERFGTWLQGGNSQTGFSVSSHTTSTTIINSDEDFCDRMCGVFSPYIKQWTPAVSLIQFLH CLPGERVISHSCPRFSPYGYKLRSFGLTLMFQKSPSTNTGKCGWQLFLEHSLLTANTTKG IYAGVCISVLVLLALLGVIIAKKYFFKKEVQQLSVSFSSLQIKALQNAVEKEVQAEDNIY IENSLYATD >gi568815593r:156987105_157206962|GENSCAN_predicted_CDS_3|1470_bp atggagaacacagccacggaaaaggccttagggttgaggcaagttggaaagaaagctcta gtagctggggctgagtcagcaggggagagagtggtagaagaaatctatggggctgatccc ataatgcatcctcaagtggtcatcttaagcctcatcctacatctggcagattctgtagct ggttctgtaaaggttggtggagaggcaggtccatctgtcacactaccctgccactacagt ggagctgtcacatccatgtgctggaatagaggctcatgttctctattcacatgccaaaat ggcattgtctggaccaatggaacccacgtcacctatcggaaggacacacgctataagcta ttgggggacctttcaagaagggatgtctctttgaccatagaaaatacagctgtgtctgac agtggcgtatattgttgccgtgttgagcaccgtgggtggttcaatgacatgaaaatcacc gtatcattggagattgtgccacccaaggtcacgactactccaattgtcacaactgttcca accgtcacgactgttcgaacgagcaccactgttccaacgacaacgactgttccaatgacg actgttccaacgacaactgttccaacaacaatgagcattccaacgacaacgactgttctg acgacaatgactgtttcaacgacaacgagcgttccaacgacaacgagcattccaacaaca acaagtgttccagtgacaacaactgtctctacctttgttcctccaatgcctttgcccagg cagaaccatgaaccagtgctgggattacaggcatcagccaccgtgcccagctgttttgtc atttgtaaaaaggatgtaataccaagatcttgcagaatgcttgtgaggatcaaatgggat gaaagatttgggacatggttacaggggggaaactcccaaactgggttttctgtctcttct cacaccacgtcaacaacaatcatcaactcagatgaagacttctgtgaccgaatgtgtgga gttttttccccatacatcaagcagtggacaccggctgtgtctttaattcagttcttacac tgtctacccggagagagggtcatatcccacagttgtcctagattttcaccgtatggttat aaactgagatcatttggtcttacccttatgttccagaagtccccaagcaccaatacaggt aaatgtggctggcaactgttcctagaacatagtctactgacggccaataccactaaagga atctatgctggagtctgtatttctgtcttggtgcttcttgctcttttgggtgtcatcatt gccaaaaagtatttcttcaaaaaggaggttcaacaactaagtgtttcatttagcagcctt caaattaaagctttgcaaaatgcagttgaaaaggaagtccaagcagaagacaatatctac attgagaatagtctttatgccacggactaa >gi568815593r:156987105_157206962|GENSCAN_predicted_peptide_4|500_aa MGEPQQVSALPPPPMQYIKEYTDENIQEGLAPKPPPPIKDSYMMFGNQFQCDDLIIRPLE SQGIERLHPMQFDHKKELRKLNMSILINFLDLLDILIRSPGSIKREEKLEDLKLLFVHVH HLINEYRPHQARETLRVMMEVQKRQRLETAERFQKHLERVIEMIQNCLASLPDDLPHSEA GMRVKTEPMDADDSNNCTGQNEHQRENSAAGWTVVSRISLEATEDGKDCQSGSLNDQQSC EEPSKSDGIYGWSSEVEYRAEVGQNAYLPCFYTPAAPGNLVPVCWGKGACPVFECGNVVL RTDERDVNYWTSRYWLNGDFRKGDVSLTIENVTLADSGIYCCRIQIPGIMNDEKFNLKLV IKPAETQTLGSLPDINLTQISTLANELRDSRLANDLRDSGATIRIGIYIGAGICAGLALA LIFGALIFKCKCFCFSLPLITNSSVRTRLANAVAEGIRSEENIYTIEENVYEVEEPNEYY CYVSSRQQPSQPLGCRFAMP >gi568815593r:156987105_157206962|GENSCAN_predicted_CDS_4|1503_bp atgggtgaaccacagcaagtgagtgcacttccaccacctccaatgcaatatatcaaggaa tatacggatgaaaatattcaagaaggcttagctcccaagcctccccctccaataaaagac agttacatgatgtttggcaatcagttccaatgtgatgatcttatcatccgccctttggaa agtcagggcatcgaacggcttcatcctatgcagtttgatcacaagaaagaactgagaaaa cttaatatgtctatccttattaatttcttggaccttttagatattttaataaggagccct gggagtataaaacgagaagagaaactagaagatcttaagctgctttttgtacacgtgcat catcttataaatgaataccgaccccaccaagcaagagagaccttgagagtcatgatggag gtccagaaacgtcaacggcttgaaacagctgagagatttcaaaagcacctggaacgagta attgaaatgattcagaattgcttggcttctttgcctgatgatttgcctcattcagaagca ggaatgagagtaaaaactgaaccaatggatgctgatgatagcaacaattgtactggacag aatgaacatcaaagagaaaattcagctgctggttggacggtcgtatccaggataagtttg gaagccaccgaagatggcaaagattgtcagtctggttctttgaatgaccaacaatcctgt gaagaaccttctaaatctgatggtatttatggctggtcctcagaagtggaatacagagcg gaggtcggtcagaatgcctatctgccctgcttctacaccccagccgccccagggaacctc gtgcccgtctgctggggcaaaggagcctgtcctgtgtttgaatgtggcaacgtggtgctc aggactgatgaaagggatgtgaattattggacatccagatactggctaaatggggatttc cgcaaaggagatgtgtccctgaccatagagaatgtgactctagcagacagtgggatctac tgctgccggatccaaatcccaggcataatgaatgatgaaaaatttaacctgaagttggtc atcaaaccagcagagacacagacactggggagcctccctgatataaatctaacacaaata tccacattggccaatgagttacgggactctagattggccaatgacttacgggactctgga gcaaccatcagaataggcatctacatcggagcagggatctgtgctgggctggctctggct cttatcttcggcgctttaattttcaaatgtaagtgtttttgtttctctctccctctgata acaaattcttcagtgagaaccagattggcaaatgcagtagcagagggaattcgctcagaa gaaaacatctataccattgaagagaacgtatatgaagtggaggagcccaatgagtattat tgctatgtcagcagcaggcagcaaccctcacaacctttgggttgtcgctttgcaatgcca tag >gi568815593r:156987105_157206962|GENSCAN_predicted_peptide_5|81_aa MHYQDPPSMKDLCPQLLRMLPADKFQLAWCPYGELELPPLPSSHVYNTKSEDQCELWTSG TNDASMSAHQNQMNHSGAGCR >gi568815593r:156987105_157206962|GENSCAN_predicted_CDS_5|246_bp atgcactatcaagaccctccttcaatgaaggacttgtgtcctcagttgctgagaatgctg ccggcagacaagtttcagctggcatggtgtccatatggggagttggaactcccacccttg cccagtagtcatgtgtataacaccaagagtgaagaccagtgtgaactatggacttcaggt actaatgatgcgtcaatgtcagctcatcaaaatcaaatgaaccactctggtgcaggatgt cgatga >gi568815593r:156987105_157206962|GENSCAN_predicted_peptide_6|616_aa MGDLQRQLYNRGEYNIFKYAPMFESNFIQINKKGEVIDVHNRVRMVTVGIVCTSPILPLP DVMVLAQPTKICEQHVRWGRFAKGRGRRPVKTLELTRLLPLKFVKISIHDHEKQQLRLKL ATGRTFYLQLCPSSDTREDLFCYWEKLVYLLRPPVESYCSTPTLLSGDAPPEDNKSLVVS LSKAREHGAGSLGEQFKASPTKPLVPSEAHLEKAAELHREGDQSETGLYKPCDVSAATSS AYAGGEGIQHASHGTASAASPSTSTPGAAEGGAARTAGGMAVAGTATGPRTDVAIAGAAM SPATGAMSIATTKSAGPGQVTTALAGAAIKNPGENESSKSMAGAANISSEGISLALVGAA STSLEGTSTSMAGAASLSQDSSLSAAFAGSITTSKCAAERTEGPAVGPLISTLQSEGYMS ERDGSQKVSQPSAEVWNENKERREKKDRHPSRKSSHHRKAGESHRRRAGDKNQKASSHRS ASGHKNTRDDKKEKGYSNVRGKRHGSSRKSSTHSSTKKESRTTQELGKNQSASSTGALQK KASKISSFLRSLRATPGSKTRVTSHDREVDIVAKMVEKQNIEAKVEKAQGGQELEMISGT MTSEKTEMIVFETKSI >gi568815593r:156987105_157206962|GENSCAN_predicted_CDS_6|1851_bp atgggggacctgcaacgacaattgtacaacagaggagagtacaacattttcaagtatgca ccaatgttcgagagtaattttattcagataaacaaaaagggagaggtgattgatgtacac aaccgtgtccgaatggtgacagtgggcatcgtctgcaccagccccatcctcccactgcct gacgtcatggttctggcccaaccaactaaaatctgtgaacagcatgtcagatggggccgg tttgccaaggggagaggtcgcaggcccgtcaagactctagagctcacgagactgcttccc ttgaaatttgtgaagatctccatccacgatcatgagaaacagcagctgcgcctgaaactc gccactggccgtactttttatctgcagttgtgtccctcttctgacacacgggaagatctc ttttgctattgggaaaaacttgtctatctcctgaggccaccagtagagagttactgcagt accccaacacttctatctggggacgcaccacccgaagacaacaaaagcctagtggtaagc ctctcaaaggctcgagaacatggggcaggttctttgggggagcagttcaaagccagtcct acaaagcctttggtgccttctgaggcccacctggaaaaagctgcagagctccacagagaa ggggatcagagtgagactgggctctacaagccttgtgatgtatctgcagccacctcttct gcttatgctgggggagagggaatccaacatgcctcccacggaacggctagtgcggcttct ccatccacgagcactccaggggctgctgaaggaggagcagcaaggacagcaggtggcatg gcagtggcaggaacagcaacaggacctagaacagatgtggcaatagcaggggcagcaatg agtcctgcaacaggtgctatgagcatagcaacaaccaaatctgcaggcccaggtcaagtg accacagcgctggcgggagcagctatcaaaaatccaggagaaaatgaatccagcaagtcc atggcaggtgctgccaacatatcctcagagggtattagcttggccttggtgggtgctgca agcacctccttggaaggtacttccacctcgatggcgggggccgccagtctctcccaagac agcagcttgagtgcggcgtttgcaggcagtattacgaccagcaagtgtgcagcagaaaga actgaaggaccagcagtgggacccctcatctccaccttgcaaagcgaaggctacatgagt gaacgagatggaagccagaaagtttcccagcccagtgctgaagtctggaatgaaaacaag gaaagaagagaaaagaaggacagacatcccagtaggaaaagttctcatcaccgcaaggca ggtgaaagtcaccgcaggagagcgggggacaagaatcagaaagcgtcttcccaccggtcc gcatctggccataaaaacacgagagatgacaaaaaagaaaaagggtacagcaacgtaagg ggcaagcgacatggctcctctcgcaagagctccacccacagctccaccaaaaaggagtcg agaacaactcaggaactggggaagaaccaatctgcatctagcacaggagctttacaaaag aaagccagtaagatcagctcttttttaaggagcctcagggccactcctggttcaaaaaca agggtcacatcacatgacagagaggtagatatcgtggctaagatggtggagaagcaaaac atagaggccaaagtggagaaagcccagggtggccaggagctggagatgatcagtggcact atgacatccgagaagacggagatgatcgtctttgaaaccaaatccatttaa >gi568815593r:156987105_157206962|GENSCAN_predicted_peptide_7|78_aa MENYAAIEKYEIVSFAATWVELEAISPSELTQELKTKCCMFSLSQNLNSRVYSISLWAIS TAETDEWVYQSLNSRNQI >gi568815593r:156987105_157206962|GENSCAN_predicted_CDS_7|237_bp atggagaattatgcagccatagaaaagtatgagattgtgtcctttgcagcgacatgggta gagttggaggccattagcccaagtgaactaacacaggaactaaaaaccaaatgctgcatg ttctcactgagtcagaatctaaactctcgtgtttatagtatttctctgtgggctatttct actgcagaaactgatgaatgggtgtaccagtccctgaacagtaggaatcaaatctaa