GENSCAN 1.0 Date run: 4-Nov-116 Time: 01:29:09 Sequence gi568815588r:95215062_95537556 : 322495 bp : 43.95% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 6874 6913 40 -3.26 1.01 Init + 13649 13709 61 1 1 89 32 86 0.563 4.51 1.02 Term + 17589 17905 317 1 2 78 42 119 0.333 1.40 1.03 PlyA + 17909 17914 6 1.05 2.10 PlyA - 18231 18226 6 1.05 2.09 Term - 23050 22864 187 0 1 80 52 227 0.815 15.26 2.08 Intr - 23624 23507 118 0 1 105 56 78 0.988 5.82 2.07 Intr - 26457 26371 87 1 0 53 99 55 0.745 3.04 2.06 Intr - 32305 32154 152 0 2 74 84 159 0.425 13.91 2.05 Intr - 49002 48803 200 0 2 105 103 207 0.870 22.05 2.04 Intr - 53801 53717 85 1 1 54 111 38 0.561 2.52 2.03 Intr - 56723 56572 152 2 2 126 72 20 0.506 3.16 2.02 Intr - 75861 75759 103 2 1 61 94 131 0.776 11.18 2.01 Init - 77759 77671 89 2 2 68 110 50 0.819 5.31 2.00 Prom - 82566 82527 40 -3.06 3.34 PlyA - 83212 83207 6 1.05 3.33 Term - 89112 88919 194 1 2 77 32 79 0.301 -1.22 3.32 Intr - 98437 98309 129 2 0 111 96 -37 0.254 0.07 3.31 Intr - 103371 103259 113 2 2 66 91 129 0.902 11.02 3.30 Intr - 109174 109147 28 2 1 77 78 16 0.035 -3.33 3.29 Intr - 121891 121471 421 1 1 39 59 291 0.343 14.82 3.28 Intr - 124266 124072 195 0 0 91 70 185 0.897 16.61 3.27 Intr - 126349 126224 126 1 0 103 121 108 0.999 16.48 3.26 Intr - 126617 126503 115 1 1 74 71 58 0.845 3.15 3.25 Intr - 131391 131345 47 0 2 90 106 28 0.869 2.01 3.24 Intr - 136315 136148 168 1 0 47 58 173 0.774 10.34 3.23 Intr - 139906 139821 86 2 2 90 84 61 0.971 5.44 3.22 Intr - 141999 140566 1434 1 0 71 23 855 0.242 66.85 3.21 Intr - 142742 142570 173 1 2 51 86 47 0.146 0.39 3.20 Intr - 165571 165441 131 1 2 17 86 77 0.106 -0.11 3.19 Intr - 166738 166624 115 0 1 74 50 77 0.501 2.95 3.18 Intr - 169053 168924 130 1 1 75 115 0 0.153 1.15 3.17 Intr - 169224 169177 48 1 0 101 98 0 0.094 0.95 3.16 Intr - 179612 179550 63 0 0 69 86 46 0.611 1.19 3.15 Intr - 180014 179940 75 0 0 80 76 67 0.848 4.19 3.14 Intr - 182222 182159 64 2 1 111 69 49 0.772 3.59 3.13 Intr - 184128 183956 173 1 2 72 79 95 0.655 6.66 3.12 Intr - 191086 191057 30 2 0 87 96 14 0.181 0.20 3.11 Intr - 195716 195572 145 2 1 90 94 34 0.463 4.06 3.10 Intr - 199594 199433 162 1 0 93 77 52 0.556 4.77 3.09 Intr - 201088 200504 585 1 0 76 39 203 0.331 7.05 3.08 Intr - 207012 206900 113 1 2 68 96 98 0.968 8.70 3.07 Intr - 211977 211924 54 1 0 97 75 46 0.483 3.15 3.06 Intr - 217515 217387 129 1 0 84 85 2 0.335 0.17 3.05 Intr - 219656 219561 96 0 0 98 70 43 0.725 3.48 3.04 Intr - 222494 222378 117 0 0 124 86 87 0.689 12.54 3.03 Intr - 226119 226065 55 0 1 74 119 27 0.038 2.95 3.02 Intr - 286352 286282 71 0 2 43 94 63 0.717 1.30 3.01 Init - 286822 286723 100 1 1 55 119 77 0.935 7.92 3.00 Prom - 295172 295133 40 -5.46 4.05 PlyA - 295508 295503 6 1.05 4.04 Term - 299808 299688 121 2 1 66 42 83 0.531 -0.55 4.03 Intr - 301136 301032 105 1 0 50 98 98 0.691 6.33 4.02 Intr - 301751 301696 56 2 2 43 105 49 0.471 -0.12 4.01 Intr - 318487 318451 37 0 1 89 119 15 0.161 2.96 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 36083 36014 70 2 1 90 64 154 0.954 12.41 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:95215062_95537556|GENSCAN_predicted_peptide_1|125_aa MDEDGYFWWSGRVDDVANALENGTLHSQGVVYKLAFQLCLQQAIPSPFYAGHFWVVQIVI DPLDSMVMCPLLCLLCSKMGPWSHKAMRDPALVDLKIFVSSESVVFEALWTAKANPIYLY GSIPD >gi568815588r:95215062_95537556|GENSCAN_predicted_CDS_1|378_bp atggatgaagacggctacttctggtggtctggtagagttgatgatgttgccaatgcattg gaaaatggcactctacactcacaaggggtcgtatataagctggcatttcaactatgtctt caacaggccataccatcacccttttatgctggccatttctgggtggtgcagattgtgata gatccactggactccatggtcatgtgcccactgctgtgcctcctttgcagtaaaatgggt ccttggtcccacaaggccatgagggatcctgcgctagtggatctaaagatctttgtgtcc tcagaaagtgttgtgtttgaggccctgtggacagcaaaggcaaacccaatatacctatat gggtctattcctgactga >gi568815588r:95215062_95537556|GENSCAN_predicted_peptide_2|390_aa MGSMPSTVLGPGYQDEWVPRKLRRNWAQRGRAMTTQQIDLQGPGPWGFRLVGGKDFEQPL AISRVTPGSKAALANLCIGDVITAIDGENTSNMTHLEAQNRIKGCTDNLTLTVARSEHKV WSPLVTEEGKRHPYKMNLASEPQEVLHIGSAHNRSAMPFTASPASSTTARVITNQYNNPA GLYSSENISNFNNALESKTAASGVEANSRPLDHAQPPSSLVIDKESEVYKMLQEKQELNE PPKQSTSFLVLQEILESEEKDASGIPFHCDNRNCSLGDKTAPSREPLTGGDPNKPSGFRS VKAPVTKVAASIGNAQKLPMCDKCGTGIVGVFVKLRDRHRHPECYVCTDCGTNLKQKGHF FVEDQIYCEKHARERVTPPEGYEVVTVFPK >gi568815588r:95215062_95537556|GENSCAN_predicted_CDS_2|1173_bp atgggctctatgccaagcactgtgctggggcctgggtaccaggatgaatgggtgccaaga aaactgcgaagaaactgggcccagagaggccgcgccatgaccacccagcagatagacctc cagggcccggggccgtggggcttccgcctcgtgggcggcaaggacttcgagcagcctctc gccatttcccgggtcactcctggaagcaaggcggctctagctaatttatgtattggagat gtaatcacagccattgatggggaaaatactagcaatatgacacacttggaagctcagaac agaatcaaaggctgcacagacaacttgactctcactgtagccagatctgaacataaagtc tggtctcctctggtgacggaggaagggaagcgtcatccatacaagatgaatttagcctct gaaccccaggaggtcctgcacataggaagcgcccacaaccgaagtgccatgccctttacc gcctcgcctgcctccagcactactgccagggtcatcacaaaccagtacaacaacccagct ggcctctactcttctgaaaatatctccaacttcaacaatgccctggagtcaaagactgct gccagcggggtggaggcgaacagcagacccttagaccatgctcagcctccaagcagcctt gtcatcgacaaagaatctgaagtttacaagatgcttcaggagaaacaggagttgaatgag cccccgaaacagtccacgtctttcttggttttgcaggaaatcctggagtctgaagaaaaa gacgccagtggcatcccctttcactgtgacaatcgaaactgctctctaggagacaaaact gcccccagtagggaaccactgactggaggggatcccaacaagccctcaggattcagaagt gttaaagctcctgtcactaaagtggctgcgtcgattggaaatgctcagaagttgcctatg tgtgacaaatgtggcactgggattgttggtgtgtttgtgaagctgcgggaccgtcaccgc caccctgagtgttatgtgtgcactgactgtggcaccaacctgaaacagaagggccatttc tttgtggaggatcaaatctactgtgagaagcatgcccgggagcgagtcacaccacctgag ggttatgaagtggtcactgtgttccccaagtga >gi568815588r:95215062_95537556|GENSCAN_predicted_peptide_3|1894_aa MLLRKSNIQKTDTQNVQARVPVKGRKKDPPGTAGLKDLRKAAICWEKACDQFLVSQQSSS EPQLQTTCPATTMSSECDGGSKAVMNGLAPGSNGQDKGKVLGEKSGLVLEGISKATADPL RARSISAVKIIPVKTVKNASGLVLPTDMDLTKICTGKGAVTLRASSSYRETPSSSPASPQ ETRQHESKPGMAVAKRYHKKVNIEAVEDEWRLSSSADANGNAQPSSLAAKGYRSVHPNLP SDKSQDRFAWQSPTIHNTYKDSLYLSSPKPYVPLGTPRQQNPSQPQPISVLLAAGSAPKG VVCPGSLLPDSTFPSASSQPQQRYAATRTVYHKNVSSNPCHEAVGIKKVSSLYVPCLSNN ICLAASENSSRVARDPAEGTPLEAAGTRAPAPGLVSRTAGTGKPPPAPPPDPPKLFFDIR KDAVNRGESPSLGTQASFPDRWPHHSQPARASGSFAPISQTPPSFSPPPPLVPPAPEDLR RVSEPDLTGAVSSTDSSPLLNEVSSSLIGTDSQAFPSVSKPSSAYPSTTIVNPTIVLLQH NREQQKRLSSLSDPVSERRVGEQDSAPTQEKPTSPGKAIEKRAKDDSRRVVKSTQDLSDV SMDEVGIPLRNTERSKDWYKTMFKQIHKLNRDTPEENPYFPTYKFPELPEIQQTSEEDNP YTPTYQFPASTPSPKSEDDDSDLYSPRYSFSEDTKSPLSVPRSKSEMSYIDGEKVVKRSA TLPLPARSSSLKSSSERNDWEPPDKKVDTRKYRAEPKSIYEYQPGKSSVLTNEKMRSLCT SYFKQKVQRGAQNFAFSPLLPFFGGSLLLEDTVRKKHRSIYSSNFHAVKRESDGAPGDLT SLENERQIYKSVLEGGDIPLQGLSGLKRPSSSASTKVHSRTFNTSNALGPVCKHKKPLSA AKACISEILPSKFKPRLSAPSALLQEQKSILLPSEKAQSCENLCVSGSLNDSKRGLPLQV GGSIENLLMRSRRDYDSKSSSTMSLQEYSTSGRRPCPLSRKAGMQFTMLYRDMHQINRSG LFLGSISSSSSVRDLASHFEKSSLALSRGELGPSQEGSEHIPKHTVSSRITAFEQLIQRS RSMPSLDLSGRLSKSPTPVLSRGSLTSARSAESLLESTKLHPKEMDGMNSSGVYASPTCS NMAHHALSFRGLVPSEPLSTCSDDVDRCSNISTDSREGSGGSVHGDFPKHRLNKCKGTCP ASYTRFTTIRKHEQQQTSRQPEWRLDARGDKSTLLRNIYLMSPLPFRLKKPLHHHPRQPS PGDSSGLLVGQKPDLPSQPHQDQPPSGGKPVVPTRLSSRHTMARLSRSSEPSQERPTALE DYPRAINNGNSVPYSDHSLDRNNNPQSELAPSRGDSESPRHFIPADYLESTEEFIRRRHD DKEKLLADQRRLKREQEEADIAARRHTGVIPTHHQFITNERFGDLLNIDDTAKRKSGSEM RPARAKFDFKAQTLKELPLQKGDIVYIYKQIDQNWYEGEHHGRVGIFPRTYIELLPPAEK AQPKKLTPVQVLEYGEAIAKFNFNGDTQVEMSFRKGERITLLRQVDENWYEGRIPGTSRQ GIFPITYVDVIKRPLVKNPVDYMDLPFSSSPSRSATASPQASLHMNGDGGVHTPSSGIHQ DSFLQLPLGSSDSVISQLSDAFSSQSKRQPWREESGQYERKAERGAGERGPGGPKISKKS CLKPSDVVRCLSTEQRLSDLNTPEESRPGKPLGSAFPGSEAEQTERHRGGEQAGRKAARR GFLRESLKMRFSYQALYSYIPQNDDELELRDGDIVDVMEKCDDGWFVDTHPSSQIGVAIL GGRVVYGTPGCLHISQNYPRTIVPKSRVFTGSTLGLHLFSYFLVPSSTLVSECPLYADLL PGLSPGIQTPIYNRLLYISTEISHGCLKLKYIQN >gi568815588r:95215062_95537556|GENSCAN_predicted_CDS_3|5685_bp atgcttctgcggaagagtaacatccagaagactgacactcagaatgttcaggcaagagta ccagtcaaaggcagaaaaaaagacccgcctggtactgctggtttgaaagatctcaggaaa gcagccatttgctgggaaaaggcctgtgaccagttcctggtgtctcaacagagcagttca gagccccagttgcagacgacttgtcctgccaccaccatgagttctgaatgtgatggtggt tccaaagctgtgatgaatggcttggcacctggcagcaatgggcaagacaaaggtaaggtt cttggagagaaatctggattggtactggaaggcatttctaaagcaactgccgacccttta cgcgcacgctctatttctgctgttaaaatcattcctgtgaagacagtgaaaaacgcctca ggcctagttctccctacagacatggatcttacaaaaatctgcactgggaagggagcggtg actctccgggcctcgtcttcctacagggaaaccccaagcagtagccctgcgagccctcag gaaacccggcaacacgaaagcaaaccaggaatggcagtagcaaaaaggtaccataagaag gttaacattgaagcagttgaagatgagtggaggctttcttccagtgctgatgccaatgga aatgcccagccctcttcactcgctgccaagggctacagaagtgtgcatcccaaccttcct tctgacaagtcccaggaccgctttgcatggcagtcccccaccatccacaatacctataag gattccctctacctgagctcaccaaagccttatgtcccactcggcacccccagacagcag aacccctcacaaccccagcctatctctgtcctcctggcagctggatcggctcccaaaggt gtggtgtgccctggttccttgcttccggactccacgttccccagtgcttcatcccaaccc cagcagcgctatgcagccaccaggacagtttatcataagaatgtgagctctaacccatgt catgaggcagttgggattaaaaaggtcagcagcttatatgtaccttgtttatccaataac atttgcctggcggcgtcagaaaactcctctcgtgttgcacgtgaccctgccgaaggcact cccttggaggctgcgggcacccgagcaccggctcccggccttgtctcccgcacagccggc acaggaaaaccaccccctgctcctcctcctgaccctcccaagctattctttgacatccgt aaagatgccgttaaccgtggcgagagtccttccctggggactcaggcttcgttcccagat aggtggccccaccacagccagccagcccgcgcctctggctcctttgcccccattagccag acgccaccatccttctcaccaccacctccgctggtccctcctgccccggaggacctccgc agagtctcggagcctgacctcacgggagctgtttcgagtaccgattccagtcctctacta aatgaagtttcttcttcccttattggaactgattcccaagcctttccatcagttagcaag ccttcatccgcctatccctccacaacgattgtcaatcctactattgtgctcttgcaacac aatcgagaacagcaaaaacgactcagtagcctttcagatcctgtctcagaaagaagagtg ggagagcaggactcagcaccaacccaggaaaaacccacctcacctggcaaggctattgaa aaaagagcaaaggatgacagtaggcgggtggtgaagagcactcaggacttaagcgatgtt tccatggatgaagtgggcatcccactccggaacactgagagatcaaaagactggtacaag actatgtttaaacagatccacaaactgaacagagacactcctgaagaaaacccttatttc cctacgtacaaattccctgaacttcctgaaatccagcaaacttccgaagaggacaatcct tacactcccacctaccagtttcctgcatctactcctagtcctaaatctgaagatgatgat tcagatctgtactctcccagatactcattttctgaagacacaaaatctcccctttctgtg cctcgctcaaaaagtgagatgagctacattgatggtgagaaggtagtcaagaggtcggcc acactacccctcccagcccgctcttcctcactgaagtcaagctcagaaagaaatgactgg gaacccccagataagaaagtagacacaagaaaatatcgtgcagagcccaagagcatttac gaatatcagcctggcaagtcttccgttctgaccaacgaaaagatgcgttcactatgcacc agttatttcaagcagaaagttcagagaggagcacagaatttcgccttctccccactactg cccttctttggaggttctctgctgttagaagatactgtacgtaagaaacatagaagtata tattcttccaacttccatgcagtgaagagggaatcagacggggctcctggggatctcact agcttggagaatgagagacaaatttataaaagtgtcttggaaggtggtgacatccctctt cagggcctgagtgggctcaagcgaccatccagctctgcttccactaaagtccatagccga acgtttaacactagcaatgcgttaggccctgtgtgtaagcacaagaagcccctgtcagct gcgaaggcctgcatttcggaaatccttccatccaaattcaaacccaggctctctgctccc agcgctcttttgcaggaacagaagagtatcctcttgccctcagagaaggctcaaagctgt gagaacctttgtgtttctggttctttaaatgattccaaaagaggcctccccctccaagtg ggagggagcattgagaacctgctcatgcgctcccgacgggattatgacagcaagtcgagc agtaccatgagcctccaggagtacagcaccagcggcaggaggccctgtcctctctcaaga aaggctgggatgcagttcaccatgctttatcgggacatgcaccagatcaaccgatctggc ctcttcctgggctccatctcctcctcctcgagtgtgcgggatcttgcctcccactttgaa aagagtagcctggcattgtccaggggtgagctgggccccagccaggagggctcggaacac atccccaagcacaccgtctcttcccgcatcaccgcttttgagcagctgattcagcggtcc cgttccatgccatccctggacctgtccggaaggctgagcaagtctcccacacctgtgctg tcccggggcagcctgacctcagcccgctcggccgagtccctacttgagtcaaccaagctc catcccaaggagatggatgggatgaactccagtggggtctatgcttccccaacatgtagc aatatggcacaccatgccttgagcttcaggggccttgtgccttctgagcctctctccacc tgctctgatgacgtggaccgctgttcaaatatctccactgacagcagagaaggcagtggc ggcagtgttcatggagatttccccaaacatcgcctcaacaagtgcaagggcacctgcccg gcctcatacacccgcttcaccaccatccggaagcatgagcagcagcagacctctagacag cctgagtggcgcctggatgccagaggggacaagagcaccctcctcaggaacatctaccta atgagcccccttcctttccggctgaaaaagcccctccaccaccaccccagacaaccttcc cctggtgactcctcaggcctcctggtgggccagaagccagacctccccagtcagccccat caggaccagcccccttctggggggaagcccgtggttcccacacgcctgtcttcccgacac accatggccaggcttagccgcagctcagagccctctcaggagagacccacggccctggag gactacccaagggccattaataatggaaactccgtgccatactcagaccacagcctggac aggaacaacaacccacaaagtgaactggcaccatcccgtggagattcagaatcgccaaga cattttataccagctgattacttggaatccacggaagaatttattcgaagacgtcatgat gataaagagaaacttttagcggaccagagacgacttaaacgcgagcaagaagaggctgat attgcagctcgacgccacacaggcgtcattccgacgcaccatcagtttatcactaatgag cgctttggggacctcctcaatatagacgatactgcaaaaaggaaatctgggtcagagatg agacctgccagagccaaatttgactttaaagctcagacactaaaggagcttcctctgcag aagggagatattgtttacatttataagcaaattgatcagaactggtatgaaggagaacac cacggccgggtgggaatcttcccacgcacctacatcgagcttcttcctcctgctgagaag gcacagcccaaaaagttgacaccagtgcaggttttggaatatggagaagctattgctaag tttaactttaatggtgatacacaagtagaaatgtccttcagaaagggtgagaggatcaca ctgctccggcaggtagatgagaactggtacgaagggaggatcccggggacatcccgacaa ggcatcttccccatcacctacgtggatgtgatcaagcgaccactggtgaaaaaccctgtg gattacatggacctgcctttctcctcctccccaagtcgcagtgccactgcaagcccacag gcctcccttcacatgaatggagacggtggtgtccacacgccatcttcaggcatccaccaa gatagcttcttgcagctgccgctggggagctctgatagtgtcatctcccagcttagtgat gcctttagcagccagagcaagaggcagccatggcgcgaagagagtggacaatatgagagg aaagcagagaggggggcaggcgaaagaggccctggtggacccaagatctctaagaagagc tgcttgaagccttcagacgtggtcaggtgcctgagtactgaacagagactctcagatctc aacacccctgaggagagccggcccggcaagcccctgggtagcgcttttccaggaagtgag gctgagcagacagagcggcatagaggtggcgagcaggcggggaggaaagctgctcggaga gggtttctgagggagtcactgaagatgagatttagctatcaagcattatatagctatata ccacagaatgatgatgagttggaactccgcgatggagatatcgttgatgtcatggaaaaa tgtgacgatggatggtttgttgatacacatccaagctcacaaataggagtagcaattcta ggtggtagggttgtgtacggaacccctggctgtctgcatatatctcagaattaccccagg accattgtcccaaagtctagagtctttacaggctccacccttggcctccaccttttttcc tacttcctggtgccctcatccactcttgtgtctgaatgtcctctatatgctgatctcttg cctggactctcccctggaatccagactcctatatacaaccgcctgctctacatctccact gaaatatctcatgggtgcctgaaacttaaatatatccaaaactaa >gi568815588r:95215062_95537556|GENSCAN_predicted_peptide_4|106_aa XFSQVKKWDGVSLRGIENPELMEDGLGNSHLRFPGNAAGLVTTPEELHYPDLEPPGPAGA SDLDSDSVFHRYSASYTLYFCLKAQYESEVEVTFVAHTWFLLTTSL >gi568815588r:95215062_95537556|GENSCAN_predicted_CDS_4|321_bp nntttttcacaagtgaaaaagtgggatggtgtttctctgagaggcattgaaaatccagaa ttaatggaagatggtcttggaaatagtcacttgaggttcccaggcaatgctgctgggctg gtgaccacacctgaagaactacactacccagatctggagccaccaggcccagctggtgcc tcagacctagacagtgacagtgtcttccatcgatattctgcctcatatacactctacttc tgcctcaaggcccagtatgaatctgaagtcgaagtgacttttgtagcccatacttggttc ctgctcacaacttctctatga