GENSCAN 1.0	Date run: 8-Nov-116	Time: 05:19:54

Sequence gi568815590r:69952080_70169860 : 217781 bp : 43.71% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  16413  16613  201  2  0   18  100   131 0.407   6.18
 1.02 Intr +  18055  18171  117  0  0   41   80    74 0.711   2.56
 1.03 Term +  57369  57515  147  2  0   24   54   175 0.506   5.60
 1.04 PlyA +  58470  58475    6                               1.05

 2.00 Prom +  68997  69036   40                              -2.66
 2.01 Sngl +  83070  83459  390  2  0   85   39   129 0.680   3.92
 2.02 PlyA +  85000  85005    6                               1.05

 3.11 PlyA -  85442  85437    6                               1.05
 3.10 Term - 100225  99998  228  1  0   99   41   195 0.998  12.53
 3.09 Intr - 100415 100334   82  1  1   32   83    93 0.408   2.84
 3.08 Intr - 104005 103965   41  1  2   90   84     9 0.362  -2.28
 3.07 Intr - 106763 106561  203  0  2   42  113   198 0.993  16.70
 3.06 Intr - 114426 114156  271  0  1   28  113   242 0.991  17.81
 3.05 Intr - 116308 116151  158  2  2  127   84   184 0.899  21.63
 3.04 Intr - 117805 116998  808  1  1   62   49   664 0.163  50.92
 3.03 Intr - 119178 119071  108  0  0   21   80   123 0.172   5.28
 3.02 Intr - 119655 119569   87  0  0  102   61    67 0.166   5.57
 3.01 Init - 140207 140052  156  2  0   67   86    81 0.416   5.71
 3.00 Prom - 147650 147611   40                              -5.06

 4.18 PlyA - 147830 147825    6                               1.05
 4.17 Term - 152111 151714  398  0  2   13   43   378 0.008  21.04
 4.16 Intr - 158343 158310   34  0  1   70   98     1 0.002  -2.90
 4.15 Intr - 169312 169223   90  1  0  123   94    -1 0.298   4.19
 4.14 Intr - 172003 171805  199  0  1   71  109   168 0.980  16.55
 4.13 Intr - 172786 172609  178  2  1   85   91    44 0.969   3.38
 4.12 Intr - 174968 174527  442  2  1   56   82   161 0.663   5.33
 4.11 Intr - 176431 176354   78  1  0   68  110    43 0.948   4.25
 4.10 Intr - 176901 176623  279  0  0   61   62   135 0.925   5.87
 4.09 Intr - 179923 179758  166  0  1   87   85   190 0.726  18.56
 4.08 Intr - 186253 186124  130  2  1  113  109    50 0.998   9.35
 4.07 Intr - 189320 189105  216  0  0   68   82   118 0.607   7.68
 4.06 Intr - 192769 192563  207  2  0  108  110   -10 0.652   2.25
 4.05 Intr - 196404 196194  211  0  1   59   73   184 0.984  12.49
 4.04 Intr - 205161 203892 1270  2  1   77  111   617 0.427  50.20
 4.03 Intr - 207573 207426  148  1  1   92   89    49 0.862   4.69
 4.02 Intr - 210775 210632  144  2  0   46  121    64 0.886   5.75
 4.01 Intr - 214675 214487  189  2  0   49   81   140 0.907   8.96

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl - 152100 151714  387  0  0   63   43   364 0.976  25.61

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590r:69952080_70169860|GENSCAN_predicted_peptide_1|154_aa
MKAERGEEAAEEKFEASTSCLNKRSHLYNIKVEGEAASIDGEAAASYSEDLAQITDEGQH
TKQEIFKTLPWPPQPLATTTLISQQPSTSRQGKTLHQHEDYDLLGQPKKEYDNDDDNDDD
DDDHDDSCSAGNGQNIQQEAPLLQTQIQNGVLQH
>gi568815590r:69952080_70169860|GENSCAN_predicted_CDS_1|465_bp
atgaaagctgaaagaggtgaggaagctgcagaagaaaagtttgaagcaagcacaagttgt
cttaacaaaagaagccatctctataacataaaagtggaaggtgaagcagcaagcattgat
ggagaagctgcagcaagttattcagaagatctagctcagatcactgatgaaggtcaacac
actaaacaagagattttcaagacactgccatggccaccccaacctttagcaaccaccacc
ctgatcagtcagcagccatccacatcaaggcaaggcaagactcttcaccagcacgaagac
tatgacttgctgggccagccaaagaaagagtatgacaacgatgacgataatgatgatgat
gatgatgatcatgatgattcctgctctgcaggaaatggacaaaatatacaacaagaagca
cctcttctccaaactcaaattcagaatggagttctccaacactga
>gi568815590r:69952080_70169860|GENSCAN_predicted_peptide_2|129_aa
MALVSESRDQIDSHLRPENEEEAEPEVAAAPPGHNRERKSWPSAGPAWGHLLEDPWEPLS
YSSESAVLSLRPQNQQDTHSVILFFSIAQLEAAGDQEKKNAFPEFSNSNAYQLCTPFGLS
KRDQKTKIC
>gi568815590r:69952080_70169860|GENSCAN_predicted_CDS_2|390_bp
atggccttagtgtctgagagcagagaccagattgactcccacttgagaccagaaaacgaa
gaggaagcggaacccgaggtggctgcggctcccccgggacacaaccgggagaggaagtcc
tggccatctgccggccccgcctgggggcatctgctcgaggatccctgggagccgttatcg
tattcctcagaatctgccgtgctgtccctccgcccccaaaaccaacaggacacccattct
gtgattctcttcttctccatcgcccagttggaggctgctggggaccaggagaaaaaaaat
gcctttcccgagttctctaattcaaatgcttatcagctttgcacaccattcggcctttcc
aaaagggaccagaaaacaaagatatgctaa
>gi568815590r:69952080_70169860|GENSCAN_predicted_peptide_3|713_aa
MIVKVAFLSEQEHFWLKGGKEDAQEGLQRKMKGKKMMEANVIHSFTDQTNIKVSGTDTHQ
VHQRVRRRLRAGWDLPNWERRGPPEVYRARVAYRARVAPQRPGGARSAVALLEVVSESRR
ACGPGMALPRPSEAVPQDKVCYPPESSPQNLAAYYTPFPSYGHYRNSLATVEEDFQPFRQ
LEAAASAAPAMPPFPFRMAPPLLSPGLGLQREPLYDLPWYSKLPPWYPIPHVPREVPPFL
SSSHEYAGASSEDLGHQIIGGDNESGPCCGPDTLIPPPPADASLLPEGLRTSQLLPCSPS
KQSEDGPKPSNQEGKSPARFQFTEEDLHFVLYGVTPSLEHPASLHHAISGLLVPPDSSGK
GSSHHLGVELDGNAGGNGSVREEALPGLCLMQTVFGEVPHFGVFCSSFIAKGVRFGPFQG
KVVNASEVKTYGDNSVMWEIFEDGHLSHFIDGKGGTGNWMSYVNCARFPKEQNLVAVQCQ
GHIFYESCKEIHQNQELLVWYGDCYEKFLDIPVSLQVTEPGKQPSGPSEESAEGYRCERC
GKVFTYKYYRDKHLKYTPCVDKGDRKFPCSLCKRSFEKRDRLRIHILHVHEKHRPHKLLV
SSSVEHTVGTLQKDTSVESFRIKATDDDQECSMALKPKRFTASSILRTHIRQHSGEKPFK
CKYCGKSFASHAAHDSHVRRSHKEDDGCSCSICGKIFSDQETFYSHMKFHEDY
>gi568815590r:69952080_70169860|GENSCAN_predicted_CDS_3|2142_bp
atgatagtgaaagtggcatttctgagtgagcaggagcatttttggctgaaaggtggaaag
gaagatgcgcaggaggggctgcagaggaagatgaaaggaaagaaaatgatggaagccaat
gtaattcactccttcactgaccaaacaaatattaaggtatcaggaacagacacccaccag
gtccaccagcgggtcagacgccgccttcgggcaggctgggatcttccgaactgggagcgg
agaggcccgcccgaagtctaccgagcccgagtggcctaccgagcccgagtggccccgcag
cgtccaggaggcgcccgctccgcggtggcgctcttggaggtggtgtcggagagccgccga
gcgtgcggtcccgggatggctctaccccggccaagtgaggccgtgcctcaggacaaggtg
tgctacccgccggagagcagcccgcagaacctggccgcgtactacacgcctttcccgtcc
tatggacactacagaaacagcctggccaccgtggaggaagacttccaacctttccggcag
ctggaggccgcagcgtctgctgcccccgccatgccccccttccccttccggatggcgcct
cccttgctgagcccgggtctgggcctacagagggagcctctctacgatctgccctggtac
agcaagctgccaccgtggtacccaattccccacgtccccagggaagtgccgcccttcctg
agcagcagccacgagtacgcgggtgccagcagtgaagatctgggccaccaaatcattggt
ggcgacaacgagagtggcccgtgttgtggacctgacactttaattccaccgccccctgcg
gatgcttctctgttacctgaggggctgaggacctcccagttattaccttgctcacccagc
aagcagtcagaggatggtcccaaaccctccaaccaagaagggaagtcccctgctcggttc
cagttcacggaggaggacctgcacttcgttctgtacggggtcactcccagcctggagcac
ccagccagcctgcaccatgcgatttcaggcctcctggtccccccagacagctctggtaaa
gggagttcacaccatttgggagtcgaattggacgggaatgcaggcgggaacgggagtgtc
cgggaagaggcgcttccaggtctatgcctcatgcagacggtgtttggtgaagtcccacat
tttggtgtgttctgcagtagttttatcgccaaaggagtcaggtttgggccctttcaaggt
aaagtggtcaatgccagtgaagtgaagacctacggagacaattctgtgatgtgggagatc
tttgaagatggtcatttgagccactttatagatggaaaaggaggtacggggaactggatg
tcctatgtcaactgtgcccgcttccccaaggagcagaacctagttgctgtgcagtgtcaa
gggcatatattttatgagagctgcaaagagatccatcagaaccaagagctccttgtgtgg
tatggagactgctatgagaaatttctggatattcctgtgagccttcaggtcacagagccg
gggaagcagccatctgggccctctgaagagtctgcagaaggctacagatgtgaaagatgt
gggaaggtatttacctacaaatattacagagataagcacctcaagtacaccccctgtgtg
gacaagggcgataggaaatttccctgttctctctgcaaacgatcctttgagaagcgggac
cggcttcggatccacattcttcatgttcatgagaagcaccggcctcacaagctcctggtt
tctagcagtgtggagcacacagtaggtactctccagaaggacacatcagtggagtccttt
cggattaaggctactgatgatgaccaggagtgctctatggccttaaaacccaagaggttc
acagcctccagcatactccgcacacacatcaggcagcactccggggagaagcccttcaaa
tgcaagtactgtggtaaatcttttgcatcccatgctgcccatgacagccatgtccggcgt
tcacacaaggaggatgatggctgctcatgcagcatctgtgggaaaatcttctcagatcaa
gaaacattctactcccacatgaagtttcatgaagactactag
>gi568815590r:69952080_70169860|GENSCAN_predicted_peptide_4|1459_aa
XNGGSWSGEPPRRNSHTFNCRMLVKPLPDSEEEGHDNQEAHQKYETMQCFAVSQPKSIKE
EGEGKITSLDTSTMRAAMKPGWEDLVRRCIQKFHAQHEGESVSYAKRHHHEVLRQGLAFS
QIYRFSLSDGTLVAAQTKSKLIRSQTTNEPQLVISLHMLHREQNVCVMNPDLTGQTMGKP
LNPISSNSPAHQALCSGNPGQDMTLSSNINFPINGPKEQMGMPMGRFGGSGGMNHVSGMQ
ATTPQGSNYALKMNSPSQSSPGMNPGQPTSMLSPRHRMSPGVAGSPRIPPSQFSPAGSLH
SPVGVCSSTGNSHSYTNSSLNALQALSEGHGVSLGSSLASPDLKMGNLQNSPVNMNPPPL
SKMGSLDSKDCFGLYGEPSEGTTGQAESSCHPGEQKETNDPNLPPAVSSERADGQSRLHD
SKGQTKLLQLLTTKSDQMEPSPLASSLSDTNKDSTGSLPGSGSTHGTSLKEKHKILHRLL
QDSSSPVDLAKLTAEATGKDLSQESSSTAPGSEVTIKQEPVSPKKKENALLRYLLDKDDT
KDIGLPEITPKLERLDSKTDPASNTKLIAMKTEKEEMSFEPGDQPGSELDNLEEILDDLQ
NSQLPQLFPDTRPGAPAGSVDKQAIINDLMQLTAENSPVTPVGAQKTALRISQSTFNNPR
PGQLGRLLPNQNLPLDITLQSPTGAGPFPPIRNSSPYSVIPQPGMMGNQGMIGNQGNLGN
SSTGMIGNSASRPTMPSGEWAPQSSAVRVTCAATTSAMNRPVQGGMIRNPAASIPMRPSS
QPGQRQTLQSQVMNIGPSELEMNMGGPQYSQQQAPPNQTAPWPESILPIDQASFASQNRQ
PFGSSPDDLLCPHPAAESPSDEGALLDQLYLALRNFDGLEEIDRALGIPELVSQSQAVDP
EQFSSQDSNIMLEQKAPVFPQQYASQAQMAQGSYSPMQDPNFHTMGQRPSYATLRMQPRP
GLRPTGLVQNQPNQLRLQLQHRLQAQQNRQPLMNQISNVSNVNLTLRPGVPTQAPINAQM
LAQRQREILNQHLRQRQMHQQQQVQQRTLMMRGQGLNMTPSMVAPSGMPATMSNPRIPQA
NAQQFPFPPNYGTGLRSPPPFTSPFSPVSPSVGSQLLSHSSLHGSQMNLANQGMIGNLGG
QLGPVRSPQVQHSTFQALSSGISQQPDPGFTGATTPQSPLMSPRMAHTQSPMMQQSQANP
AYQAPSDINGWAQGNMGGNSMFSQQSPPHFGQQANTSMYSNNMNINVSMATNTGGMSSMN
QMTGQISMTSVTSVPTSGLSSMGPEQVNDPALRGGNLFPNQLPGMDMIKQEGDTTRTKAK
SFKFLTTAEFEMTGGKAGKDSGKAKTKAVSRCQRAGLQFPVGRIHLHLKSRTTNHGRVGA
TAAVYSAAILKYLTAEVLELAGNASKDLKVKRSTTGHLQLAIRGDEELDSLIKATIAGGG
VIPHIHKSLTGKKGQQKTV
>gi568815590r:69952080_70169860|GENSCAN_predicted_CDS_4|4380_bp
ntaaatgggggatcttggtctggcgaacctccgaggcggaacagccataccttcaattgt
cggatgctggtaaaacctttacctgattcagaagaggagggtcatgataaccaggaagct
catcagaaatatgaaactatgcagtgcttcgctgtctctcaaccaaagtccatcaaagaa
gaaggagaaggcaagatcacgtctctggataccagcaccatgagagcagccatgaaacca
ggctgggaggacctggtaagaaggtgtattcagaagttccatgcgcagcatgaaggagaa
tctgtgtcctatgctaagaggcatcatcatgaagtactgagacaaggattggcattcagt
caaatctatcgtttttccttgtctgatggcactcttgttgctgcacaaacgaagagcaaa
ctcatccgttctcagactactaatgaacctcaacttgtaatatctttacatatgcttcac
agagagcagaatgtgtgtgtgatgaatccggatctgactggacaaacgatggggaagcca
ctgaatccaattagctctaacagccctgcccatcaggccctgtgcagtgggaacccaggt
caggacatgaccctcagtagcaatataaattttcccataaatggcccaaaggaacaaatg
ggcatgcccatgggcaggtttggtggttctgggggaatgaaccatgtgtcaggcatgcaa
gcaaccactcctcagggtagtaactatgcactcaaaatgaacagcccctcacaaagcagc
cctggcatgaatccaggacagcccacctccatgctttcaccaaggcatcgcatgagccct
ggagtggctggcagccctcgaatcccacccagtcagttttcccctgcaggaagcttgcat
tcccctgtgggagtttgcagcagcacaggaaatagccatagttataccaacagctccctc
aatgcacttcaggccctcagcgaggggcacggggtctcattagggtcatcgttggcttca
ccagacctaaaaatgggcaatttgcaaaactccccagttaatatgaatcctcccccactc
agcaagatgggaagcttggactcaaaagactgttttggactatatggggagccctctgaa
ggtacaactggacaagcagagagcagctgccatcctggagagcaaaaggaaacaaatgac
cccaacctgcccccggccgtgagcagtgagagagctgacgggcagagcagactgcatgac
agcaaagggcagaccaaactcctgcagctgctgaccaccaaatctgatcagatggagccc
tcgcccttagccagctctttgtcggatacaaacaaagactccacaggtagcttgcctggt
tctgggtctacacatggaacctcgctcaaggagaagcataaaattttgcacagactcttg
caggacagcagttcccctgtggacttggccaagttaacagcagaagccacaggcaaagac
ctgagccaggagtccagcagcacagctcctggatcagaagtgactattaaacaagagccg
gtgagccccaagaagaaagagaatgcactacttcgctatttgctagataaagatgatact
aaagatattggtttaccagaaataacccccaaacttgagagactggacagtaagacagat
cctgccagtaacacaaaattaatagcaatgaaaactgagaaggaggagatgagctttgag
cctggtgaccagcctggcagtgagctggacaacttggaggagattttggatgatttgcag
aatagtcaattaccacagcttttcccagacacgaggccaggcgcccctgctggatcagtt
gacaagcaagccatcatcaatgacctcatgcaactcacagctgaaaacagccctgtcaca
cctgttggagcccagaaaacagcactgcgaatttcacagagcacttttaataacccacga
ccagggcaactgggcaggttattgccaaaccagaatttaccacttgacatcacattgcaa
agcccaactggtgctggacctttcccaccaatcagaaacagtagtccctactcagtgata
cctcagccaggaatgatgggtaatcaagggatgataggaaaccaaggaaatttagggaac
agtagcacaggaatgattggtaacagtgcttctcggcctactatgccatctggagaatgg
gcaccgcagagttcggctgtgagagtcacctgtgctgctaccaccagtgccatgaaccgg
ccagtccaaggaggtatgattcggaacccagcagccagcatccccatgaggcccagcagc
cagcctggccaaagacagacgcttcagtctcaggtcatgaatatagggccatctgaatta
gagatgaacatggggggacctcagtatagccaacaacaagctcctccaaatcagactgcc
ccatggcctgaaagcatcctgcctatagaccaggcgtcttttgccagccaaaacaggcag
ccatttggcagttctccagatgacttgctatgtccacatcctgcagctgagtctccgagt
gatgagggagctctcctggaccagctgtatctggccttgcggaattttgatggcctggag
gagattgatagagccttaggaatacccgaactggtcagccagagccaagcagtagatcca
gaacagttctcaagtcaggattccaacatcatgctggagcagaaggcgcccgttttccca
cagcagtatgcatctcaggcacaaatggcccagggtagctattctcccatgcaagatcca
aactttcacaccatgggacagcggcctagttatgccacactccgtatgcagcccagaccg
ggcctcaggcccacgggcctagtgcagaaccagccaaatcaactaagacttcaacttcag
catcgcctccaagcacagcagaatcgccagccacttatgaatcaaatcagcaatgtttcc
aatgtgaacttgactctgaggcctggagtaccaacacaggcacctattaatgcacagatg
ctggcccagagacagagggaaatcctgaaccagcatcttcgacagagacaaatgcatcag
caacagcaagttcagcaacgaactttgatgatgagaggacaagggttgaatatgacacca
agcatggtggctcctagtggtatgccagcaactatgagcaaccctcggattccccaggca
aatgcacagcagtttccatttcctccaaactacggtactggacttagatccccaccacct
ttcaccagtcctttctccccagtgtcccccagtgttgggtcacagctcctctctcacagc
tctttgcatggctcccagatgaatctggctaaccagggaatgataggaaacctgggagga
cagttggggcctgtgaggagtccccaagtccagcacagtaccttccaggctctcagctca
ggaataagtcagcaacctgatccaggctttactggggctacgactccccagagcccactt
atgtcaccccgaatggcacatacacagagtcccatgatgcaacagtctcaggccaaccca
gcctatcaggccccctccgacataaatggatgggcgcaggggaacatgggcggaaacagc
atgttttcccagcagtccccaccacactttgggcagcaagcaaacaccagcatgtacagt
aacaacatgaacatcaatgtgtccatggcgaccaacacaggtggcatgagcagcatgaac
cagatgacaggacagatcagcatgacctcagtgacctccgtgcctacgtcagggctgtcc
tccatgggtcccgagcaggttaatgatcctgctctgaggggaggcaacctgttcccaaac
cagctgcctggaatggatatgattaagcaggagggagacacaacacggactaaagcaaaa
tcttttaagtttcttactacagcagaattcgaaatgactggcggtaaggctgggaaggac
tccggaaaggccaagacaaaggcggtttcccgctgccagagagccggcttgcagttccca
gttgggcgtattcatctacacctgaaatctaggacgaccaatcatggacgtgtgggcgcg
actgccgctgtgtacagcgcagccatcctgaagtacctcaccgcagaggtacttgaactg
gcaggaaatgcatccaaagacttaaaggtaaagcgtagtaccactggtcacttgcaactt
gctatccgtggagatgaagaattggattctctcatcaaggctacgattgctggtggtggt
gtcattccacacatccacaaatctctgactgggaagaaaggacaacagaagactgtctaa