GENSCAN 1.0 Date run: 4-Nov-116 Time: 20:22:33 Sequence gi568815594f:183405196_183610949 : 205754 bp : 43.73% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3098 3166 69 1 0 112 53 23 0.453 2.42 1.02 Intr + 8412 8532 121 2 1 76 61 81 0.876 4.27 1.03 Intr + 10586 10764 179 0 2 67 23 139 0.763 4.94 1.04 Term + 12249 12449 201 2 0 130 47 84 0.779 5.89 1.05 PlyA + 12992 12997 6 1.05 2.04 PlyA - 14855 14850 6 1.05 2.03 Term - 21560 21466 95 0 2 99 52 93 0.513 4.79 2.02 Intr - 24602 24371 232 2 1 98 56 42 0.364 -0.65 2.01 Init - 28341 28234 108 0 0 83 43 173 0.910 10.52 2.00 Prom - 30918 30879 40 -4.86 3.00 Prom + 36840 36879 40 -3.36 3.01 Init + 38691 38796 106 2 1 57 28 122 0.902 3.48 3.02 Intr + 39133 39431 299 2 2 108 52 136 0.566 8.59 3.03 Intr + 39555 39874 320 2 2 58 117 462 0.601 40.86 3.04 Intr + 40340 40470 131 2 2 109 37 77 0.987 5.04 3.05 Term + 40893 42232 1340 1 2 86 35 274 0.975 13.59 3.06 PlyA + 42598 42603 6 1.05 4.00 Prom + 48597 48636 40 -6.66 4.01 Init + 50695 50856 162 0 0 56 99 111 0.645 8.73 4.02 Term + 58528 58686 159 0 0 16 48 113 0.266 -1.96 4.03 PlyA + 59440 59445 6 1.05 5.00 Prom + 60321 60360 40 -4.86 5.01 Init + 61214 61349 136 1 1 53 57 98 0.181 3.50 5.02 Term + 72536 72615 80 0 2 66 44 100 0.169 1.33 5.03 PlyA + 73595 73600 6 1.05 6.00 Prom + 73863 73902 40 -5.46 6.01 Sngl + 78295 78990 696 0 0 101 43 698 0.936 62.81 6.02 PlyA + 79197 79202 6 1.05 7.00 Prom + 94239 94278 40 -3.96 7.01 Init + 99085 99223 139 0 1 56 58 200 0.962 12.10 7.02 Intr + 99921 100172 252 1 0 -88 97 455 0.956 26.21 7.03 Intr + 100259 100539 281 0 2 82 38 166 0.962 8.20 7.04 Intr + 101069 101165 97 1 1 57 31 95 0.537 0.28 7.05 Intr + 105087 105571 485 1 2 123 -22 429 0.214 28.14 7.06 Intr + 107622 107707 86 2 2 66 75 90 0.121 4.12 7.07 Intr + 109697 109847 151 2 1 39 96 7 0.089 -3.24 7.08 Term + 110382 110528 147 2 0 18 34 140 0.418 -0.50 7.09 PlyA + 110892 110897 6 1.05 8.03 PlyA - 112558 112553 6 1.05 8.02 Term - 126217 126150 68 2 2 106 41 18 0.438 -3.00 8.01 Init - 129091 128923 169 1 1 58 60 175 0.703 11.40 8.00 Prom - 130271 130232 40 -5.56 9.03 PlyA - 130799 130794 6 1.05 9.02 Term - 132510 132445 66 0 0 109 39 118 0.963 6.94 9.01 Init - 146356 146276 81 1 0 42 75 111 0.192 4.18 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:183405196_183610949|GENSCAN_predicted_peptide_1|189_aa MGPGLLQASVPTVPHSPRQPCIQRGDLVDKAELSRSGPCRADDKPTTWPRIRHQVPAAAW FCAAPRRVPDTEEFSMNTSGIREPMEIFIALFISLCEAAAVRVCTPSHGWLLAVLQPKQN LSRAVLLKETAQGRWHGCMLEDPAKSRQAGYEASGADPLRQMGRQRHNLFHSANIWEQNG LSTSAKYHP >gi568815594f:183405196_183610949|GENSCAN_predicted_CDS_1|570_bp atgggccctggtcttcttcaggcctcggtgcccaccgtcccccactcccccagacagcca tgcatccagagaggtgacctggtggacaaggcagagctctccaggtcaggcccctgcagg gccgatgacaagcctaccacgtggcccagaataagacaccaggtgccagcagcagcctgg ttctgcgcagcacctagaagagtgcccgacacagaggagttttccatgaatacttctgga attcgcgaacccatggaaatcttcattgccctgttcatcagtctttgcgaggcagctgct gtccgtgtgtgcacgccttctcatggctggttgctagctgtccttcagccaaaacagaac ctttccagggctgtgctgctgaaggagacagctcaggggagatggcatggatgcatgctg gaggatcctgcaaaatcccggcaagctggctatgaggccagcggtgctgaccctctacgc cagatgggaaggcagagacacaatctgtttcactccgcaaacatctgggaacaaaatggt ctgtccaccagtgcaaaataccacccctaa >gi568815594f:183405196_183610949|GENSCAN_predicted_peptide_2|144_aa MRSHSSALGLLMGLGTMEQGAVLVGEARAAQEPMERDSAPFPLAVPVVCMGLAPITGSSG HVTQAWPLKANCETFAGATRKRMCVSFGCIECRAWIAIVKEQVAPEPRNGESEGCWLELW QQDGPCGDFGVEAVKGRDGKVAGA >gi568815594f:183405196_183610949|GENSCAN_predicted_CDS_2|435_bp atgcgctcgcactcctcagcccttgggctgttgatgggactgggcaccatggagcagggg gcggtgctcgttggggaggctcgggccgcacaggagcccatggagcgggattctgctcct ttccctctggcagtccctgtggtctgtatggggctggccccaatcactggatccagcgga cacgtcacccaggcctggccactgaaagccaattgtgagacttttgctggagcaaccagg aaaagaatgtgtgtatcctttggatgcatagaatgcagagcatggattgcaatcgtgaag gagcaggtagctccagaacccagaaatggagagagcgagggctgctggctggagctctgg cagcaagatgggccatgtggggacttcggagtggaagctgtgaagggcagagacggcaag gtagcaggagcttga >gi568815594f:183405196_183610949|GENSCAN_predicted_peptide_3|731_aa MPGLTRDLINCGNKPYRCGARASFEGCKPHGNTEPADPAGEEDGRSARRHKARTERPGFR RDSVVPGQLDLQSALVGRETPDARPPHPRGALRGLTGTQRRSGPGAAKRLGSAGSRRKNA RARGGGEVIKGKKPASCLSPGAAVFAACRPNMAQEVSEYLSQNPRVAAWVEALRCDGETD KHWRHRRDFLLRNAGDLAPAGGAASASTDEAADAESGTRNRQLQQLISFSMAWANHVFLG CRYPQKVMDKILSMAEGIKVTDAPTYTTRDELVAKVKKRGISSSNEGVEEPSKKRVIEGK NSSAVEQDHAKTSAKTERASAQQENSSTCIGSAIKSESGNSARSSGISSQNSSTSDGDRS VSSQSSSSVSSQVTTAGSGKASEAEAPDKHGSASFVSLLKSSVNSHMTQSTDSRQQSGSP KKSALEGSSASASQSSSEIEVPLLGSSGSSEVELPLLSSKPSSETASSGLTSKTSSEASV SSSVAKNSSSSGTSLLTPKSSSSTNTSLLTSKSTSQVAASLLASKSSSQTSGSLVSKSTS LASVSQLASKSSSQTSTSQLPSKSTSQSSESSVKFSCKLTNEDVKQKQPFFNRLYKTVAW KLVAVGGFSPNVNHGELLNAAIEALKATLDVFFVPLKELADLPQNKSSQESIVCELRCKS VYLGTGCGKSKENAKAVASREALKLFLKKKVVVKICKRKYRGSEIEDLVLLDEESRPVNL PPALKHPQELL >gi568815594f:183405196_183610949|GENSCAN_predicted_CDS_3|2196_bp atgcctggattaactagagacctaataaactgcgggaacaagccctaccgctgcggggcc agggccagcttcgaaggctgcaagcctcacggcaacacggagccagctgacccggcgggg gaggaggacggcaggagcgcgcggcgtcacaaagcgaggacagaacgcccggggttccgc cgagactcggtcgtcccaggccagctcgacttgcagtccgccctcgtggggcgggagacg cccgacgcccgcccacctcacccccggggtgccctccgtggcctcacggggacgcagcgt cgcagcggccccggcgccgcaaagcgtcttgggagcgccggctcgcgccggaagaacgcg agggcgcgcggcggcggcgaggttataaaagggaagaagccggcgtcctgcctgtctccc ggtgcagctgtgttcgcggcctgcaggcccaacatggcgcaggaggtgtcggagtacctg agccagaacccgcgggtggcagcctgggtggaggcgctgcgctgcgacggcgagactgac aaacactggcgccaccgccgggattttttgcttcgcaacgccggggacctggcccccgct ggcggcgctgcctccgctagcacggatgaagctgccgacgccgagagcgggacccgaaac cggcagctgcagcagctcatctccttttccatggcctgggcgaaccacgtcttcctcggg tgccgataccctcaaaaagttatggataaaatacttagtatggctgaaggcatcaaagtg acagatgctccaacctatacaacaagagatgaactggttgccaaggtgaagaaaagaggg atatcgagtagcaatgaaggggtagaagagccatccaaaaaacgagttatagaaggaaaa aacagttctgcagttgagcaagatcacgcaaaaacctctgccaagacagaacgtgcatca gctcagcaggaaaacagttcaacgtgtatagggtcggccatcaaatcagagagtgggaac tcagctcggagctctggcatctccagtcagaatagctctacaagtgatggagatcgatct gtttccagccaaagcagcagcagcgtttcctctcaggtaacaacggcaggatctgggaaa gcttctgaagcagaagctccagataaacacggttctgcatcatttgtttccttgctgaaa tccagtgtgaatagtcacatgacccaatccactgattctagacaacaaagtggatcacct aaaaagagtgctttggaaggctcttcagcctcagcttctcaaagcagctcagagatcgag gtgcccttgttgggctcctcaggaagctcagaggtagaattgccactattgtcttccaaa cctagttcagagacagcttcaagtgggttaacttccaaaactagttcagaggcaagtgtt tcatcatcagttgctaaaaacagttcctcatcaggcacatccttactgactcccaagagc agctcttcaacaaatacatcgctgctaacttccaagagcacttcccaggtagctgcatca ctactagcttccaagagcagctcccagaccagtggatctctggtttccaaaagcacttcc ttagcaagtgtgtcccagttggcttctaagagtagttctcagactagcacctcacagttg ccttctaaaagtacttcacagtcaagtgagagttctgtcaaattctcttgcaagttaacc aatgaagatgtgaaacagaagcaaccttttttcaatagactatataaaacggtggcatgg aagttggtagctgttggtggctttagtcccaatgtgaatcatggagagctcctaaatgca gctattgaggctctgaaagcaacactggatgtattttttgtcccactaaaagaattggca gatctgcctcaaaataagagctctcaagaaagtattgtttgtgaattgaggtgcaagtct gtgtatttgggcactggctgtggaaaaagcaaagaaaatgcaaaagcagttgcatcaaga gaagcattgaagttatttctcaagaaaaaggtggtggtaaaaatatgtaaaaggaaatac agaggcagtgaaatagaagatctagtactccttgatgaagaatcgaggcctgtaaactta cctccagcactaaaacatcctcaagaattactataa >gi568815594f:183405196_183610949|GENSCAN_predicted_peptide_4|106_aa MLVPSHNGFPPPGSCGFHSTGLQAAITSSQSNSLLPMLPIWAAFSPDVYRLKPLSCKPLK VCPDLVPSGGFLVSLTSRMKLRTFTASVTTFKGGVDPKSQQQQDLL >gi568815594f:183405196_183610949|GENSCAN_predicted_CDS_4|321_bp atgctggtgccctcccacaacggatttcctccccctggttcctgtggttttcattctact ggtctccaagctgccatcacatccagccaaagcaactctttgctgccaatgctgccaata tgggcagctttttcaccagatgtctatcgattaaaaccactgagttgtaagcccttaaaa gtgtgtccagacttggttccctctggtggattcttggtctcactgacttcgagaatgaag ctgcggaccttcacagcgagtgttacaacttttaaaggtggtgtggacccaaagagtcag cagcagcaagatttactgtga >gi568815594f:183405196_183610949|GENSCAN_predicted_peptide_5|71_aa MWESLEPPRDLLNGFFQNADSDIDNKIQAEVVSDGDKELVGNWSKGREKQPLSLEMEVYR DAAAGINLIFP >gi568815594f:183405196_183610949|GENSCAN_predicted_CDS_5|216_bp atgtgggaaagtttggaacctcctagagacttgttgaatggctttttccaaaatgctgat agcgatatagacaataagatccaggctgaggtggtgtcagatggagataaggaacttgtt gggaactggagcaaaggaagggaaaaacagcctttatctctggagatggaggtgtaccgg gatgcggctgcgggaattaaccttatcttcccatga >gi568815594f:183405196_183610949|GENSCAN_predicted_peptide_6|231_aa MATSAASSERFKKLHEIFRGLHEDLQGVPEQLLGTAGTEENKSIRDFDEKQQEANEMLAG MEEELRYAPLSFHNTMTSKLRNYRKDLAKLHREVRSTPLTATPGGRGDMKYDIYAVENEH MNRLQSQRAMLLQGPENLNRATQSIERSHQIATETDQIGSETIEELGEQRDHLERTKSRL INTTENLSKSRKILRSMSRKVTTNKLLFSIIILLELAVLGGLVYYRFFRNH >gi568815594f:183405196_183610949|GENSCAN_predicted_CDS_6|696_bp atggccacctccgccgcctcctccgagcgtttcaagaagctgcacgaaatcttccgcggc ctccacgaagacctacaaggggtgcccgagcagctgctggggacggcggggaccgaagag aataagtcgatcagggattttgatgaaaagcaacaggaagcaaatgaaatgctggcaggg atggaggaggagctacgttatgcacccctgtctttccataacaccatgacgtctaagctt cgaaactaccggaaggaccttgccaaactccatcgggaggtgagaagcacgcctttgaca gccacaccaggagggcgaggagacatgaaatatgacatatatgctgtagagaatgagcat atgaatcggctacagtctcaaagggcaatgcttctgcaaggtcctgaaaacctgaaccgg gccacccaaagtattgaacgttctcatcagattgccacggagactgaccagattggctca gaaaccatagaagagctgggggaacaacgagaccatttagaacgcaccaagagtagactg ataaacacaactgaaaacttgagcaaaagtcgaaagattctccgttcaatgtccagaaaa gtgacaaccaacaagctgctgttttccatcatcatcttactggagctcgccgtcctggga ggcctggtttactacagattctttcgcaaccattga >gi568815594f:183405196_183610949|GENSCAN_predicted_peptide_7|545_aa MPGAARRPPGRAVPQASLQVVRKGTACCSFAGSRGAALGPPDGEKAAEGPRRPRPVHVRL LDAEAAATARIGRMLGQQQQQLYSSAALLTGERSRLLTCYVQDYLECVESLPHDMQRNVS VLRELDNKYQVSPSAPRVCRAGLGGLETGLAALRGPAVGEYFSGVPRVGVGAACQPPLQT QRLQLPLRGISHFKQQAFVQALARPVPQTLRPRPGTAAGQGSEGSMDQDGDQQLGPSRIL APRRFRDMCHFRASGGKTLKEIDDVYEKYKKEDDLNQKKRLQQLLQRALINSQELGDEKI QIVTQMLELVENRARQMELHSQCFQDPAESERASDKAKMDSSQPERSSRRPRRQRTSESR DLCHMANGIEDCDDQPPKEKKSKSAKKKKRSKAKQEREASPVEFAIDPNEPTYCLCNQMG EKKRLPNRNAKVWIRLDGKTNLENPGREGSTFIHLLKYLLCTYLQGIVCTGDKAVTKMRQ SVTLWSLDSGVNGAYMQRNHSSFIVSYFNAREVVWTKIPEMWTQPVILDRSLVLFGLQFV QLENG >gi568815594f:183405196_183610949|GENSCAN_predicted_CDS_7|1638_bp atgccgggagccgcacgccgcccgccggggcgcgcagtcccgcaggcctcgctccaggtc gtccgaaaaggaaccgcctgctgctcctttgcaggctcgcgaggagccgcactcggaccg cctgatggggaaaaagccgctgagggcccgcggcggccgcggccggtgcatgtgcggctg ctggatgcggaggcggcggcgacggcgcggatcggcaggatgttagggcagcagcagcag caactgtactcgtcggccgcgctcctgaccggggagcggagccggctgctcacctgctac gtgcaggactaccttgagtgcgtggagtcgctgccccacgacatgcagaggaacgtgtct gtgctgcgagagctggacaacaaatatcaagtctccccgagcgcaccgagggtctgccga gcgggactgggaggactggagaccgggttggcggccctccgtggccccgcggtgggcgag tacttctctggggtccccagagtgggggtgggggccgcttgtcagcccccgctgcagacc cagcggctgcagctgccgctgaggggaatcagccattttaagcagcaagcatttgttcag gcgctggcccgccccgtgccgcagaccctgcggccccggcccgggactgcagccgggcaa gggtcggagggatcaatggatcaggacggcgatcagcagctcggaccgtcgcggatcctg gctccgcgtaggttccgggacatgtgtcacttccgggcttcaggaggaaaaacgttaaag gaaattgatgatgtctacgaaaaatataagaaagaagatgatttaaaccagaagaaacgt ctacagcagcttctccagagagcactaattaatagtcaagaattgggagatgaaaaaata cagattgttacacaaatgctcgaattggtggaaaatcgggcaagacaaatggagttacac tcacagtgtttccaagatcctgctgaaagtgaacgagcctcagataaagcaaagatggat tccagccaaccagaaagatcttcaagaagaccccgcaggcagcggaccagtgaaagccgt gatttatgtcacatggcaaatgggattgaagactgtgatgatcagccacctaaagaaaag aaatccaagtcagcaaagaaaaagaaacgctccaaggccaagcaggaaagggaagcttca cctgttgagtttgcaatagatcctaatgaacctacatactgcttatgcaaccaaatggga gaaaagaaacgacttcctaatagaaatgccaaagtttggattagacttgacgggaaaacg aacctggaaaatcctggaagggagggcagcacttttattcacttactcaaatatttattg tgcacctatttgcaaggcatcgtttgtactggggataaagcagtgaccaaaatgagacaa tctgtaaccttgtggagcttagattctggagtgaatggggcgtatatgcagagaaaccac agctccttcatcgtctcctatttcaatgcgagggaagtggtctggaccaagattccagag atgtggactcaacctgtgatcttggacaggtcacttgtcctctttggcctgcagtttgtt cagctggaaaatggctga >gi568815594f:183405196_183610949|GENSCAN_predicted_peptide_8|78_aa MEANNFKVPGNMDNSSLGGGAGMETARRLDDDSHFAGGAAENATTAGRERRRLNLGDVTS REISFDLRVNIFGKNLHR >gi568815594f:183405196_183610949|GENSCAN_predicted_CDS_8|237_bp atggaagccaacaacttcaaggtccctggaaacatggacaacagcagcctcggaggagga gcggggatggaaactgctcggcggttagatgatgacagtcactttgctggaggagcagca gagaacgcgaccacggctggaagggaaagaagacgactgaatctgggggatgttacatct agagagatctcatttgacttaagagtcaacatttttggcaaaaatcttcataggtga >gi568815594f:183405196_183610949|GENSCAN_predicted_peptide_9|48_aa MVALPSPLRSLSSLLPPGLVLPNTAAQPTQHEADEDEDRYDDPLPLNE >gi568815594f:183405196_183610949|GENSCAN_predicted_CDS_9|147_bp atggttgccctaccttccccgctgcgctccctgagctctctcctgccccctgggcttgtg ctacctaatacagcagcacagcctactcaacatgaagctgacgaggatgaagaccgttat gatgatccacttccacttaatgaatag