GENSCAN 1.0    Date run: 5-Nov-116    Time: 11:25:50

Sequence gi568815575f:149613160_149816773 : 203614 bp : 44.47% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.09 PlyA -   1825   1820    6                               1.05
 1.08 Term -  18828  18488  341  1  2  100   48   438 0.784  35.70
 1.07 Intr -  23572  23535   38  0  2   90   72    33 0.051  -0.19
 1.06 Intr -  31360  31312   49  2  1  114  111     5 0.035   3.14
 1.05 Intr -  34683  34638   46  0  1   77   81    14 0.081  -2.42
 1.04 Intr -  36464  36186  279  2  0   -3   49   324 0.401  17.07
 1.03 Intr -  36773  36621  153  2  0   86   57    95 0.556   6.47
 1.02 Intr -  37025  36888  138  2  0  125   68    47 0.559   7.06
 1.01 Init -  39105  39025   81  0  0   62   98    57 0.790   4.86
 1.00 Prom -  54689  54650   40                              -5.16

 2.00 Prom +  56827  56866   40                              -4.56
 2.01 Init +  57997  58061   65  0  2   63   32   115 0.040   2.10
 2.02 Intr +  76416  76506   91  0  1  105   86    52 0.469   6.70
 2.03 Intr + 101322 101417   96  2  0   63   94    67 0.874   5.01
 2.04 Intr + 102445 102518   74  0  2   95   84     2 0.839  -1.30
 2.05 Term + 102594 103617 1024  0  1  110   53   601 0.965  50.39
 2.06 PlyA + 103880 103885    6                               1.05

 3.00 Prom + 118408 118447   40                              -3.56
 3.01 Init + 145633 145837  205  0  1  100   44   169 0.505  12.04
 3.02 Intr + 146552 146751  200  0  2    5   81   137 0.625   3.77
 3.03 Intr + 146995 147054   60  0  0  103   91    -4 0.475   0.33
 3.04 Intr + 147620 147715   96  1  0   63   94    75 0.958   5.81
 3.05 Intr + 155738 155914  177  1  0   86   99   105 0.856  11.52
 3.06 Intr + 156540 156663  124  2  1   81   40    85 0.997   3.16
 3.07 Intr + 157965 158163  199  1  1   68   47   189 0.970  11.21
 3.08 Intr + 158290 158394  105  1  0  118   66    39 0.482   4.03
 3.09 Intr + 158946 159057  112  0  1   16   98    56 0.422  -0.22
 3.10 Intr + 161826 162086  261  2  0  101   90   191 0.726  18.28
 3.11 Term + 162924 163631  708  2  0   33   44   390 0.970  22.61
 3.12 PlyA + 163664 163669    6                               1.05

 4.00 Prom + 167982 168021   40                              -4.16
 4.01 Init + 171607 171664   58  0  1   74   44    45 0.162   0.17
 4.02 Intr + 172843 172911   69  2  0   93   95     3 0.551   0.65
 4.03 Term + 172987 173999 1013  2  2  122   53  1083 0.978 100.47
 4.04 PlyA + 174559 174564    6                               1.05

 5.00 Prom + 187152 187191   40                              -6.06
 5.01 Init + 194429 194450   22  1  1   83  113    -2 0.755   1.69
 5.02 Intr + 194837 194910   74  0  2  115   84    48 0.980   6.23
 5.03 Intr + 194986 195210  225  0  0  121   36   195 0.908  15.78
 5.04 Term + 195769 195996  228  0  0   21   53   202 0.895   6.63
 5.05 PlyA + 196471 196476    6                               1.05

 6.03 PlyA - 197820 197815    6                               1.05
 6.02 Term - 199554 199457   98  1  2   98   49    64 0.563   1.63
 6.01 Intr - 203445 203369   77  2  2   93   89    23 0.482   2.06

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575f:149613160_149816773|GENSCAN_predicted_peptide_1|374_aa
MWILMFTSGADGGKGLGIMSTASGEQKACGSHLPSSCPHSAILTRVIMSPEQRNQHCEPE
ESLEAEGDEALGLSPQGTSTSPTTIDNTLWSQSNEGSSSQEEAPGTSPDPADLKTLFQEA
LDGKEVDTTGHSYILVTSLGLSYDGLLGDDQSKPNAGLQIMVLCTIIMEGNCTPEEVIWE
TLSVMGLYAGKKHSVYGKPRKLLTQDWVQENYLEYHQDWQPSPQTSGPPRPEGKSLFAKC
LMALAQVFSCKVPITSEVTFTGLQPVSIPFLGALLTPSLHPPAISSAGRVRTQGWGPPSE
ERSGRGSGRMRGPRTEARKEGQGAVAQVSPGGGGGGGGGGGGGGGDGDGDGSGDGSSSGS
SSVDAAGAGAMNPL

>gi568815575f:149613160_149816773|GENSCAN_predicted_CDS_1|1125_bp
atgtggatcctgatgttcacatcgggggctgatggagggaaggggcttggaatcatgagc
acggcctcaggggagcagaaggcctgtgggtcccacttgcccagctcctgcccacactct
gccattctgaccagagtcatcatgtctcctgagcagaggaatcagcactgcgagcctgag
gaaagccttgaggccgaaggagacgaggctttgggcctgagtcctcagggaacctctacc
tcccccactaccattgacaacactctatggagccaatctaatgagggctccagcagccaa
gaggaggctccaggcacctcaccagacccagcagacctgaagaccttgttccaagaagca
cttgatgggaaggaagtggataccactggccactcctacatccttgtcacttccctgggc
ctctcctatgacggcctgctgggtgatgatcagagcaagcccaatgcaggcctccagata
atggtcctgtgcacgattataatggagggcaactgcacccctgaagaggtcatatgggaa
accctgagtgtgatggggctgtatgctgggaagaagcacagtgtctatgggaagcccagg
aagctgctcacccaagactgggtgcaggaaaactacctggagtaccaccaggactggcag
cccagcccccagacttcaggccctccccggcctgaaggaaaatccttgtttgcaaagtgc
ctaatggcacttgcacaagtcttcagctgcaaagtccctattaccagtgaggtcaccttc
acaggccttcagcccgtcagcatccccttcctcggggccctgctcactcccagcctccat
ccccctgccatctcctccgccggtcgcgtgcggacacaaggatggggacctcccagcgag
gagcgctctgggcggggctccggacgcatgcgcggccctcgtacggaagcccggaaggag
gggcagggggcggtggctcaggtttctccgggcggcggcggcggcggcggcggcggcggc
ggcggcggcggcggcgacggcgacggcgacggcagcggggacggcagcagtagcgggagc
agcagcgtggacgcggctggcgctggcgccatgaacccgctgtaa

>gi568815575f:149613160_149816773|GENSCAN_predicted_peptide_2|449_aa
MVLVPLWLVLIELLIQLVTVAWQWALMPYLLQNTGDAPALLGKPCARRPSSKVSTMFSED
DFQSTERAPYGPQLQWSQDLPRVQVFREQANLEDRSPRRTQRITGGEQVLWGPITQIFPT
VRPADLTRVIMPLEQRSQHCKPEEGLQAQEEDLGLVGAQALQAEEQEAAFFSSTLNVGTL
EELPAAESPSPPQSPQEESFSPTAMDAIFGSLSDEGSGSQEKEGPSTSPDLIDPESFSQD
ILHDKIIDLVHLLLRKYRVKGLITKAEMLGSVIKNYEDYFPEIFREASVCMQLLFGIDVK
EVDPTSHSYVLVTSLNLSYDGIQCNEQSMPKSGLLIIVLGVIFMEGNCIPEEVMWEVLSI
MGVYAGREHFLFGEPKRLLTQNWVQEKYLVYRQVPGTDPACYEFLWGPRAHAETSKMKVL
EYIANANGRDPTSYPSLYEDALREEGEGV

>gi568815575f:149613160_149816773|GENSCAN_predicted_CDS_2|1350_bp
atggttttggtcccgctgtggctcgttttgattgagctcctcattcagctggtcacggtg
gcctggcagtgggccctgatgccttacctgctacagaacactggggatgcccctgcactg
ctgggaaagccctgtgctagaagaccttcctcaaaggtgagcactatgttctcagaggac
gacttccagtcaacagaaagagccccatatggtccacaactacagtggtcccaggatctg
ccaagagtccaggtttttagagaacaggccaacctggaggacaggagtcccaggagaacc
cagaggatcactggaggagaacaagtgctgtggggccccatcacccagatatttcccaca
gttcggcctgctgacctaaccagagtcatcatgcctcttgagcaaagaagtcagcactgc
aagcctgaggaaggccttcaggcccaagaagaagacctgggcctggtgggtgcacaggct
ctccaagctgaggagcaggaggctgccttcttctcctctactctgaatgtgggcactcta
gaggagttgcctgctgctgagtcaccaagtcctccccagagtcctcaggaagagtccttc
tctcccactgccatggatgccatctttgggagcctatctgatgagggctctggcagccaa
gaaaaggaggggccaagtacctcgcctgacctgatagaccctgagtccttttcccaagat
atactacatgacaagataattgatttggttcatttattgctccgcaagtatcgagtcaag
gggctgatcacaaaggcagaaatgctggggagtgtcatcaaaaattatgaggactacttt
cctgagatatttagggaagcctctgtatgcatgcaactgctctttggcattgatgtgaag
gaagtggaccccactagccactcctatgtccttgtcacctccctcaacctctcttatgat
ggcatacagtgtaatgagcagagcatgcccaagtctggcctcctgataatagtcctgggt
gtaatcttcatggaggggaactgcatccctgaagaggttatgtgggaagtcctgagcatt
atgggggtgtatgctggaagggagcacttcctctttggggagcccaagaggctccttacc
caaaattgggtgcaggaaaagtacctggtgtaccggcaggtgcccggcactgatcctgca
tgctatgagttcctgtggggtccaagggcccacgctgagaccagcaagatgaaagttctt
gagtacatagccaatgccaatgggagggatcccacttcttacccatccctgtatgaagat
gctttgagagaggagggagagggagtctga

>gi568815575f:149613160_149816773|GENSCAN_predicted_peptide_3|748_aa
MPEPPPRSVGSCAARASWTSATPCSTAPSPIDHPRAEECRRTARDWQAAPPVALVPDPLG
EASWAPESGGTNNSRRATLRAVTLTAKVRSFTPEPARPRTPPEGRNSEHQKEQTLDTPPL
RTVTLTVRVHGFILEVFRFTHDNACPFPFIPWRPQVSTMFSEDDFQSTERAPYGPQLQWS
QDLPRVQVVCVPLWILMSFLCLVVLYYIVWSVLFLRSMDVIAEQRRTHITMALSWMTIVV
PLLTFEILLVHKLDGHNAFSSIPIFVPLWLSLITLMATTFGQKGGNHWWFGIRKDFCQFL
LEIFPFLREYGNISYDLHHEDNEETEETPVPEPPKIAPMFRKKARVVIAQSPGKPEPVLE
LGLQVRPSHALFASSAYQPLTTTGRGSGRHMPSAAEVDARGTLPPPPPWGATPSWDIATS
ALELVQKLWRLVSSNQFSSIWWDDSGACRVINQKLFEKEILKRDVAHKVFATTSIKSFFR
QLNLYGFRKRRQCTFRTFTRIFSAKRLVSILNKLEFYCHPYFQRDSPHLLVRMKRRVGVK
SAPRHQEEDKPEAAGSCLAPADTEQQDHTSPNENDQVTPQHREPAGPNTQIRSGSAPPAT
PVMVPDSAVASDNSPVTQPAGEWSEGSQAHVTPVAAVPGPAALPFLYVPGSPTQMNSYGP
VVALPTASRSTLAMDTTGLPAPGMLPFCHLWVPVTLVAAGAAQPAASMVMFPHLPALHHH
CPHSHRTSQYMPASDGPQAYPDYADQST

>gi568815575f:149613160_149816773|GENSCAN_predicted_CDS_3|2247_bp
atgcctgagcctccccctcgctccgtgggctcctgtgctgcccgagcctcctggacgagc
gccactccctgctccacggcacccagtcccatcgaccacccaagggctgaggaatgccgg
cgcacggcgcgggactggcaggcagctccacctgtggccctggtgcccgatccactgggt
gaagccagctgggctcctgagtctggaggaacgaacaactccagacgcgccaccttaaga
gctgtaacactcaccgcgaaggtccgcagcttcactcctgagccggcgagaccacgaacc
ccaccagaaggaagaaactccgaacatcagaaggaacaaactctggacacgccgccctta
agaactgtaacactcaccgtgagggtccacggcttcattcttgaagtattcagatttact
catgacaatgcctgcccctttcccttcatcccctggaggccccaggtgagcactatgttc
tcagaggacgacttccagtcaacagaaagagccccgtatggtccacaactacagtggtcc
caggatctgccaagagtccaggttgtgtgtgtcccgctgtggattctcatgtcctttctg
tgcctggtggtcctctactacattgtgtggtccgtcttgttcttgcgctctatggatgtg
attgcggagcagcgcaggacacacataaccatggccctgagctggatgaccatcgtcgtg
ccccttcttacatttgagattctgctggttcacaaactggatggccacaacgccttctcc
agcatcccgatctttgtccccctttggctctcgttgatcacgctgatggcaaccacattt
ggacagaagggaggaaaccactggtggtttggtatccgcaaagatttctgtcagtttctg
cttgaaatcttcccatttctacgagaatatggaaacatttcctatgatctccatcacgaa
gataatgaagaaaccgaagagaccccagttccggagccccctaaaatcgcacccatgttt
cgaaagaaggccagggtggtcattgcccagagccctgggaaaccagaaccagtactggag
ctgggtctccaggtacgtccatctcatgccttgtttgcatccagcgcctatcagccactc
accacgacgggacgcggaagtggcaggcacatgccttctgctgcagaagtggacgcccgt
ggcacactcccccccccccccccgtggggtgccacgccttcatgggacattgccacttct
gccctggaactcgtgcagaaactgtggagactggtcagcagcaaccagttttcgtccatc
tggtgggatgacagtggggcttgtagagtgatcaatcaaaaactctttgaaaaggagatt
ctcaaaagggacgtcgcacacaaagtgtttgccacaacttcgataaagagcttcttccgc
cagctaaacttgtatggcttccgaaaacggcgtcaatgcactttcaggaccttcacccgc
attttctccgcaaaaaggctggtctccatcttgaataagttagagttctactgccatcct
tactttcaaagagactcccctcacctcctcgtgaggatgaagagaagagtgggtgtcaag
tctgcaccaagacatcaggaggaggacaagccagaagctgctggatcctgtctggcacca
gcagacactgagcaacaagatcacacgtctccgaatgagaatgaccaggtcacaccgcaa
caccgggaaccggccggtcccaacacccaaatcaggagtggctctgctccaccagcaact
cctgtgatggtgcctgattccgccgtggcgagtgacaacagtccagtgacccagccggcc
ggcgagtggtcagagggcagccaggctcacgtcactccggtggccgctgtccctgggcct
gcagcgctgcccttcctctatgtccctggatctcccactcagatgaattcttacgggcct
gtggtggcccttcccacagcgtcccgtagtacccttgccatggacaccacaggacttcct
gcacctggcatgctgcccttttgccatctctgggtaccggtgaccctagtggctgctggg
gctgcacagcctgctgcctccatggtcatgttcccccatctcccagctctgcaccaccat
tgcccccacagccaccgcacgtcacagtacatgccagctagcgatgggccccaggcgtac
ccagactacgcagaccagagcacatag

>gi568815575f:149613160_149816773|GENSCAN_predicted_peptide_4|379_aa
MNNPQEDFGTPTSYLKGSAGSRDRLTRRTGAPRGPRAALTKTCLWVSIAQLLPTLLTAAL
TRVIMSLEQRSPHCKPDEDLEAQGEDLGLMGAQEPTGEEEETTSSSDSKEEEVSAAGSSS
PPQSPQGGASSSISVYYTLWSQFDEGSSSQEEEEPSSSVDPAQLEFMFQEALKLKVAELV
HFLLHKYRVKEPVTKAEMLESVIKNYKRYFPVIFGKASEFMQVIFGTDVKEVDPAGHSYI
LVTALGLSCDSMLGDGHSMPKAALLIIVLGVILTKDNCAPEEVIWEALSVMGVYVGKEHM
FYGEPRKLLTQDWVQENYLEYRQVPGSDPAHYEFLWGSKAHAETSYEKVINYLVMLNARE
PICYPSLYEEVLGEEQEGV

>gi568815575f:149613160_149816773|GENSCAN_predicted_CDS_4|1140_bp
atgaataacccgcaggaggactttggaacacccacctcatacctgaagggttcagctggt
tctcgggacaggctaaccaggaggacaggagccccaagaggccccagagcagcactgacg
aagacctgcctgtgggtctccatcgcccagctcctgcccacgctcctgactgctgccctg
accagagtcatcatgtctctcgagcagaggagtccgcactgcaagcctgatgaagacctt
gaagcccaaggagaggacttgggcctgatgggtgcacaggaacccacaggcgaggaggag
gagactacctcctcctctgacagcaaggaggaggaggtgtctgctgctgggtcatcaagt
cctccccagagtcctcagggaggcgcttcctcctccatttccgtctactacactttatgg
agccaattcgatgagggctccagcagtcaagaagaggaagagccaagctcctcggtcgac
ccagctcagctggagttcatgttccaagaagcactgaaattgaaggtggctgagttggtt
catttcctgctccacaaatatcgagtcaaggagccggtcacaaaggcagaaatgctggag
agcgtcatcaaaaattacaagcgctactttcctgtgatcttcggcaaagcctccgagttc
atgcaggtgatctttggcactgatgtgaaggaggtggaccccgccggccactcctacatc
cttgtcactgctcttggcctctcgtgcgatagcatgctgggtgatggtcatagcatgccc
aaggccgccctcctgatcattgtcctgggtgtgatcctaaccaaagacaactgcgcccct
gaagaggttatctgggaagcgttgagtgtgatgggggtgtatgttgggaaggagcacatg
ttctacggggagcccaggaagctgctcacccaagattgggtgcaggaaaactacctggag
taccggcaggtgcccggcagtgatcctgcgcactacgagttcctgtggggttccaaggcc
cacgctgaaaccagctatgagaaggtcataaattatttggtcatgctcaatgcaagagag
cccatctgctacccatccctttatgaagaggttttgggagaggagcaagagggagtctga

>gi568815575f:149613160_149816773|GENSCAN_predicted_peptide_5|182_aa
MRPLQEKGSQRTGRPGGQKPQEAPEEHRRRRSICGFLPIAQLLPALQPAALTRVIMSSEQ
RSQHCKPEDGLEAQGQEALGLVGVQAPATEEHEAASSFTLIEGTLEELRKLLTQDWVQEN
YLQYRQVPSSDPPCYQFLWGPRALIETSYVKVLEYAARVSTKESISYPSLHEEALGEEEE
GV

>gi568815575f:149613160_149816773|GENSCAN_predicted_CDS_5|549_bp
atgaggcccctccaggagaaaggttctcagcggacaggccggccaggaggtcagaagccc
caggaggccccagaggagcaccgaaggagaagatctatctgtgggttcctccccatcgcc
cagctgctgcccgcactccagcctgctgccctgaccagagtcatcatgtcttctgagcag
aggagtcagcactgcaagcctgaggatggccttgaggcccaaggacaggaggctctgggc
ctggtgggtgtgcaggctcccgccaccgaggagcacgaggctgcctcctccttcactctg
attgaaggcaccctggaggagctgaggaagctgctcacccaagattgggtgcaggaaaac
tacctgcaataccgccaggtgcccagcagtgatcccccgtgctaccagttcctgtggggt
ccaagggccctcattgaaaccagctatgtgaaagtcctggagtatgcagccagggtcagt
actaaagagagcatttcctacccatccctgcatgaagaggctttgggagaggaggaagag
ggagtctga

>gi568815575f:149613160_149816773|GENSCAN_predicted_peptide_6|58_aa
XGYLCRFVAWVYCVSQRFGVLTTYHSDGSEKTSFALKFSEFASVYHVADAFKCINPSN

>gi568815575f:149613160_149816773|GENSCAN_predicted_CDS_6|177_bp
nnggggtacctgtgcagatttgttgcatgggtatattgcgtgtctcagaggtttggtgta
ctgacgacttatcactcagacggttctgagaagaccagttttgccttgaaattctctgaa
tttgcaagtgtctaccacgtagcagatgctttcaaatgcattaacccatcaaattga