GENSCAN 1.0 Date run: 7-Nov-116 Time: 04:41:58 Sequence gi568815575f:149674682_149876787 : 202106 bp : 43.68% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 8734 8777 44 0 2 86 78 15 0.145 0.22 1.02 Intr + 14894 14984 91 2 1 105 86 52 0.535 6.70 1.03 Intr + 39800 39895 96 1 0 63 94 67 0.874 5.01 1.04 Intr + 40923 40996 74 2 2 95 84 2 0.839 -1.30 1.05 Term + 41072 42095 1024 2 1 110 53 601 0.965 50.39 1.06 PlyA + 42358 42363 6 1.05 2.00 Prom + 56886 56925 40 -3.56 2.01 Init + 84111 84315 205 2 1 100 44 169 0.505 12.04 2.02 Intr + 85030 85229 200 2 2 5 81 137 0.625 3.77 2.03 Intr + 85473 85532 60 2 0 103 91 -4 0.475 0.33 2.04 Intr + 86098 86193 96 0 0 63 94 75 0.958 5.81 2.05 Intr + 94216 94392 177 0 0 86 99 105 0.856 11.52 2.06 Intr + 95018 95141 124 1 1 81 40 85 0.997 3.16 2.07 Intr + 96443 96641 199 0 1 68 47 189 0.970 11.21 2.08 Intr + 96768 96872 105 0 0 118 66 39 0.482 4.03 2.09 Intr + 97424 97535 112 2 1 16 98 56 0.422 -0.22 2.10 Intr + 100304 100564 261 1 0 101 90 191 0.726 18.28 2.11 Term + 101402 102109 708 1 0 33 44 390 0.970 22.61 2.12 PlyA + 102142 102147 6 1.05 3.00 Prom + 106460 106499 40 -4.16 3.01 Init + 110085 110142 58 2 1 74 44 45 0.162 0.17 3.02 Intr + 111321 111389 69 1 0 93 95 3 0.551 0.65 3.03 Term + 111465 112477 1013 1 2 122 53 1083 0.978 100.47 3.04 PlyA + 113037 113042 6 1.05 4.00 Prom + 125630 125669 40 -6.06 4.01 Init + 132907 132928 22 0 1 83 113 -2 0.755 1.69 4.02 Intr + 133315 133388 74 2 2 115 84 48 0.980 6.23 4.03 Intr + 133464 133688 225 2 0 121 36 195 0.910 15.78 4.04 Term + 134247 134474 228 2 0 21 53 202 0.784 6.63 4.05 PlyA + 134949 134954 6 1.05 5.04 PlyA - 136298 136293 6 1.05 5.03 Term - 138032 137935 98 0 2 98 49 64 0.425 1.63 5.02 Intr - 141923 141847 77 1 2 93 89 23 0.300 2.06 5.01 Init - 146904 146741 164 0 2 61 72 75 0.226 2.40 5.00 Prom - 147850 147811 40 -4.96 6.06 PlyA - 148022 148017 6 1.05 6.05 Term - 149213 148941 273 2 0 56 34 246 0.774 11.47 6.04 Intr - 149454 149368 87 0 0 75 41 79 0.631 2.07 6.03 Intr - 154845 154615 231 0 0 58 93 109 0.711 6.37 6.02 Intr - 159422 159192 231 2 0 78 93 43 0.584 1.77 6.01 Init - 166272 166267 6 0 0 70 87 0 0.160 -1.02 6.00 Prom - 173274 173235 40 -2.76 7.02 PlyA - 173754 173749 6 1.05 7.01 Sngl - 175746 175516 231 0 0 60 42 158 0.355 3.51 7.00 Prom - 177509 177470 40 -1.86 8.00 Prom + 179683 179722 40 -5.66 8.01 Init + 183412 183457 46 0 1 87 84 47 0.855 5.07 8.02 Term + 184233 184372 140 1 2 101 48 60 0.823 1.43 8.03 PlyA + 185095 185100 6 1.05 9.00 Prom + 188836 188875 40 -0.46 9.01 Init + 189942 190045 104 2 2 67 74 41 0.290 0.32 9.02 Term + 199507 199723 217 1 1 -20 54 211 0.534 3.52 9.03 PlyA + 199807 199812 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:149674682_149876787|GENSCAN_predicted_peptide_1|442_aa MEGKGLKVGVVDCLGQWALMPYLLQNTGDAPALLGKPCARRPSSKVSTMFSEDDFQSTER APYGPQLQWSQDLPRVQVFREQANLEDRSPRRTQRITGGEQVLWGPITQIFPTVRPADLT RVIMPLEQRSQHCKPEEGLQAQEEDLGLVGAQALQAEEQEAAFFSSTLNVGTLEELPAAE SPSPPQSPQEESFSPTAMDAIFGSLSDEGSGSQEKEGPSTSPDLIDPESFSQDILHDKII DLVHLLLRKYRVKGLITKAEMLGSVIKNYEDYFPEIFREASVCMQLLFGIDVKEVDPTSH SYVLVTSLNLSYDGIQCNEQSMPKSGLLIIVLGVIFMEGNCIPEEVMWEVLSIMGVYAGR EHFLFGEPKRLLTQNWVQEKYLVYRQVPGTDPACYEFLWGPRAHAETSKMKVLEYIANAN GRDPTSYPSLYEDALREEGEGV >gi568815575f:149674682_149876787|GENSCAN_predicted_CDS_1|1329_bp atggagggcaaggggctgaaagttggagtcgttgattgtttggggcagtgggccctgatg ccttacctgctacagaacactggggatgcccctgcactgctgggaaagccctgtgctaga agaccttcctcaaaggtgagcactatgttctcagaggacgacttccagtcaacagaaaga gccccatatggtccacaactacagtggtcccaggatctgccaagagtccaggtttttaga gaacaggccaacctggaggacaggagtcccaggagaacccagaggatcactggaggagaa caagtgctgtggggccccatcacccagatatttcccacagttcggcctgctgacctaacc agagtcatcatgcctcttgagcaaagaagtcagcactgcaagcctgaggaaggccttcag gcccaagaagaagacctgggcctggtgggtgcacaggctctccaagctgaggagcaggag gctgccttcttctcctctactctgaatgtgggcactctagaggagttgcctgctgctgag tcaccaagtcctccccagagtcctcaggaagagtccttctctcccactgccatggatgcc atctttgggagcctatctgatgagggctctggcagccaagaaaaggaggggccaagtacc tcgcctgacctgatagaccctgagtccttttcccaagatatactacatgacaagataatt gatttggttcatttattgctccgcaagtatcgagtcaaggggctgatcacaaaggcagaa atgctggggagtgtcatcaaaaattatgaggactactttcctgagatatttagggaagcc tctgtatgcatgcaactgctctttggcattgatgtgaaggaagtggaccccactagccac tcctatgtccttgtcacctccctcaacctctcttatgatggcatacagtgtaatgagcag agcatgcccaagtctggcctcctgataatagtcctgggtgtaatcttcatggaggggaac tgcatccctgaagaggttatgtgggaagtcctgagcattatgggggtgtatgctggaagg gagcacttcctctttggggagcccaagaggctccttacccaaaattgggtgcaggaaaag tacctggtgtaccggcaggtgcccggcactgatcctgcatgctatgagttcctgtggggt ccaagggcccacgctgagaccagcaagatgaaagttcttgagtacatagccaatgccaat gggagggatcccacttcttacccatccctgtatgaagatgctttgagagaggagggagag ggagtctga >gi568815575f:149674682_149876787|GENSCAN_predicted_peptide_2|748_aa MPEPPPRSVGSCAARASWTSATPCSTAPSPIDHPRAEECRRTARDWQAAPPVALVPDPLG EASWAPESGGTNNSRRATLRAVTLTAKVRSFTPEPARPRTPPEGRNSEHQKEQTLDTPPL RTVTLTVRVHGFILEVFRFTHDNACPFPFIPWRPQVSTMFSEDDFQSTERAPYGPQLQWS QDLPRVQVVCVPLWILMSFLCLVVLYYIVWSVLFLRSMDVIAEQRRTHITMALSWMTIVV PLLTFEILLVHKLDGHNAFSSIPIFVPLWLSLITLMATTFGQKGGNHWWFGIRKDFCQFL LEIFPFLREYGNISYDLHHEDNEETEETPVPEPPKIAPMFRKKARVVIAQSPGKPEPVLE LGLQVRPSHALFASSAYQPLTTTGRGSGRHMPSAAEVDARGTLPPPPPWGATPSWDIATS ALELVQKLWRLVSSNQFSSIWWDDSGACRVINQKLFEKEILKRDVAHKVFATTSIKSFFR QLNLYGFRKRRQCTFRTFTRIFSAKRLVSILNKLEFYCHPYFQRDSPHLLVRMKRRVGVK SAPRHQEEDKPEAAGSCLAPADTEQQDHTSPNENDQVTPQHREPAGPNTQIRSGSAPPAT PVMVPDSAVASDNSPVTQPAGEWSEGSQAHVTPVAAVPGPAALPFLYVPGSPTQMNSYGP VVALPTASRSTLAMDTTGLPAPGMLPFCHLWVPVTLVAAGAAQPAASMVMFPHLPALHHH CPHSHRTSQYMPASDGPQAYPDYADQST >gi568815575f:149674682_149876787|GENSCAN_predicted_CDS_2|2247_bp atgcctgagcctccccctcgctccgtgggctcctgtgctgcccgagcctcctggacgagc gccactccctgctccacggcacccagtcccatcgaccacccaagggctgaggaatgccgg cgcacggcgcgggactggcaggcagctccacctgtggccctggtgcccgatccactgggt gaagccagctgggctcctgagtctggaggaacgaacaactccagacgcgccaccttaaga gctgtaacactcaccgcgaaggtccgcagcttcactcctgagccggcgagaccacgaacc ccaccagaaggaagaaactccgaacatcagaaggaacaaactctggacacgccgccctta agaactgtaacactcaccgtgagggtccacggcttcattcttgaagtattcagatttact catgacaatgcctgcccctttcccttcatcccctggaggccccaggtgagcactatgttc tcagaggacgacttccagtcaacagaaagagccccgtatggtccacaactacagtggtcc caggatctgccaagagtccaggttgtgtgtgtcccgctgtggattctcatgtcctttctg tgcctggtggtcctctactacattgtgtggtccgtcttgttcttgcgctctatggatgtg attgcggagcagcgcaggacacacataaccatggccctgagctggatgaccatcgtcgtg ccccttcttacatttgagattctgctggttcacaaactggatggccacaacgccttctcc agcatcccgatctttgtccccctttggctctcgttgatcacgctgatggcaaccacattt ggacagaagggaggaaaccactggtggtttggtatccgcaaagatttctgtcagtttctg cttgaaatcttcccatttctacgagaatatggaaacatttcctatgatctccatcacgaa gataatgaagaaaccgaagagaccccagttccggagccccctaaaatcgcacccatgttt cgaaagaaggccagggtggtcattgcccagagccctgggaaaccagaaccagtactggag ctgggtctccaggtacgtccatctcatgccttgtttgcatccagcgcctatcagccactc accacgacgggacgcggaagtggcaggcacatgccttctgctgcagaagtggacgcccgt ggcacactcccccccccccccccgtggggtgccacgccttcatgggacattgccacttct gccctggaactcgtgcagaaactgtggagactggtcagcagcaaccagttttcgtccatc tggtgggatgacagtggggcttgtagagtgatcaatcaaaaactctttgaaaaggagatt ctcaaaagggacgtcgcacacaaagtgtttgccacaacttcgataaagagcttcttccgc cagctaaacttgtatggcttccgaaaacggcgtcaatgcactttcaggaccttcacccgc attttctccgcaaaaaggctggtctccatcttgaataagttagagttctactgccatcct tactttcaaagagactcccctcacctcctcgtgaggatgaagagaagagtgggtgtcaag tctgcaccaagacatcaggaggaggacaagccagaagctgctggatcctgtctggcacca gcagacactgagcaacaagatcacacgtctccgaatgagaatgaccaggtcacaccgcaa caccgggaaccggccggtcccaacacccaaatcaggagtggctctgctccaccagcaact cctgtgatggtgcctgattccgccgtggcgagtgacaacagtccagtgacccagccggcc ggcgagtggtcagagggcagccaggctcacgtcactccggtggccgctgtccctgggcct gcagcgctgcccttcctctatgtccctggatctcccactcagatgaattcttacgggcct gtggtggcccttcccacagcgtcccgtagtacccttgccatggacaccacaggacttcct gcacctggcatgctgcccttttgccatctctgggtaccggtgaccctagtggctgctggg gctgcacagcctgctgcctccatggtcatgttcccccatctcccagctctgcaccaccat tgcccccacagccaccgcacgtcacagtacatgccagctagcgatgggccccaggcgtac ccagactacgcagaccagagcacatag >gi568815575f:149674682_149876787|GENSCAN_predicted_peptide_3|379_aa MNNPQEDFGTPTSYLKGSAGSRDRLTRRTGAPRGPRAALTKTCLWVSIAQLLPTLLTAAL TRVIMSLEQRSPHCKPDEDLEAQGEDLGLMGAQEPTGEEEETTSSSDSKEEEVSAAGSSS PPQSPQGGASSSISVYYTLWSQFDEGSSSQEEEEPSSSVDPAQLEFMFQEALKLKVAELV HFLLHKYRVKEPVTKAEMLESVIKNYKRYFPVIFGKASEFMQVIFGTDVKEVDPAGHSYI LVTALGLSCDSMLGDGHSMPKAALLIIVLGVILTKDNCAPEEVIWEALSVMGVYVGKEHM FYGEPRKLLTQDWVQENYLEYRQVPGSDPAHYEFLWGSKAHAETSYEKVINYLVMLNARE PICYPSLYEEVLGEEQEGV >gi568815575f:149674682_149876787|GENSCAN_predicted_CDS_3|1140_bp atgaataacccgcaggaggactttggaacacccacctcatacctgaagggttcagctggt tctcgggacaggctaaccaggaggacaggagccccaagaggccccagagcagcactgacg aagacctgcctgtgggtctccatcgcccagctcctgcccacgctcctgactgctgccctg accagagtcatcatgtctctcgagcagaggagtccgcactgcaagcctgatgaagacctt gaagcccaaggagaggacttgggcctgatgggtgcacaggaacccacaggcgaggaggag gagactacctcctcctctgacagcaaggaggaggaggtgtctgctgctgggtcatcaagt cctccccagagtcctcagggaggcgcttcctcctccatttccgtctactacactttatgg agccaattcgatgagggctccagcagtcaagaagaggaagagccaagctcctcggtcgac ccagctcagctggagttcatgttccaagaagcactgaaattgaaggtggctgagttggtt catttcctgctccacaaatatcgagtcaaggagccggtcacaaaggcagaaatgctggag agcgtcatcaaaaattacaagcgctactttcctgtgatcttcggcaaagcctccgagttc atgcaggtgatctttggcactgatgtgaaggaggtggaccccgccggccactcctacatc cttgtcactgctcttggcctctcgtgcgatagcatgctgggtgatggtcatagcatgccc aaggccgccctcctgatcattgtcctgggtgtgatcctaaccaaagacaactgcgcccct gaagaggttatctgggaagcgttgagtgtgatgggggtgtatgttgggaaggagcacatg ttctacggggagcccaggaagctgctcacccaagattgggtgcaggaaaactacctggag taccggcaggtgcccggcagtgatcctgcgcactacgagttcctgtggggttccaaggcc cacgctgaaaccagctatgagaaggtcataaattatttggtcatgctcaatgcaagagag cccatctgctacccatccctttatgaagaggttttgggagaggagcaagagggagtctga >gi568815575f:149674682_149876787|GENSCAN_predicted_peptide_4|182_aa MRPLQEKGSQRTGRPGGQKPQEAPEEHRRRRSICGFLPIAQLLPALQPAALTRVIMSSEQ RSQHCKPEDGLEAQGQEALGLVGVQAPATEEHEAASSFTLIEGTLEELRKLLTQDWVQEN YLQYRQVPSSDPPCYQFLWGPRALIETSYVKVLEYAARVSTKESISYPSLHEEALGEEEE GV >gi568815575f:149674682_149876787|GENSCAN_predicted_CDS_4|549_bp atgaggcccctccaggagaaaggttctcagcggacaggccggccaggaggtcagaagccc caggaggccccagaggagcaccgaaggagaagatctatctgtgggttcctccccatcgcc cagctgctgcccgcactccagcctgctgccctgaccagagtcatcatgtcttctgagcag aggagtcagcactgcaagcctgaggatggccttgaggcccaaggacaggaggctctgggc ctggtgggtgtgcaggctcccgccaccgaggagcacgaggctgcctcctccttcactctg attgaaggcaccctggaggagctgaggaagctgctcacccaagattgggtgcaggaaaac tacctgcaataccgccaggtgcccagcagtgatcccccgtgctaccagttcctgtggggt ccaagggccctcattgaaaccagctatgtgaaagtcctggagtatgcagccagggtcagt actaaagagagcatttcctacccatccctgcatgaagaggctttgggagaggaggaagag ggagtctga >gi568815575f:149674682_149876787|GENSCAN_predicted_peptide_5|112_aa MNKFLDTYAFSSLNQEEVESLNRPITPSDIEAIINSLPTKKRRGPDRFTAELYQRGYLCR FVAWVYCVSQRFGVLTTYHSDGSEKTSFALKFSEFASVYHVADAFKCINPSN >gi568815575f:149674682_149876787|GENSCAN_predicted_CDS_5|339_bp atgaataaattcctggacacatacgccttctcaagcctaaaccaggaagaagttgaatcc ctgaatagaccaataacaccctccgatattgaggcaataattaatagcctaccaaccaaa aaacgtcgaggaccagacagattcacagccgaattgtaccagagggggtacctgtgcaga tttgttgcatgggtatattgcgtgtctcagaggtttggtgtactgacgacttatcactca gacggttctgagaagaccagttttgccttgaaattctctgaatttgcaagtgtctaccac gtagcagatgctttcaaatgcattaacccatcaaattga >gi568815575f:149674682_149876787|GENSCAN_predicted_peptide_6|275_aa MIKGYTRQQFCHESTPNKGDRVIYNLTCPPYCCVWFPLAGTGPHILYLSRLASNLELFKR GKGRGEQRKEEVTCGMLRKKGYTRQQFCHENTPNKGDRVIYDLTHPPYCCVRFPLAGTGP HILYSSQMASNLGVFRRGKGRGEQRKEEVNCGMLRKPPRVIDTQANGVWSGPRQTPTDLQ LKVLTKCSSLPATEQSWMENDFDELRKEGFRRSVITNFSKLKEDVQTHRKEAKNLEKGLD EWLTRINSIEKTLNDLMELKTMARELRDICTSFSS >gi568815575f:149674682_149876787|GENSCAN_predicted_CDS_6|828_bp atgattaaagggtacactcgccagcagttttgccatgagagtacaccaaacaaaggagac agggtcatttataacctgacatgtccaccctactgctgtgtctggtttccattggctgga acgggacctcacattctttatttgtcccgattggctagcaacttagagctttttaaaaga ggcaaaggtagaggagaacaaaggaaggaggaagtaacttgtggaatgctgagaaagaaa gggtacactcgccagcagttttgccacgagaatacaccaaacaaaggagacagggtcatt tatgacctgacgcatccaccctactgctgtgtccggtttccattggctggaacgggacct cacattctgtattcgtcccaaatggctagcaacttaggagtttttagaagaggcaaaggt agaggagaacaaaggaaggaggaagtaaattgtggaatgctgagaaagcctcctcgggtg attgatacccaggcaaacggggtctggagtggacctaggcaaactccaacagacctgcag ctgaaggtcctgacgaaatgcagctccttgccagcaacagaacaaagctggatggagaat gactttgacgagttgagaaaagaaggcttcagacgatcggtaataacaaacttctccaag ctaaaggaggatgttcaaacccatcgcaaagaagctaaaaatcttgaaaaaggattagat gaatggctaactagaataaacagcatagagaagaccttaaatgacctgatggagctgaaa accatggcacgagaactacgtgacatatgcacaagcttcagtagctga >gi568815575f:149674682_149876787|GENSCAN_predicted_peptide_7|76_aa MNPGIKSQKYQGNQYKNVTLEELITLVSVRVKGDSKAVSGISPAFVKGSPESSAVPFGSY HGLPETAKRTNNSKSI >gi568815575f:149674682_149876787|GENSCAN_predicted_CDS_7|231_bp atgaatcctggaatcaagagccaaaagtatcaggggaatcaatacaaaaatgtgacccta gaggaacttataacccttgtgtctgtgagagtgaaaggggactcaaaagctgtcagcggc atctcccctgcatttgtcaaggggtctccagagtcatcagcagtgccttttgggtcctat catgggttgccagaaactgctaaaagaacaaataacagcaaatcaatttaa >gi568815575f:149674682_149876787|GENSCAN_predicted_peptide_8|61_aa MHRDSFLATSDNWGRGVNIGMDTSSSAPPPCTATSTGANTLMEASSSRPACVPPLLLPTL A >gi568815575f:149674682_149876787|GENSCAN_predicted_CDS_8|186_bp atgcaccgggactcattcctggcaaccagcgacaactggggaaggggtgtgaacataggc atggacaccagcagctctgccccaccaccttgcactgccacctctactggtgcaaacaca ctcatggaagccagcagctccaggcccgcctgtgtcccacccctgctgttgccaacattg gcatga >gi568815575f:149674682_149876787|GENSCAN_predicted_peptide_9|106_aa MSFAAPWMELEAIILSELTQEQKTKYHMFSVISGRLIKDLNVKLMKTLENNLNNTIEDIA MGKDFMTNMLKAIATKAKIDKWDLIKLKRFCTAKETINKVNRKPAK >gi568815575f:149674682_149876787|GENSCAN_predicted_CDS_9|321_bp atgtcctttgcagcaccatggatggagctggaggccattattctaagtgaactaacacag gaacagaaaaccaaataccacatgttctcagttataagtgggagattgatcaaagactta aatgtaaaacttatgaaaaccctggaaaataacctaaacaataccattgaggacatagca atgggcaaagatttcatgacaaacatgctgaaagcaattgcaacaaaagcaaaaattgac aaatgggatctaattaagctaaaacgcttctgcacagcaaaggaaactatcaacaaagtg aacagaaagcctgcaaaatga