GENSCAN 1.0 Date run: 4-Nov-116 Time: 16:16:08 Sequence gi568815585f:27522873_27723247 : 200375 bp : 39.49% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5388 5446 59 2 2 79 97 42 0.429 5.04 1.02 Intr + 10277 10564 288 2 0 50 73 279 0.496 18.04 1.03 Intr + 12694 12785 92 1 2 0 59 116 0.437 -1.38 1.04 Term + 15729 15928 200 1 2 83 38 153 0.795 6.38 1.05 PlyA + 16889 16894 6 1.05 2.00 Prom + 23293 23332 40 -4.75 2.01 Init + 25031 25179 149 1 2 31 80 159 0.448 9.01 2.02 Term + 26553 26652 100 0 1 119 37 67 0.790 1.32 2.03 PlyA + 26756 26761 6 1.05 3.09 PlyA - 27354 27349 6 1.05 3.08 Term - 27619 27457 163 0 1 97 47 73 0.346 0.53 3.07 Intr - 30567 30336 232 1 1 102 84 306 0.604 27.51 3.06 Intr - 33541 33364 178 1 1 95 94 112 0.991 11.07 3.05 Intr - 37113 36970 144 0 0 82 100 61 0.987 6.26 3.04 Intr - 39909 39541 369 0 0 90 97 454 0.991 40.88 3.03 Intr - 44967 44768 200 1 2 53 69 212 0.996 13.95 3.02 Intr - 46404 46157 248 2 2 63 45 204 0.804 9.78 3.01 Init - 58831 58425 407 1 2 60 115 142 0.838 10.21 3.00 Prom - 68134 68095 40 -6.15 4.00 Prom + 68151 68190 40 -6.45 4.01 Init + 69731 69782 52 1 1 70 58 41 0.013 0.67 4.02 Intr + 76622 76703 82 0 1 74 103 36 0.079 1.48 4.03 Intr + 97295 97765 471 2 0 43 54 240 0.171 7.37 4.04 Intr + 98566 98723 158 1 2 102 110 72 0.821 9.53 4.05 Intr + 98789 98870 82 0 1 21 99 20 0.588 -5.82 4.06 Intr + 99063 99137 75 0 0 37 82 112 0.567 3.21 4.07 Intr + 100003 100238 236 1 2 11 65 211 0.007 7.51 4.08 Intr + 104898 104959 62 1 2 74 75 36 0.004 -1.57 4.09 Intr + 118820 118895 76 1 1 41 80 103 0.276 2.97 4.10 Intr + 122909 122981 73 0 1 86 98 -3 0.558 -1.95 4.11 Intr + 125507 125581 75 2 0 120 111 79 0.989 11.11 4.12 Intr + 132020 132133 114 2 0 60 81 98 0.870 4.94 4.13 Intr + 138716 138833 118 2 1 73 55 105 0.698 5.25 4.14 Intr + 141296 141326 31 1 1 123 60 48 0.422 2.29 4.15 Intr + 142891 143077 187 2 1 20 77 323 0.239 22.23 4.16 Intr + 154162 154193 32 1 2 50 115 19 0.003 -2.34 4.17 Intr + 155871 156064 194 1 2 84 49 95 0.007 3.49 4.18 Term + 157356 157409 54 2 0 93 37 57 0.061 -2.32 4.19 PlyA + 159523 159528 6 1.05 5.00 Prom + 165928 165967 40 -6.05 5.01 Sngl + 169904 170287 384 1 0 66 42 202 0.967 9.43 5.02 PlyA + 172521 172526 6 1.05 6.03 PlyA - 172622 172617 6 1.05 6.02 Term - 173921 173533 389 0 2 -29 49 473 0.952 25.82 6.01 Intr - 197541 197345 197 2 2 64 64 113 0.105 4.64 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 100003 100378 376 1 1 11 37 306 0.821 11.03 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585f:27522873_27723247|GENSCAN_predicted_peptide_1|212_aa MGVAHEEALMNGLGHPLGDKYPTLPAAASLTSCTGSHRLQTGCKHRGAASEGGQSIEPAE GAETRPSELGTSRASADHWWCPSSQGSDARLLQSAEKTNRTLHCYFLPDAHESHLCLQPY HPERAQSCLNVERGRVLLMGKENARRARKRDRDKLLLKRNHLQLDMPNGGDAWNAAGRSL SREAFKQELDDHITAGTCIPEKWTRASLSKVP >gi568815585f:27522873_27723247|GENSCAN_predicted_CDS_1|639_bp atgggggtggctcatgaggaggccctcatgaatggcttgggccatccccttggtgataaa tatccaacccttcctgctgcagcttccctgacctcctgcaccgggagccaccggctgcaa acaggctgcaaacaccggggagctgcaagcgagggcggccagtccatcgagccagcagag ggcgctgaaacgcggcccagtgaactagggacgtccagggccagcgctgaccactggtgg tgcccgagctcccagggctctgatgcccggctgctccagtccgcagagaaaactaaccgg acactccactgctactttcttcccgatgctcatgagtctcacctttgtctacagccatac caccctgaacgtgctcaatcttgtctgaacgttgaacgtggcagagtgctcctcatgggg aaagaaaatgcgagaagagctaggaagagggacagggacaaattattactaaaaaggaat cacttacaattagacatgcccaacggaggcgatgcctggaatgcagcagggaggtccctg tcacgggaagccttcaagcaggagctagatgaccatatcacagcggggacctgtatacca gaaaaatggacaagagcttccctttctaaggttccatga >gi568815585f:27522873_27723247|GENSCAN_predicted_peptide_2|82_aa MVLVMTVAVHAVVTGFADIRPEGYRTRSMIQLLKTKTHQKVFTPSGAQFGARALIRRCTL SEDFCSVVEAGRGVKVQGSRAR >gi568815585f:27522873_27723247|GENSCAN_predicted_CDS_2|249_bp atggtattggttatgacagtggcagtacacgctgttgttactggctttgctgacatcagg cctgagggatacagaactcgttctatgattcagttactgaaaacaaagacccaccagaag gtctttactccatctggggcacagtttggtgcccgggctttgataagacgctgtacactg agcgaggacttttgctcagtggtagaggctggcagaggggttaaagtccagggatctaga gccagatag >gi568815585f:27522873_27723247|GENSCAN_predicted_peptide_3|646_aa MGTTSDEMVSVEQTSSSSLNPLCFECGQQHWTRENHLYNYQNEVDDDLVCHICLQPLLQP LDTPCGHTFCYKCLRNFLQEKDFCPLDRKRLHFKLCKKSSILVHKLLDKLLVLCPFSSVC KDVMQRCDLEAHLKNRCPGASHRRVALERRKTSRTQAEIENENGPTLLDPAGTLSPEADC LGTGAVPVERHLTSASLSTWSEEPGLDNPAFEESAGADTTQQPLSLPEGEITTIEIHRSN PYIQLGISIVGGNETPLINIVIQEVYRDGVIARDGRLLAGDQILQVNNYNISNVSHNYAR AVLSQPCNTLHLTVLRERRFGNRAHNHSDSNSPREEIFQVALHKRDSGEQLGIKLVRRTD EPGVFILDLLEGGLAAQDGRLSSNDRVLAINGHDLKYGTPELAAQIIQASGERVNLTIAR PGKPQPGNTIREAGNHSSSSQHHTPPPYYSRPSSHKDLTQCVTCQEKHITVKKEPHESLG MTVAGGRGSKSGELPIFVTSVPPHGCLARDGRIKRGDVLLNINGIDLTNLSHSEAVAMLK ASAASPAVALKALEVQIVEEATQNAEEQPSTFSENEYDASWSPSWVMWLGLPSTLHSCHD IVLRRSYLGSWGFSIVGGYEENHTNQPFFIKTIVLGTPAYYDGRLK >gi568815585f:27522873_27723247|GENSCAN_predicted_CDS_3|1941_bp atgggaacaacaagtgatgagatggtgtctgtggaacagacctcctcctcttctctaaac cccctgtgttttgaatgtggccaacagcactggacaagagaaaaccatttgtacaattac cagaatgaagtggatgatgacctagtctgccatatttgccttcaacctctgctgcagcca ctagacacaccctgtggacatacattctgctacaagtgcctcagaaactttttacaagag aaagatttctgtccgttggaccggaaaagacttcattttaagttgtgcaagaagtctagt attctagttcataaactcctagacaaattattagttttatgtccattttcttcagtgtgc aaagatgtaatgcaacgttgtgatctggaggcacatctcaaaaacagatgtcctggagct tctcatcggagagttgccctggagagaaggaaaactagtagaactcaagcagagattgag aatgaaaatgggcccactctactagatcctgcaggtaccttatctccagaagcagactgt ttggggacaggcgcagtgcctgtggagcggcacttgacatcagcgtctctttccacatgg agtgaggagcctggccttgacaaccctgcctttgaggagagcgctggagctgacaccaca caacagccacttagtttaccagaaggagaaatcaccacgattgaaattcatcggtccaat ccttacattcagttaggaatcagcattgtgggtggcaacgaaacacctttgattaacatt gtcatccaggaggtctatcgggatggggtcattgccagagacgggagacttcttgctgga gaccagattcttcaggtcaacaactacaatatcagcaatgtgtcccataactatgcccga gctgtcctttcccagccctgcaacacactgcatcttactgtgcttcgagagaggcgcttt ggcaaccgagcacacaaccattctgatagtaactctccacgagaagagattttccaagtg gctcttcataaacgggactctggtgaacagcttggcattaaattggtgcgaaggacagat gagccaggggtttttattcttgacctgttggaaggggggttggctgcccaggacggcagg ctaagcagcaatgaccgagtgctggccatcaatgggcacgacctgaagtatggaactccg gagcttgctgcccagattattcaggccagtggagagagagtgaatttaacaattgctaga ccagggaaaccccagcctggtaacaccattagagaagcaggaaatcatagcagcagcagc cagcaccacacaccaccaccgtattatagcagaccaagctcacataaggatcttactcag tgtgttacatgccaagaaaaacacattactgtaaagaaggaaccacatgaatcccttggc atgaccgttgctgggggcaggggaagtaagagtggtgagctgcccatctttgtgaccagt gtgccaccccatggctgccttgcacgagatggcagaataaagagaggtgatgtgttgcta aatatcaacggcattgatttgaccaatttaagtcacagtgaggcagttgcaatgctgaaa gccagtgccgcgtcccctgctgttgcccttaaagcacttgaggtccagattgttgaggag gcgactcagaacgcggaggagcagccgagtactttcagcgaaaatgagtatgatgccagt tggtccccatcatgggtcatgtggcttgggcttcccagcacacttcatagctgccacgat atagttttacgaagaagttacttgggaagttggggctttagtatcgttggtggatatgaa gagaaccacaccaatcagccttttttcattaaaactattgtcttgggaactcctgcttat tatgatggaagattaaagtga >gi568815585f:27522873_27723247|GENSCAN_predicted_peptide_4|723_aa MVWKAIRLEKITKSVDESSSLQSELTIPVKEIHLRYRQYTYVLTRPTPESTPAPQPGRAK TAAKDQGSGASCGAAPAASRGAALTHTQLTPAPEAAARSQREGAQRGGQGKVTHWESAAA RAAHGLLAPLLGTSSRSPRPAEGAESLPGSAPPGIAAAAAAAADFLAKSKLRPRLHFSTA PAAFPGAAPRPSGPTRTRANRRCVDRLQFKPVVPQKAREGRGPRKGHPGSTPPQLGSRAA FGLATGKGSDLGAAGLSGPRGKAALAPPPAPPAASGLAWSPRAPDPPAPPEDPETAPAME EDQELERKISGLKTSMAEGERKTALEMVQAAGTDRHCVTFVLHEEDHTLGNSLRYMIMKN PEVEFCGYTTTHPSESKINLRIQTREWVSGRKLEDSSELGTFVIVQWAYKKGAVTSIDVE QTGRSGKAWWRGSLSHTYIFFSLCTRKPEGILKRARKAIEELLKEAKRGKTRAETMGPMG CANVNLLEKSKLYQNYGNSFDFVDCLMSLEDSQGSRVTWAQMVTLKLIREKPGMVECLMP VYHVISNTVEDRGRREMAECPNASVPPGEQDHEQKEGDKEPAKSQAQKEENPKKHRSHPY KHSFRARGSASYSPPRKRSSQDKYEKRSNRRGTSSPTSSIHDNPVCCSCVWKEQAIKWDF GAESFTAQLAMWTPCTDGVSIVKTSTGGESGEWTSGKGCTAEVMSQHVSDHLAKSLSSDY ESV >gi568815585f:27522873_27723247|GENSCAN_predicted_CDS_4|2172_bp atggtatggaaagccatcagattggaaaagatcactaaaagtgtagatgagtcatcctca ttgcaatctgaacttacgattcccgtaaaagaaattcatttacgttacagacaatatact tatgttctaaccagacccacacccgagtccacgccagccccgcagcccgggcgagcgaag acagccgcgaaggaccaaggctccggggcgagctgtggcgccgctccggccgcctcccgg ggcgcggcgctgacccacacccagctgacgcccgcacctgaggcggctgcccgcagccag cgagagggcgcgcagcgaggcgggcagggcaaagttactcactgggaaagcgcggctgcc cgggccgcgcacgggctcctggcgccgctgctcggcaccagctcccgctctccgcggccc gctgagggagcggagagcctgcccggctccgctccgccaggaatcgccgccgccgccgcc gccgccgcggatttcctggcaaaaagcaaactccggccaagacttcacttctcaaccgcc cctgctgccttcccgggggcagcgccaaggccgagcggcccaacccgcacccgggccaat cggaggtgtgttgaccgactacagtttaagcctgtcgtcccacaaaaggcccgggagggc cggggtcccaggaagggccatcctgggtcaacaccgccccagttgggaagtcgggccgct ttcggacttgccactgggaagggaagtgatttgggggcggcaggcctgtctgggcctcgg ggtaaggcggcgctggccccgccccctgcccctcccgccgcgagtggcctcgcgtggtcg cccagagcccccgatccgccagcaccacctgaggatccagaaaccgccccagcgatggaa gaggatcaggagctggagagaaaaatatctggattgaagacctcaatggctgaaggcgag aggaagacagccctggaaatggtccaggcagctggaacagatagacactgtgtgacattt gtattgcacgaggaagaccataccctaggaaattctctacgttacatgatcatgaagaac ccggaagtggaattttgtggttacactacgacccatccttcagagagcaaaattaattta cgcattcagactcgagaatgggtgagtggaaggaaactggaagacagcagtgaactgggg acatttgtaatagtccagtgggcctataagaaaggagcagttacatctattgatgtggag caaacagggagatcgggaaaggcttggtggagaggcagtctttctcatacatatatcttc ttttctctttgtactcgtaaaccagaaggcatccttaagcgtgccaggaaagcaatagaa gaactgcttaaggaggcaaaacgtgggaaaactagagctgaaacaatgggacccatgggt tgtgcaaatgtcaaccttctggagaaaagcaaattatatcaaaactatggaaatagcttt gactttgtagattgcctcatgagtctggaggactctcagggttcacgggtcacatgggct cagatggttaccctaaagctgatacgagaaaagccaggaatggtggagtgtctaatgcca gtttatcatgtcatctcaaatactgtggaagatcgggggaggcgggaaatggcagaatgt cctaatgccagcgtgcccccaggagagcaagaccatgaacaaaaagagggcgataaggaa ccagcgaagagccaggcccagaaagaagaaaacccgaagaaacacagaagccatccttac aagcacagcttccgcgctcgaggttccgccagttactccccgccacgaaagcggagcagc caggacaagtacgaaaagcggtccaaccggcgaggaacttccagtcctacaagctctatt catgacaatccagtttgctgcagttgtgtctggaaagagcaggcaatcaaatgggatttt ggagctgaatcattcaccgcccagctagcaatgtggacaccttgcacagatggtgtttcc attgttaaaacttccactggtggtgaatcaggagagtggacttctggaaagggatgcact gcagaggtgatgagtcagcatgtctctgaccatctggccaaatcattgtccagcgactat gagtcagtatag >gi568815585f:27522873_27723247|GENSCAN_predicted_peptide_5|127_aa MDLLTIAPNSLLLEFLLPIPATLSSAGLELLVAKVVVVGKWENASSRDHMNGSTELEDET ATRPFWASYATEATCVKGGRGGVTLLVEVIDSNYQTEIGGKEDYVWNPGDSLGCLLSGYT ILHPPQQ >gi568815585f:27522873_27723247|GENSCAN_predicted_CDS_5|384_bp atggatcttctcaccattgcacctaatagtcttctcctggaatttttgcttcctatccca gcaactttgagctctgctggtttggaactcttagtcgccaaggtggtggtagttgggaag tgggagaatgcttccagcagggatcacatgaatggttccacagaactagaagatgagact gctaccaggccattttgggcatcttatgccactgaagcaacatgcgtaaaaggagggagg ggtggtgttaccctacttgttgaagtgattgattccaattaccaaactgaaattggaggc aaggaggactatgtgtggaacccaggggactctctggggtgcctcttaagtggctatacc attttgcatcccccccagcaatga >gi568815585f:27522873_27723247|GENSCAN_predicted_peptide_6|195_aa XSYGLSYPQPGTGGGPQGAYEVPPQVLSSRPLATSSSPALATISSDPLATTYRPCSLPAG DCSVQKGGSKISQKKVAADEEEEEDDDDEDDEETEKKSTSEEIYTRYSSQKSTEVKREWK RLKTINTKIKKTRILQKEKKTETPKGSSSVEDIKAKMQASVEKGGSLPKVETTFINYVKN CFWMTDQEAIQDLWQ >gi568815585f:27522873_27723247|GENSCAN_predicted_CDS_6|588_bp nnctcttatggtctgagctatccccagcctggcacaggaggaggacctcaaggggcctat gaagttcctccgcaggttctctccagcaggccccttgccacctcctcctcccctgccttg gctaccatatcgtcagatcccctggccaccacatatcgtccctgttccctgccagccggg gactgctcagtgcagaaaggtggtagcaagatttcacagaaaaaagttgctgctgatgaa gaagaagaagaagatgatgatgatgaagatgatgaggaaactgaaaaaaaaagcacaagt gaagaaatctataccagatactccagccaaaaaagcacagaagtcaaacgagaatggaaa agactcaaaaccatcaacaccaagataaaaaagacaagaatccttcaaaaagagaaaaaa actgaaacaccaaaaggatctagttctgtagaagatattaaagcaaaaatgcaagcaagt gtagaaaaaggtggttctcttcccaaagtggaaaccacgttcatcaattatgtgaagaat tgtttctggatgactgaccaggaggctattcaagatctctggcagtga