GENSCAN 1.0 Date run: 5-Nov-116 Time: 18:08:35 Sequence gi568815588r:90812869_91020349 : 207481 bp : 40.64% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 9969 9995 27 2 0 102 87 12 0.424 2.15 1.02 Term + 12322 12495 174 0 0 8 48 185 0.668 3.28 1.03 PlyA + 12548 12553 6 1.05 2.03 PlyA - 13076 13071 6 1.05 2.02 Term - 18982 18745 238 0 1 37 48 159 0.323 1.46 2.01 Init - 19791 19532 260 0 2 66 49 157 0.663 6.41 2.00 Prom - 21620 21581 40 -5.15 3.06 PlyA - 22747 22742 6 1.05 3.05 Term - 30001 29951 51 1 0 79 36 105 0.490 0.85 3.04 Intr - 34842 34764 79 2 1 77 79 44 0.048 1.03 3.03 Intr - 44858 44265 594 1 0 -22 110 787 0.311 60.66 3.02 Intr - 45283 44885 399 0 0 59 93 318 0.962 22.10 3.01 Init - 49811 49804 8 2 2 53 98 0 0.302 -2.19 3.00 Prom - 57261 57222 40 -5.45 4.00 Prom + 58736 58775 40 -9.25 4.01 Init + 59119 59200 82 0 1 89 84 137 0.946 14.65 4.02 Intr + 62001 62056 56 1 2 80 98 64 0.939 4.38 4.03 Intr + 62690 62746 57 1 0 59 96 51 0.650 1.16 4.04 Intr + 66195 66266 72 2 0 89 84 24 0.588 0.78 4.05 Intr + 72944 73033 90 1 0 123 103 10 0.869 5.27 4.06 Intr + 81907 81974 68 0 2 46 69 74 0.376 -1.92 4.07 Intr + 83445 83524 80 0 2 52 99 100 0.959 5.78 4.08 Term + 87702 87811 110 1 2 86 48 125 0.951 5.99 4.09 PlyA + 87877 87882 6 1.05 5.10 PlyA - 88782 88777 6 1.05 5.09 Term - 100108 99998 111 1 0 36 54 127 0.829 1.68 5.08 Intr - 102773 102675 99 2 0 102 75 70 0.886 6.49 5.07 Intr - 103060 102914 147 1 0 30 70 165 0.858 8.41 5.06 Intr - 103401 103303 99 0 0 49 61 77 0.579 0.49 5.05 Intr - 104962 104864 99 1 0 22 54 147 0.942 4.09 5.04 Intr - 106104 105997 108 0 0 45 94 128 0.992 8.66 5.03 Intr - 106400 106263 138 2 0 60 59 120 0.983 6.04 5.02 Intr - 107480 107301 180 2 0 130 78 189 0.999 21.24 5.01 Init - 108159 108133 27 0 0 104 84 14 0.589 2.51 5.00 Prom - 109448 109409 40 -2.95 6.06 PlyA - 109849 109844 6 1.05 6.05 Term - 115585 115385 201 1 0 31 43 196 0.451 5.81 6.04 Intr - 116605 116495 111 1 0 63 42 88 0.393 1.36 6.03 Intr - 121074 120983 92 1 2 90 44 25 0.018 -2.91 6.02 Intr - 129013 128954 60 2 0 66 99 53 0.498 1.99 6.01 Init - 130398 130248 151 0 1 88 32 100 0.438 4.65 6.00 Prom - 136930 136891 40 -2.95 7.07 PlyA - 137211 137206 6 1.05 7.06 Term - 141490 141218 273 1 0 61 38 200 0.634 6.79 7.05 Intr - 142603 142364 240 1 0 59 74 135 0.323 6.02 7.04 Intr - 148395 148238 158 1 2 132 58 11 0.007 1.31 7.03 Intr - 166252 166142 111 2 0 49 121 41 0.049 2.93 7.02 Intr - 171269 171070 200 1 2 110 84 39 0.280 3.77 7.01 Init - 174375 174218 158 0 2 78 59 77 0.141 3.23 7.00 Prom - 178493 178454 40 -1.25 8.03 PlyA - 178545 178540 6 1.05 8.02 Term - 187678 187386 293 2 2 74 41 205 0.113 8.92 8.01 Init - 192288 191862 427 0 1 54 16 234 0.027 9.31 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 200103 200046 58 1 1 125 101 50 0.873 7.02 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:90812869_91020349|GENSCAN_predicted_peptide_1|66_aa MEPHLKEAKRILQDLIHNHQGSTSMSLQACTSMSLQAYTSTRLPPNADMAAVTKNLDHNI QVPSNT >gi568815588r:90812869_91020349|GENSCAN_predicted_CDS_1|201_bp atggagccacatctaaaagaggccaagagaattcttcaggatctcatccacaaccaccaa ggcagtacctctatgagtctgcaggcttgtacctctatgagtctgcaggcttatacctct acaaggctgccgcctaatgcagacatggcggcagtgaccaaaaacttagatcacaacatc caagtcccttcaaatacctga >gi568815588r:90812869_91020349|GENSCAN_predicted_peptide_2|165_aa MLEEPFSPLLHCGSPSLGWLRLEPAPSACGKVWRERHEREPGLCMELEGQLEFRVGVGLA GPHSEQPAGPATPSSEGLSTWASSCRGDCKYTNQHSVPSSGFVNTPVDTLYLANLVGTWI TFVSSSGIVNAPISTLSKRTNQLSVKQTNRLSVKWTNPQVVGGAR >gi568815588r:90812869_91020349|GENSCAN_predicted_CDS_2|498_bp atgcttgaggagcccttcagcccactgctgcactgtgggagcccttctctgggctggctg aggctggagccagctccctcagcttgtgggaaggtgtggagagagaggcacgagcgggaa ccggggctgtgcatggagcttgagggccagctggagttccgggtgggcgtgggcttggcg ggcccgcactcggagcagccggctggccctgccaccccaagcagtgaggggcttagcacc tgggccagcagctgcagaggggattgtaaatacaccaatcagcactctgtgcctagctca gggtttgtaaatacaccagtcgacactctgtatctagctaatctagtggggacgtggata acttttgtgtctagctcagggattgtaaacgcaccaatcagcaccctgtcaaaacggacc aatcaactctctgtaaaacagaccaatcggctctctgtaaaatggaccaatccgcaggtt gtgggtggagccagataa >gi568815588r:90812869_91020349|GENSCAN_predicted_peptide_3|376_aa MQQSARERAAAGGGGGGDAARLPQGSGGGGGGGGGREGRGALRNFCRCRPGALGAAELLL RTVELPAEPSPSSLDSARPRRPSGAEAPPAGSPREAALPRYPQRPGVRGVRGSETEPGAP QRSARAPWLGRAERNRRRRVAGAPAPWAAAHGGAMMDVNSSGRPDLYGHLRSFLLPEVGR GLPDLSPDGGADPVAGSWAPHLLSEVTASPAPTWDAPPDNASGCGEQINYGRVEKVVIGS ILTLITLLTIAGNCLVVISVCFVKKLRQPSNYLIVSLALADLSVAVAVMPFVSVTDLIGG KWIFGHFFCNVFIAMDVMCCTASIMTLCVISIDRNSSALFARAVIFSSDWYLAEALLPRQ NTDISKAASDVKVLAA >gi568815588r:90812869_91020349|GENSCAN_predicted_CDS_3|1131_bp atgcaacagagcgcgagggagcgggcagcggccggaggcggcggcggcggggacgcggcg cggctgccgcaggggagcggcggcggcggcggcggcggcggcgggcgcgaggggcggggc gcactccgcaacttctgccgctgccgcccgggcgctctcggcgcagccgagctgctgctg cgaactgtggagctgccagcggagcccagcccgagctccctcgactccgcgcgtcccagg cgtccgtcgggggccgaagcgcctcccgcagggagtccccgtgaggccgcgctgccgcgt tacccgcagcggccgggggtccggggagtgcggggctccgagacagagccaggcgccccc cagcggtcggcgcgggccccatggctgggccgggcggagcggaaccgccggaggcgcgtg gccggggcgccggctccatgggcagcggcacacggcggcgcgatgatggacgttaacagc agcggccgcccggacctctacgggcacctccgctctttccttctgccagaagtggggcgc gggctgcccgacttgagccccgacggtggcgccgacccggtcgcgggctcctgggcgccg cacctgctgagcgaggtgacagccagcccggcgcccacctgggacgcgcccccggacaat gcctccggctgtggggaacagatcaactacggcagagtcgagaaagttgtgatcggctcc atcctgacgctcatcacgctgctgacgatcgcgggcaactgcctggtggtgatctccgtg tgcttcgtcaagaagctccgccagccctccaactacctgatcgtgtccctggcgctggcc gacctctcggtggctgtggcggtcatgcccttcgtcagcgtcaccgacctcatcgggggc aagtggatctttggacactttttctgtaatgtcttcatcgccatggacgtcatgtgctgc acggcctcgatcatgaccctgtgcgtgatcagcattgacaggaactcatctgccttattt gcaagagctgtaatatttagctctgattggtatctggctgaagccttgcttcctcgccag aacaccgacattagcaaagctgccagtgatgtgaaggtcttagcagcttag >gi568815588r:90812869_91020349|GENSCAN_predicted_peptide_4|204_aa MAVFADLDLRAGSDLKALRGLVETAAHLGYSVVAINHIVDFKEKKQEIEKPVAVSELFTT LPIVQRATSSRARLYDVVAVFPKTEKLFHIACTHLDVDLVCITVTEKLPFYFKRPPINVA IDRGLAFELVYSPAIKDSTMRRGLLFGLSESDAKAAVSTNCRAALLHGETRKTAFGIIST VKKPRPSEGDEDCLPASKKAKCEG >gi568815588r:90812869_91020349|GENSCAN_predicted_CDS_4|615_bp atggcggtgtttgcagatttggacctgcgagcgggttctgacctgaaggctctgcgcgga cttgtggagacagccgctcaccttggctattcagttgttgctatcaatcatatcgttgac tttaaggaaaagaaacaggaaattgaaaaaccagtagctgtttctgaactcttcacaact ttgccaattgtacagagagcaacttcttcaagggcccggctctatgatgttgttgcagtt tttccaaagacagaaaagctttttcatattgcttgcacacatttagatgtggatttagtc tgcataactgtaacagagaaactaccattttacttcaaaagacctcctattaatgtggcg attgaccgaggcctggcttttgaacttgtctatagccctgctatcaaagactccacaatg agaagaggcttgctgtttgggctctctgaaagtgacgccaaggctgcggtgtccaccaac tgccgagcagcgcttctccatggagaaactagaaaaactgcttttggaattatctctaca gtgaagaaacctcggccatcagaaggagatgaagattgtcttccagcttccaagaaagcc aagtgtgagggctga >gi568815588r:90812869_91020349|GENSCAN_predicted_peptide_5|335_aa MMVLKVEELVTGKKNGNGEAGEFLPEDFRDGEYEAAVTLEKQEDLKTLLAHPVTLGEQQW KSEKQREAELKKKKLEQRSKLENLEDLEIIIQLKKRKKYRKTKVPVVKEPEPEIITEPVD VPTFLKAALENKLPVVEKFLSDKNNPDVCDEYKRTALHRACLEGHLAIVEKLMEAGAQIE FRDMLESTAIHWASRGGNLDVLKLLLNKGAKISARDKPLSSFLANQQRFPIFQLLSTALH VAVRTGHYECAEHLIACEADLNAKDREGDTPLHDAVRLNRYKMIRLLIMYGADLNIKNCA GKTPMDLVLHWQNGTKAIFDSLRENSYKTSRIATF >gi568815588r:90812869_91020349|GENSCAN_predicted_CDS_5|1008_bp atgatggtactgaaagtagaggaactggtcactggaaagaagaatggcaatggggaggca ggggaattccttcctgaggatttcagagatggagagtatgaagctgctgttactttagag aagcaggaggatctgaagacacttctagcccaccctgtgaccctgggggagcaacagtgg aaaagcgagaaacaacgagaggcagagctcaaaaagaaaaaactagaacaaagatcaaag cttgaaaatttagaagaccttgaaataatcattcaactgaagaaaaggaaaaaatacagg aaaactaaagttccagttgtaaaggaaccagaacctgaaatcattacggaacctgtggat gtgcctacgtttctgaaggctgctctggagaataaactgccagtagtagaaaaattcttg tcagacaagaacaatccagatgtttgtgatgagtataaacggacagctcttcatagagca tgcttggaaggacatttggcaattgtggagaagttaatggaagctggagcccagatcgaa ttccgtgatatgcttgaatccacagccatccactgggcaagccgtggaggaaacctggat gttttaaaattgttgctgaataaaggagcaaaaattagcgcccgagataagcctctgtcc tcattcctagcgaatcagcagcgttttccaattttccagttgctcagcacagcgctgcat gtggcggtgaggactggccactatgagtgcgcggagcatcttatcgcctgtgaggcagac ctcaacgccaaagacagagaaggagataccccgttgcatgatgcggtgagactgaaccgc tataagatgatccgactcctgattatgtatggcgcggatctcaacatcaagaactgtgct gggaagacgccgatggatctggtgctacactggcagaatggaaccaaagcaatattcgac agcctcagagagaactcctacaagacctctcgcatagctacattctga >gi568815588r:90812869_91020349|GENSCAN_predicted_peptide_6|204_aa MDKAGGHYSQQTNTGKENQTPHVLTYKWELNDENTCTQGGEHHTLGPVRGASLILNKSIS ELKADWATESEMRFHHVAQVGLELLASKDPPAQLPKAPRLQLPAPKAGAAERALSTGALF CSIPVALGKPLKNTDVRVCVQLQNWTRRCESPSNGPKAYVLKSLTIPSYENGFDPVDEGS SSNGSSSNGSSMAVISPFSFFPFD >gi568815588r:90812869_91020349|GENSCAN_predicted_CDS_6|615_bp atggataaagctggaggccattattctcagcaaactaacacaggaaaagaaaaccaaaca ccacatgttctcacttataagtgggagctgaacgatgagaacacatgcacacaaggaggg gaacaccacacactggggcctgtcagaggtgcctccctaatactgaataaaagcatctca gaacttaaagccgattgggccacagaatcagagatgaggtttcaccatgttgcccaagtt ggtcttgaacttctggcctcaaaggatcctccagctcaactacccaaagctccaagacta cagctgcctgcaccaaaagcaggtgcagctgagagggcactgtccacaggagccttattc tgtagcattcctgtggctctagggaagcctctgaaaaacactgatgtcagggtgtgtgta cagttacagaactggactagacgatgtgaaagcccttctaatggtccaaaagcctatgtt ctgaagagcttaacaatacccagttatgaaaatggttttgatccagtggatgaaggtagt tcaagtaatggcagttcaagtaatggcagttcaatggcagtaatttcaccttttagcttt tttccctttgactag >gi568815588r:90812869_91020349|GENSCAN_predicted_peptide_7|379_aa MLALKKAKENMSECVHVCVSVVCQGPWEKDSGTKICTQVFTGECCGETTSVKGQMSVSRC ALPSPHQGRALQVLVIMEPARVRSSGSLSAVLMSVTCGFLFFYSQLTGVSHSVTKNMDPL NLLRQIFPLHWKACRSEKVRTIWEEGVGSSFGVAQKGNFGLLHDGPDVSDPVVWCPSLVN AWSFITLHSFSFFIYKIIMERLLGLAINLSLQPIYQEIPLGTYTTRALGFKHKNWAAIWA DTELAAGVFFTPQQHLEPQQEGTVHSPGKGSEAREPSGLAQQIPPQRRPDHNSLPAREQN WMENEFDELTEVSFRKWVITNSSELKEHVLTQCKEAENLDKRLEELLTRLTSLEKNINDL MELKNTAQELPEAYTSINS >gi568815588r:90812869_91020349|GENSCAN_predicted_CDS_7|1140_bp atgctggcactaaagaaggcaaaagagaatatgtctgagtgtgtgcatgtctgtgtgtct gtggtatgtcaaggtccctgggaaaaagattctgggacaaagatttgcacgcaggtgttt actggggagtgctgtggcgaaacaacatctgtaaaagggcagatgtcggttagcaggtgt gctctgccctctcctcaccaaggcagagctttgcaggtgcttgtgattatggagccagca agagttaggtcttctgggagcctttcagcggtactaatgtcagttacttgtgggttccta tttttttacagtcaattaactggagtttcacacagtgttacgaaaaacatggatccatta aacttacttaggcagatcttccctctgcactggaaggcctgcagatctgagaaggtgagg acaatttgggaagaaggtgtgggaagctcatttggggtggctcagaaaggtaactttggg ctacttcatgatggcccagatgtctctgatcctgtagtgtggtgccctagcctggtcaac gcttggagttttataaccctccatagcttcagtttcttcatctataaaataatcatggag cgtctactaggtcttgcaattaatctgtctttgcaacccatataccaggagattcccttg ggtacctataccaccagggccctgggtttcaagcacaaaaactgggcagccatttgggca gacactgagctagctgcaggagtttttttcacaccccagcagcacctggaaccccagcaa gaaggaaccgttcactcccctggaaaggggtctgaagccagggagccaagtggtctagct cagcagatcccaccccaacggagaccagatcacaactccttgccagcaagggaacaaaac tggatggagaatgagtttgatgaattgacagaagtaagcttcagaaagtgggtaataaca aactcttccgagctaaaggagcatgttctaacccaatgcaaggaagctgagaaccttgat aaaaggttagaggaattgttaactagactaaccagtttagagaagaacataaatgacctg atggagctgaaaaacacagcacaagaacttcctgaagcatacacaagtatcaatagctga >gi568815588r:90812869_91020349|GENSCAN_predicted_peptide_8|239_aa MWECLEFPRDLEGSEDRKMWESLELPRDLLNGFDQNADSDRDNKVWAEVVSDGDEELVGN WSKGRSCYAKRLVAFCPCPRDLWNFELQRDNVGYLVEEISKQQSIQEVAGHKSSKILQPD DAVEKKKPFSREKFKLATEICRTAVAVPTDYCTDSGTAELQNKTKCSAKFLLATWSKDGL SSSVFAELTINHGYLSLFELSLATMGQDPSESLFQVGLPNQHRGTSTPPVIETIFDFSV >gi568815588r:90812869_91020349|GENSCAN_predicted_CDS_8|720_bp atgtgggaatgtttggaatttcctagagacttggagggctcagaagataggaagatgtgg gaaagtttggaacttcctagggacttgttgaatggctttgaccaaaatgctgacagtgat agggacaataaagtctgggctgaggtggtctcagatggagatgaggagcttgttgggaac tggagtaaaggtcgttcttgctatgcaaagagactggtggcattttgcccctgccctaga gatctgtggaactttgaacttcagagagataacgtggggtatctggtggaagaaatttct aagcaacaaagcattcaagaggtggcagggcataaaagttcgaaaattttgcagcctgat gatgcagtagaaaagaaaaaaccattttctagggagaaattcaagctagctacagaaatt tgcagaactgctgtagctgtccctactgactactgcacagactcaggcactgctgagctc caaaacaaaacaaaatgttccgcaaaattcctgctggccacttggtcaaaggatggtctc tcttcatccgtctttgctgagctcactataaatcatggatatctgtctctctttgagctg tctctggctacgatgggccaggacccttcagagagcctctttcaggttggacttcctaat cagcatcgtggaacatccactcctccagtaatagaaaccatatttgatttttctgtttaa