GENSCAN 1.0 Date run: 5-Nov-116 Time: 12:24:32 Sequence gi568815590r:27136849_27342493 : 205645 bp : 42.65% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 8538 8663 126 2 0 17 28 185 0.377 5.63 1.02 Intr + 22283 22432 150 1 0 86 19 137 0.158 6.04 1.03 Intr + 35051 35223 173 1 2 55 77 134 0.327 6.82 1.04 Term + 37332 37413 82 0 1 123 38 75 0.113 2.19 1.05 PlyA + 39897 39902 6 1.05 2.04 PlyA - 40164 40159 6 1.05 2.03 Term - 43769 43575 195 2 0 85 43 139 0.389 5.53 2.02 Intr - 50479 50351 129 1 0 89 57 64 0.416 3.37 2.01 Init - 63459 63295 165 0 0 109 82 52 0.643 6.38 2.00 Prom - 66134 66095 40 -4.45 3.14 PlyA - 66413 66408 6 1.05 3.13 Term - 70547 70430 118 1 1 62 36 95 0.267 -1.27 3.12 Intr - 72362 72244 119 2 2 77 79 54 0.319 1.74 3.11 Intr - 80832 80728 105 0 0 57 84 65 0.157 2.49 3.10 Intr - 83166 83037 130 2 1 23 91 94 0.164 2.98 3.09 Intr - 94211 94072 140 2 2 82 64 95 0.067 4.84 3.08 Intr - 95240 95067 174 2 0 11 81 110 0.016 1.81 3.07 Intr - 97261 97171 91 0 1 33 116 33 0.020 -0.32 3.06 Intr - 100057 100002 56 1 2 158 103 94 0.966 14.76 3.05 Intr - 103314 103123 192 0 0 84 100 274 0.931 26.97 3.04 Intr - 104414 104206 209 0 2 78 94 236 0.920 21.07 3.03 Intr - 104909 104829 81 0 0 99 53 108 0.651 7.09 3.02 Intr - 105644 105549 96 0 0 68 86 95 0.550 6.36 3.01 Init - 117568 117373 196 1 1 69 33 222 0.129 13.94 3.00 Prom - 125310 125271 40 -2.95 4.05 PlyA - 126789 126784 6 1.05 4.04 Term - 131145 130918 228 0 0 49 45 146 0.101 2.05 4.03 Intr - 140765 140611 155 0 2 48 57 129 0.420 4.67 4.02 Intr - 141971 141836 136 2 1 93 29 64 0.499 0.22 4.01 Init - 143361 143065 297 0 0 47 67 133 0.577 4.38 4.00 Prom - 143675 143636 40 -9.55 5.08 PlyA - 143732 143727 6 1.05 5.07 Term - 151279 150702 578 2 2 79 49 551 0.874 43.94 5.06 Intr - 151489 151360 130 1 1 28 10 82 0.377 -6.25 5.05 Intr - 152061 151879 183 0 0 95 67 124 0.553 10.06 5.04 Intr - 152432 152294 139 1 1 102 37 189 0.508 14.75 5.03 Intr - 153330 153308 23 0 2 102 131 35 0.996 4.62 5.02 Intr - 157462 157232 231 1 0 155 91 297 0.998 33.75 5.01 Init - 161696 161616 81 2 0 94 99 111 0.938 13.84 5.00 Prom - 170148 170109 40 -5.75 6.03 PlyA - 171384 171379 6 1.05 6.02 Term - 173833 173723 111 1 0 79 51 95 0.426 2.48 6.01 Init - 174387 173953 435 0 0 84 94 405 0.663 36.91 6.00 Prom - 176320 176281 40 -5.45 7.07 PlyA - 176583 176578 6 1.05 7.06 Term - 179852 179709 144 2 0 52 43 139 0.598 2.73 7.05 Intr - 185593 185519 75 1 0 64 94 92 0.949 6.19 7.04 Intr - 187457 187377 81 2 0 98 75 72 0.531 5.82 7.03 Intr - 188626 188470 157 0 1 85 44 50 0.217 -0.61 7.02 Intr - 190143 190006 138 2 0 94 75 46 0.336 2.56 7.01 Init - 194166 194033 134 0 2 51 116 132 0.956 11.96 7.00 Prom - 196761 196722 40 -9.65 8.03 PlyA - 197756 197751 6 1.05 8.02 Term - 198011 197887 125 0 2 118 55 69 0.935 4.17 8.01 Intr - 198440 198287 154 2 1 41 33 181 0.661 6.62 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 98415 98355 61 2 1 -50 32 243 0.904 1.20 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:27136849_27342493|GENSCAN_predicted_peptide_1|176_aa MAEVERGPFPRLSLGLGTLEIGTRVHCPEDLRPEKCRKNMTSCLNVVDADICFIQLSPSD HLPMGQLDATYVTCPTDPTPCMDCVDMLQQPTVFDLPALCCGTLDESPSYGDCKKGLVSS RSHFVTPRPSWRTPEENLVVVSKFRPNIKSPQALHWDNGMPTVDQGLLGPGPSDSV >gi568815590r:27136849_27342493|GENSCAN_predicted_CDS_1|531_bp atggcagaagtagaaagaggacccttcccacgactgtcacttggacttggtaccctggaa atcggcaccagggtccactgccctgaagacctacgacctgaaaaatgccgtaagaacatg acctcttgtctcaatgtggttgatgcagatatctgctttatacaactgtctcccagtgac cacctccctatgggacagctggatgcaacctacgtgacctgccccactgaccctacaccc tgcatggactgtgtagacatgctgcagcaacctactgtctttgatctgcctgcactttgc tgtgggacactggacgagtcaccatcctacggggattgtaaaaagggcttggtgtcctct cgctcacacttcgtaactccacgaccctcatggcgcacaccagaggaaaatcttgttgtg gtgagtaaatttagaccgaacataaaaagccctcaggccttgcactgggacaatgggatg cccaccgttgaccaagggctccttggtcctgggccttcagactcagtctga >gi568815590r:27136849_27342493|GENSCAN_predicted_peptide_2|162_aa MAHALRELLKLLSDTGGKKIHLILLLNWSLAVGMCWETLELVDMLGLCDGNEDYNRGPDL VIYEIHSPASLPFHSSDASVLGDIPPTTYSNISNSNQELLKFPEKVHNGVFWAYPYSREA LVSPNSHLGQEALWSPPGSFRSPGDSWAIPGDLGMMKRELEG >gi568815590r:27136849_27342493|GENSCAN_predicted_CDS_2|489_bp atggctcatgcattaagggagttactcaagctgttatctgatactggtgggaaaaagata catttaattctcttactcaactggtctctagctgtgggaatgtgttgggaaactttggaa ctggtggatatgctgggattgtgtgatggtaatgaagattacaatcgtggcccagacttg gtgatatacgaaatacactcccctgcttctctcccctttcactcctctgatgcctctgtc ctcggagatattcccccaaccacttactctaacatatcaaattcgaatcaggagttgctg aagtttcctgagaaagtgcataatggcgttttctgggcttacccttactccagagaggca cttgtgtctccaaactcacatcttgggcaagaggctctgtggagtcctcctgggtccttc aggtccccaggtgacagctgggccattcctggagatcttgggatgatgaagagggagctt gagggatga >gi568815590r:27136849_27342493|GENSCAN_predicted_peptide_3|568_aa MGFRQLESLDNKLENMNPEKGAERRSRMKDPEWEDEQSVSGPCGSLCLDSQTRGRKVHTE HAERVAYKEKMKELPLVSLFCSCFLADPLNKSSYKYEGWCGRQCRRKDESQRKDSADWRE RRAQADTVDLNWCVISDMEVIELNKCTSGQSFEVILKPPSFDGVPEFNASLPRRRDPSLE EIQKKLEAAEERRKYQEAELLKHLAEKREHEREVIQKAIEENNNFIKMAKEKLAQKMESN KENREAHLAAMLERLQEKDKHAEEVRKNKELKEEASRMGLGAAFLLKERSGVLPGSFICE DFTAALKQDEEIGTMKGKIREAYFKSFVPTPVAFSGGVMLPTGGTLEMCGNNFGGHKEQA GTVLPAFAIALRVTPSASLVLRPLNLDQAMLQGSLGLQLADGLFWDLASIMEEQYIIQSG QKTTYNGVGRRWAWNEWTIGKDTERQTEKSGLNLQSSIPFAHTLPTIVFLLMLIFQIPVT LDTVSLPSNLTFGLDNPLETISPYPSFARRGNSCLEREHDSSKIIQITQWSEVLGSLQAV AGTCSVQLEVTNIPAHVQSAFSSQQGPI >gi568815590r:27136849_27342493|GENSCAN_predicted_CDS_3|1707_bp atgggctttaggcaactcgagagtttggataacaaactggagaatatgaatccagagaaa ggagctgagcgcagatctaggatgaaagacccagagtgggaggatgagcagagtgtcagt ggaccatgtgggagtctgtgcctagacagccagactagagggagaaaagttcacactgag catgcagaaagagttgcctacaaagagaagatgaaggagctcccgctggtgtccttgttc tgctcctgcttcctggccgatcccctgaataagtcgtcctacaaatatgaaggctggtgt gggagacagtgtaggaggaaggatgaaagccagcggaaagacagtgctgactggagagaa agaagagctcaggcagacacggtggacctgaattggtgcgtcatttccgacatggaagtc atcgagctgaacaaatgcacctcgggccaatcctttgaagtcatcctgaagccaccctcc tttgatggggttcccgagttcaacgcctccctgccaaggcggcgagacccatccctggaa gagatccagaagaaactagaagcggctgaggagcgaaggaagtaccaggaagcggagctc ctgaaacacctagcagagaaacgggaacatgagagagaggtgatccaaaaggccattgag gaaaacaacaacttcatcaagatggctaaggaaaaactggcccagaagatggaatccaac aaggagaacagggaggcccacctcgccgccatgttggaacggctgcaagagaaggacaag cacgccgaggaggtgcggaaaaacaaggagctgaaggaagaggcctccaggatgggatta ggagcagcatttctgttgaaagagaggtcgggggtcttacctggaagcttcatttgtgag gactttactgcagcactgaagcaggatgaggaaatagggacaatgaagggcaaaataaga gaggcatattttaaatcctttgttcctacaccagtggctttcagtgggggtgtaatgctt cccacagggggcactttggaaatgtgtggcaataattttggtggacacaaggagcaggca ggcacagtgcttccagcctttgccatcgcactaagagttacaccatcagcttccttggtc ctaaggcctttgaatttggaccaagccatgctacagggatccctgggtctccagcttgca gatggattattttgggacttagcctccataatggaggaacagtacattatccagagtggc cagaagaccacgtacaatggggttggtagaagatgggcatggaatgaatggacaattggg aaggacactgaacgtcagactgagaaatctggacttaatctacagtcatccattccattt gctcacacattgccaaccatcgtctttctcctgatgctgatctttcagatcccagtgact cttgacactgtatccttgccttccaacctgacttttgggctggacaatcccttagagacc attagtccatatccttcatttgccagacgaggaaattcatgcttagagagagaacatgac tcttccaagatcatacagatcacacagtggtctgaggtcctcggttccttgcaggctgta gctgggacctgctctgtgcagctggaggtcactaacattcctgctcatgtgcaatctgca ttttcaagccagcaaggacccatctaa >gi568815590r:27136849_27342493|GENSCAN_predicted_peptide_4|271_aa MTGAGHCGVVRSVPCLRTLGGSTGKAIRELKAIRELWSPGSAREIRSRVARAHFRGGLKI QVKDLRGVNRGRIATKGRRRNRCKYKYVEQWCQKTPNSDLLFEVTRNPVFFCPSSCSTIL ATASSRKQLGREKGELVATWCILGGTEGGRGQEAEYEKGVSEIEQNCKETEKNTVDLKLI EERVKNRAVTKIEKNGVWDDLQINKVKVDFLNCPAHPKAKSVKQRTRALGLPSKKLQPHR QGPAQLLSFTSCLETAFNKCSLPKLNRQAAS >gi568815590r:27136849_27342493|GENSCAN_predicted_CDS_4|816_bp atgactggtgctggtcactgcggcgtggtgaggtcagtgccgtgtttgaggacattgggt ggaagcacagggaaggccattagagagctcaaggccattagagagctctggagtccagga tctgcacgggagatacggagtcgtgtggcacgggcacattttagaggggggctgaagatt caggttaaagacttacgaggagtgaatagaggcagaattgcaaccaaaggaaggcggcgt aacagatgcaaatacaaatacgttgaacagtggtgtcaaaagaccccgaattcagatctg ctttttgaagtgaccaggaatcctgttttcttctgtccttcatcctgcagcaccatccta gcgactgcaagctccaggaagcaactgggcagagagaagggggaactggtggcaacatgg tgcatcctgggaggaacagagggaggaagaggacaggaggctgagtatgagaagggagtg agtgagatagagcaaaactgcaaggaaactgagaagaacacagtggacttgaaactcata gaagagagagtcaagaacagagctgtaaccaagattgagaaaaatggagtttgggatgat ctacaaattaacaaggtgaaggtggactttctcaactgccctgcgcaccctaaggcgaaa tccgttaagcagagaaccagagccctcgggcttccttctaagaaactgcagccccacagg cagggcccagctcagctgctcagcttcacaagctgtctagaaacagctttcaacaaatgc agtctccctaaattaaatcgacaggctgccagttaa >gi568815590r:27136849_27342493|GENSCAN_predicted_peptide_5|454_aa MEHALREKAKAFWAMRRSYEAIAKHNQVEAAWLEGRIRQEFDKLREFLRVEEQAILDAMA EETRQKQLLADEKMKQLTEETEVLAHEIERLQMEMKEDDVSFLMKHKSRKRRLFCTMEPE PVQPGMLIDVCKYLGSLQYRVWKKMLASVESGERGCRQRGKLAFLPIQNCVLYDKRPPAS GLASPKATAAHMSHSVLVLRGGISLTQGMGRAVLTSLGRVTLGGTRGWDCRISSGGYSPT PATTPALGHSLHGLGARCDTVVVPFSFDPNTAAGWLSVSDDLTSVTNHGYRVQVENPERF SSAPCLLGSRVFSQGSHAWEVALGGLQSWRVGVVRVRQDSGAEGHSHSCYHDTRSGFWYV CRTQGVEGDHCVTSDPATSPLVLAIPRRLRVELECEEGELSFYDAERHCHLYTFHARFGE VRPYFYLGGARGAGPPEPLRICPLHISVKEELDG >gi568815590r:27136849_27342493|GENSCAN_predicted_CDS_5|1365_bp atggagcatgcactgcgggagaaggccaaggccttctgggccatgcggcgctcctatgag gccatcgccaagcacaatcaggtggaggctgcatggctggaaggccggatccggcaggag tttgataagcttcgcgagttcttgagagtggaggagcaggccattctggatgccatggcc gaggagacaaggcagaagcaacttctggccgacgagaagatgaagcagctcacagaggag acggaggtgctggcacatgagatcgagcggctgcagatggagatgaaggaggacgacgtt tcttttctcatgaaacacaagagccgaaaacgccgactcttctgcaccatggagccagag ccagtccagcccggcatgcttatcgatgtctgcaagtacctgggctccctgcagtaccgc gtctggaagaagatgcttgcatctgtggaatctggtgagcgggggtgccggcagagaggg aaactggctttcttgcctattcagaactgtgttttgtatgataaaagacctcctgcatct gggcttgcctctcctaaggcgactgcagcccacatgagccacagtgtcctggttttacgg ggaggaatatcactgactcagggaatgggccgtgctgtcctaacatcccttggtcgtgtc acactcggaggtaccaggggttgggactgccgcatatcttctgggggatacagcccaacc cctgccaccactccagccctgggccactccttgcatggacttggggccaggtgtgacaca gtggttgtacccttcagctttgaccccaacaccgcagctggctggctctccgtgtctgac gacctcaccagcgtcaccaaccatggctaccgcgtgcaggtggagaacccggaacgcttc tcctcggcgccctgcctgctgggctcccgtgtcttctcacagggctcgcacgcctgggag gtggcccttggggggctgcagagctggagggtgggcgtggtacgtgtgcgccaggactcg ggcgctgagggccactcacacagctgctaccacgacacacgctcgggcttctggtatgtc tgccgcacgcagggcgtggagggggaccactgcgtgacctcggacccagccacgtcgccc ctggtcctggccatcccacgccgcctgcgtgtggagctggagtgtgaggagggcgagctg tctttctatgacgcggagcgccactgccacctgtacaccttccacgcccgctttggggag gttcgcccctacttctacctggggggtgcacggggcgccgggcctccagagcctttgcgc atctgccccttgcacatcagtgtcaaggaagaactggatggctga >gi568815590r:27136849_27342493|GENSCAN_predicted_peptide_6|181_aa MERSPDVSPGPSRSFKEELLCAVCYDPFRDAVTLRCGHNFCRGCVSRCWEVQVSPTCPVC KDRASPADLRTNHTLNNLVEKLLREEAEGARWTSYRFSRVCRLHRGQLSLFCLEDKELLC CSCQADPRHQGHRVQPVKDTAHDFRHAGPHLSDPGSQGWLLLQGRPRGPCWEKRMRTESD R >gi568815590r:27136849_27342493|GENSCAN_predicted_CDS_6|546_bp atggagcggagtcccgacgtgtcccccgggccttcccgctccttcaaggaggagttgctc tgcgccgtctgctacgaccccttccgcgacgcagtcactctgcgctgcggccacaacttc tgccgcgggtgcgtgagccgctgctgggaggtgcaggtgtcgcccacctgcccagtgtgc aaagaccgcgcgtcacccgccgacctgcgcaccaaccacaccctcaacaacctggtggag aagctgctgcgcgaggaggccgagggcgcgcgctggaccagctaccgcttctcgcgtgtc tgccgcctgcaccgcggacagctcagcctcttctgcctcgaggacaaggagctgctgtgc tgctcctgccaggccgacccccgacaccaggggcaccgcgtgcagccggtgaaggacact gcccacgactttcggcatgcaggaccccacctctccgacccaggttctcagggctggctg cttcttcaaggtcggcccaggggcccgtgctgggagaagagaatgcgaacggaaagtgac agatga >gi568815590r:27136849_27342493|GENSCAN_predicted_peptide_7|242_aa MLANTAKLGNGKACLPQVLAIADLGFSHTNWLPHKSSNHLSAGKSPLVQGWIVPCTFPWQ PSLNHPVSHTALVSRGHATLTLSPHLAPGRRVGCVVCVCRLQPLTINSEIDAATNSLSAG QTWGAISAIPPAANGILGCYVPLICHVPSTQHFGGNLGDYTAVSRSEGGQLVFTTVYEVA SPIPGLQKGKGSLDKASVKVHRTTVISSKIYLVTLMNKVDYVCSARMPVSEDAAASPVAD FI >gi568815590r:27136849_27342493|GENSCAN_predicted_CDS_7|729_bp atgttagcgaacacagccaagctaggaaacgggaaggcttgcctcccgcaggtattggcc attgctgatcttggttttagtcacactaactggctgcctcataaaagcagcaaccacctt agtgctgggaaaagccctctagtgcagggttggatagtaccctgcaccttcccctggcag ccctccctcaaccacccggtatcccacactgccctggtgagcagaggccatgccacactc accttgtccccacaccttgcacctggccgcagggtggggtgtgttgtctgtgtgtgccgg ctccagcctcttaccataaactctgaaatagacgctgccaccaattccctgagcgccggg cagacctggggggcgatctctgccattcccccggccgcaaatgggatcctcggatgttat gtgccccttatttgtcacgttcccagcactcagcattttggaggtaatcttggtgactac acagcggtcagcaggtcagaaggtggtcagcttgtcttcacaaccgtctatgaagttgct agtcctattcctggtttgcagaaaggaaaaggaagcttggataaggcctcagtaaaggta catcggactacagtgatttcctcaaagatttacttggtaactctgatgaacaaggtcgat tatgtctgcagtgccaggatgcctgtttccgaagatgctgctgccagccctgtggctgac tttatttag >gi568815590r:27136849_27342493|GENSCAN_predicted_peptide_8|92_aa GQVKTGRGLRGCTGDNADKRGDCRTSVCKSRASQGQKAQSPAAKAMSVYTGGNLHLQDRQ EGHPNHRARIQETRDVAPILVATGCCDTHCGE >gi568815590r:27136849_27342493|GENSCAN_predicted_CDS_8|279_bp ggtcaagtaaagacagggagagggctcagaggctgcactggtgacaatgcagataaacga ggtgattgtcgaacaagtgtttgcaaatctcgagcttcccagggccagaaagctcagagt ccagcagccaaggctatgtctgtgtacactggaggaaatctccaccttcaggacaggcaa gaagggcatcctaaccacagggcaagaattcaagagactcgggatgtggcccctattcta gttgcaactggctgttgcgacacccactgcggggaatga