GENSCAN 1.0 Date run: 8-Nov-116 Time: 03:15:46 Sequence gi568815597f:24231550_24454485 : 222936 bp : 45.97% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5028 5288 261 2 0 73 76 156 0.764 9.97 1.02 Intr + 5369 5482 114 1 0 29 75 80 0.584 1.44 1.03 Intr + 19345 19446 102 0 0 61 56 94 0.044 3.87 1.04 Intr + 22976 23080 105 1 0 53 80 106 0.956 6.71 1.05 Intr + 25362 25412 51 2 0 73 111 10 0.486 0.90 1.06 Intr + 25871 25972 102 1 0 90 62 33 0.494 1.27 1.07 Intr + 31116 31173 58 2 1 53 94 28 0.018 -1.64 1.08 Term + 43939 44039 101 2 2 90 52 74 0.000 2.29 1.09 PlyA + 44786 44791 6 1.05 2.00 Prom + 45466 45505 40 -1.76 2.01 Init + 66506 66545 40 1 1 83 94 31 0.368 3.56 2.02 Intr + 70118 70226 109 0 1 31 91 56 0.040 -0.46 2.03 Intr + 90311 90476 166 2 1 52 100 76 0.001 5.16 2.04 Intr + 97647 97738 92 2 2 113 64 10 0.016 -0.11 2.05 Intr + 99883 100063 181 1 1 73 88 280 0.749 26.37 2.06 Intr + 103096 103157 62 0 2 117 78 37 0.911 3.13 2.07 Intr + 104933 105278 346 2 1 115 78 318 0.994 28.90 2.08 Intr + 105529 105602 74 0 2 80 88 50 0.997 2.40 2.09 Intr + 106087 106240 154 1 1 134 100 193 0.999 25.17 2.10 Intr + 106443 106554 112 2 1 69 72 217 0.997 18.15 2.11 Intr + 108119 108213 95 0 2 96 86 168 0.993 17.08 2.12 Intr + 110566 110724 159 0 0 113 34 214 0.808 18.68 2.13 Intr + 111145 111223 79 0 1 155 49 121 0.999 14.02 2.14 Intr + 111343 111476 134 2 2 81 60 173 0.997 14.16 2.15 Intr + 113348 113382 35 1 2 76 80 16 0.339 -3.38 2.16 Intr + 115004 115092 89 2 2 87 45 114 0.995 6.61 2.17 Intr + 115919 116004 86 0 2 63 103 111 0.881 9.64 2.18 Intr + 118509 118573 65 2 2 117 96 90 0.997 10.22 2.19 Term + 122825 122939 115 2 1 110 48 227 0.967 18.94 2.20 PlyA + 123737 123742 6 1.05 3.10 PlyA - 124299 124294 6 1.05 3.09 Term - 127070 126994 77 0 2 125 50 142 0.994 12.10 3.08 Intr - 129492 129302 191 2 2 80 71 101 0.863 6.83 3.07 Intr - 138290 138125 166 0 1 90 86 49 0.721 3.92 3.06 Intr - 141731 141666 66 0 0 66 113 17 0.497 0.88 3.05 Intr - 142261 142153 109 1 1 88 115 -4 0.805 2.16 3.04 Intr - 148274 148104 171 2 0 109 64 56 0.876 5.44 3.03 Intr - 152454 152353 102 0 0 123 81 21 0.938 5.27 3.02 Intr - 153927 153734 194 1 2 72 29 112 0.785 2.91 3.01 Init - 169839 169770 70 0 1 94 83 53 0.762 6.61 3.00 Prom - 172766 172727 40 -5.16 4.00 Prom + 177516 177555 40 1.34 4.01 Init + 182059 182168 110 0 2 87 -23 125 0.172 -0.79 4.02 Intr + 182188 182225 38 1 2 85 78 48 0.284 1.41 4.03 Intr + 183729 183831 103 1 1 57 77 71 0.257 2.13 4.04 Intr + 184476 184567 92 0 2 57 59 86 0.305 2.24 4.05 Intr + 184593 184750 158 1 2 27 81 90 0.117 1.93 4.06 Intr + 204791 204880 90 1 0 83 -17 110 0.248 0.19 4.07 Intr + 206611 206706 96 0 0 31 100 43 0.233 0.01 4.08 Intr + 208623 208691 69 2 0 87 111 77 0.963 9.28 4.09 Intr + 210506 210677 172 1 1 43 82 232 0.997 17.62 4.10 Intr + 213636 213695 60 1 0 93 80 45 0.662 2.91 4.11 Intr + 217932 218050 119 1 2 122 54 147 0.648 14.88 4.12 Intr + 221859 221955 97 2 1 122 111 125 0.970 17.78 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 88078 87969 110 2 2 123 54 79 0.903 6.67 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:24231550_24454485|GENSCAN_predicted_peptide_1|297_aa MEASSRHTSRSWGDSSTEGIYGILAFMERMQALSSFFVVTAAFNPHNNWCFGNCYYFHFT HEDDTAQGGKNSSKFHAREMRSWDLIKALTHIPDYGGPTLSPVLDARDTDNKTHSLSSSK LSVYQNTAVKTSPDLSSADFEALTSAITQSDQNGHKALCDADALCGHGHVCIILMLHLVL RLRQPSIESCPDFQDYVWFQFLPPYVIIINLKFCEWIILPHLADVATEDQKVEVISLTQC QVSTQIISRVKILSLITSAKSLRAVPTKQAVILIFRWLNQSMTLNEKPKPFPRICKN >gi568815597f:24231550_24454485|GENSCAN_predicted_CDS_1|894_bp atggaggcttccagccgccacacttctaggagctggggagacagcagcacagagggcatt tatggaatacttgcatttatggaacgcatgcaggctctgagctcattctttgttgtgact gctgcctttaaccctcacaataactggtgctttgggaactgttattatttccactttacc catgaggacgacacggcccagggaggaaagaactcgtccaagttccatgccagggagatg cggagctgggatttgatcaaggcacttactcacatacctgattatggagggcccactctg agccctgtgctagatgccagagatacagacaataaaacacactccctgtcctcctcaaag ctctcggtctaccagaacacagctgtgaagacttcacctgatttgtcatctgcagatttt gaggcgctaactagtgccataactcagagtgaccaaaatggtcacaaggccttgtgtgac gcagatgctctttgtggacacgggcacgtgtgcatcatcttgatgctccacctggtgctt aggttacggcagccatcaatcgagtcctgtccagacttccaagactatgtctggtttcag tttctaccaccatatgtaataatcatcaacttgaaattctgtgaatggatcatattaccc catcttgcagatgtggccactgaggatcagaaagttgaagtgatttcattaacacagtgc caagtttcaacacagataatctctcgtgtcaagatccttagcttaattacatctgcaaag tcccttcgagcagtccccaccaagcaggcagtcatcttgatcttcagatggctaaatcag agcatgacactgaatgaaaaacccaagcccttccccaggatctgcaagaactga >gi568815597f:24231550_24454485|GENSCAN_predicted_peptide_2|730_aa MCFCTAELAPGEAANPSSYSENVTERILDPSSPAPEPVSSSTMLTCFQYSALGAWSPQPW AAPGYRRAQGILGCGRGRRKSPPTAWVSQENSRRPRAAQRRVFLKSPAPHTLGPGGMGDT VLDEAAGRAAASCMLRSVRLLKNDPVNLQKFSYTSEDEAWKTYLENPLTAATKAMMRVNG DDDSVAALSFLYDYYMGPKEKRILSSSTGGRNDQGKRYYHGMEYETDLTPLESPTHLMKF LTENVSGTPEYPDLLKKNNLMSLEGALPTPGKAAPLPAGPSKLEAGSVDSYLLPTTDMYD NGSLNSLFESIHGVPPTQRWQPDSTFKDDPQESMLFPDILKTSPEPPCPEDYPSLKSDFE YTLGSPKAIHIKSGESPMAYLNKGQFYPVTLRTPAGGKGLALSSNKVKSVVMVVFDNEKV PVEQLRFWKHWHSRQPTAKQRVIDVADCKENFNTVEHIEEVAYNALSFVWNVNEEAKVFI GVNCLSTDFSSQKGVKGVPLNLQIDTYDCGLGTERLVHRAVCQIKIFCDKGAERKMRDDE RKQFRRKVKCPDSSNSGVKGCLLSGFRGNETTYLRPETDLETPPVLFIPNVHFSSLQRSG GAAPSAGPSSSNRLPLKRTCSPFTEEFEPLPSKQAKEGDLQRVLLYVRRETEEVFDALML KTPDLKGLRNAISEKYGFPEENIYKVYKKCKRGILVNMDNNIIQHYSNHVAFLLDMGELD GKIQIILKEL >gi568815597f:24231550_24454485|GENSCAN_predicted_CDS_2|2193_bp atgtgcttttgtaccgctgaacttgcccctggtgaggcagcaaatccaagttcctatagt gagaatgtgacagagaggattttggacccaagtagtccagctccagaacctgtttcctcg tccaccatgctgacctgctttcagtacagcgctctgggtgcctggagcccgcagccctgg gcagccccgggctaccgcagggcgcaagggatcctgggctgcggccgagggcgccggaag tcgccgccgaccgcctgggtctcgcaggaaaacagccggcgcccgcgagctgcccagcgt cgggttttcctgaagagcccagctcctcacaccttggggcctggtgggatgggagacact gtcctggatgaagccgctgggagagctgccgcctcctgtatgctgaggtctgtgcggctg ctaaagaacgacccagtcaacttgcagaaattctcttacactagtgaggatgaggcctgg aagacgtacctagaaaacccgttgacagctgccacaaaggccatgatgagagtcaatgga gatgatgacagtgttgcggccttgagcttcctctatgattactacatgggtcccaaggag aagcggatattgtcctccagcactgggggcaggaatgaccaaggaaagaggtactaccat ggcatggaatatgagacggacctcactccccttgaaagccccacacacctcatgaaattc ctgacagagaacgtgtctggaaccccagagtacccagatttgctcaagaagaataacctg atgagcttggagggggccttgcccacccctggcaaggcagctcccctccctgcaggcccc agcaagctggaggccggctctgtggacagctacctgttacccaccactgatatgtatgat aatggctccctcaactccttgtttgagagcattcatggggtgccgcccacacagcgctgg cagccagacagcaccttcaaagatgacccacaggagtcgatgctcttcccagatatcctg aaaacctccccggaacccccatgtccagaggactaccccagcctcaaaagtgactttgaa tacaccctgggctcccccaaagccatccacatcaagtcaggcgagtcacccatggcctac ctcaacaaaggccagttctaccccgtcaccctgcggaccccagcaggtggcaaaggcctt gccttgtcctccaacaaagtcaagagtgtggtgatggttgtcttcgacaatgagaaggtc ccagtagagcagctgcgcttctggaagcactggcattcccggcaacccactgccaagcag cgggtcattgacgtggctgactgcaaagaaaacttcaacactgtggagcacattgaggag gtggcctataatgcactgtcctttgtgtggaacgtgaatgaagaggccaaggtgttcatc ggcgtaaactgtctgagcacagacttttcctcacaaaagggggtgaagggtgtccccctg aacctgcagattgacacctatgactgtggcttgggcactgagcgcctggtacaccgtgct gtctgccagatcaagatcttctgtgacaagggagctgagaggaagatgcgcgatgacgag cggaagcagttccggaggaaggtcaagtgccctgactccagcaacagtggcgtcaagggc tgcctgctgtcgggcttcaggggcaatgagacgacctaccttcggccagagactgacctg gagacgccacccgtgctgttcatccccaatgtgcacttctccagcctgcagcgctctgga ggggcagccccctcggcaggacccagcagctccaacaggctgcctctgaagcgtacctgc tcgcccttcactgaggagtttgagcctctgccctccaagcaggccaaggaaggcgacctt cagagagttctgctgtatgtgcggagggagactgaggaggtgtttgacgcgctcatgttg aagaccccagacctgaaggggctgaggaatgcgatctctgagaagtatgggttccctgaa gagaacatttacaaagtctacaagaaatgcaagcgaggaatcttagtcaacatggacaac aacatcattcagcattacagcaaccacgtcgccttcctgctggacatgggggagctggac ggcaaaattcagatcatccttaaggagctgtaa >gi568815597f:24231550_24454485|GENSCAN_predicted_peptide_3|381_aa MDNSAQKNERTGKHPRRASEVQKALRICIHNTRVLLGSRPGVLEYLAPQCLLLPASLSMA VSSSSVHRGSFSPKAGISTPVVTSHGARNDIPGPGFYNVIHQSPVSNSVSLSKKGTCMFP SMCARLDTIISKYPAANAYTIPSDFISKRDFSNSCSSMFQLPSFMKALKFETPAPNYYNA SVSCCKQRNNVCTRAGFMSKTQRGSFAFADKGPPPEVTSSSSHKTEEAALFPEHPLHGHY DINESLVKQSPNTLMSCFKSKTNRGLKLTSTGPGPGYYNPSDCTKVPKKTLFPKNPILNF SAQPSPLPPKPPFPGPGQYEIVDYLGPRKHFISSASFVSNTSRWTAAPPQPGLPGPATYK PELPGKQSFLYNEDKKWIPVL >gi568815597f:24231550_24454485|GENSCAN_predicted_CDS_3|1146_bp atggacaactctgcacagaaaaatgaacgcactggcaaacatcccagacgtgccagtgaa gtacagaaagcccttcgcatctgcatccacaacacgagggttctcctgggctctcggcct ggggtgctggagtacctggctccacagtgcctcctgctgcctgcctcactctccatggca gtcagttcctcttccgtccaccggggatcattctcacctaaagcaggaatcagcacccct gtcgtcacctcccatggggcaaggaatgatatcccaggacctgggttctacaatgttatt caccagtcaccggtgtccaacagtgtctcattgtccaagaaaggaacttgcatgtttccc tcaatgtgcgcccgattggacaccatcatttctaaataccctgcagcgaatgcatacact atcccatcggattttatttccaagagagactttagtaattcgtgttccagcatgttccag ttgccaagctttatgaaagctctcaagtttgaaactcctgcaccaaactattacaatgcc tctgtctcttgctgcaagcagagaaacaacgtctgtactcgagccgggtttatgtcaaaa acccaaagaggatctttcgcttttgctgataaaggacctcccccagaggtgacttcatca tccagtcataaaacagaggaggcagctctgtttccggagcacccactccacgggcattat gatatcaacgaatcccttgtgaagcagtcgccaaatacattaatgtcttgttttaaatca aaaaccaaccgtggattaaaactgacgtcaacaggcccgggacctggttattacaacccc agtgattgcacaaaagttccaaaaaagactcttttcccgaaaaaccccatcctgaacttc tctgctcagccttcgcctctgcctccgaagccacctttcccaggtcctggtcagtatgag atcgtggactacttaggcccccgcaagcatttcatctctagtgcatcattcgtgtccaat accagccggtggacagcggcgccgcctcagccaggcctgcctggcccagctacgtacaag ccagagcttccaggaaagcagtccttcctctacaacgaggacaagaaatggatcccggtt ctgtag >gi568815597f:24231550_24454485|GENSCAN_predicted_peptide_4|402_aa MVRRVLFPTTSGTGPPAGTASYLVRSRQLNLASPTSRPNIRESTFRRPRGVPGGQERRDM FGPKSREGSGERWVFKDVLDLNTCLANLGPVALECSFQFCIEAKRSGGMGYLTKGSDMEL TLPEFLLFVPSGSGFLAAADGGVSRWDEGRRSWSKPASGTEVLSPRVSAGFFAYKFGMYE AKRIQNTQEAHQYAIPGVSSEPFSSEILSSHRKDVFCLQVALFCLAAHRENLIGALLAIF GHLVVSIALNLQKYCHIRLAGSKDPRAYFKTKTWWLGLFLMLLGELGVFASYAFAPLSLI VPLSAVSVIASAIIGIIFIKEKWKPKDFLRRYVLSFVGCGLAVVGTYLLVTFAPNSHEKM TGENVTRHLLVEIILFCLLLYFYKEKNANNIVVILLLVALLX >gi568815597f:24231550_24454485|GENSCAN_predicted_CDS_4|1206_bp atggtgcgccgtgtcctcttccccacgacctcagggaccggtccccccgccggaactgct tcctacctggtccggtcccggcagctgaatctggccagcccaacctcccggcctaacatt cgcgagtccaccttccgccgtccgcgaggtgttccaggagggcaggaaaggagagatatg tttggtcccaagtctcgcgagggctctggggagcgctgggtcttcaaggacgttttggat ttgaatacttgcctcgccaacctgggtcccgttgccttggaatgttctttccagttttgc atcgaggccaagaggagcgggggcatgggctaccttactaaaggaagtgacatggagtta actttgccagaatttctcctcttcgtgccgagcggctcgggcttcctggcggcagcagat ggtggagttagcaggtgggatgaggggaggcgttcttggtctaagcccgcttctggaaca gaggtgctgtctcctcgagttagtgctggcttcttcgcctacaagtttgggatgtatgag gcaaaaagaatacagaatacccaagaagctcaccaatatgccattcctggggtctccagc gagccttttagctctgagatcctgtcatcccacaggaaggacgtgttctgcctgcaggtg gctttattctgtcttgctgctcatcgggaaaacctgattggcgccctcttggcgatcttc gggcacctcgtggtcagcattgcacttaacctccagaagtactgccacatccgcctggca ggctccaaggatccccgggcctatttcaagaccaagacatggtggctgggcctgttcctg atgcttctgggcgagctgggtgtgttcgcctcctacgccttcgcgccgctgtcactcatc gtgcccctcagcgcagtttctgtgatagctagtgccatcataggaatcatattcatcaag gaaaagtggaaaccgaaagactttctgaggcgctacgtcttgtcctttgttggctgcggt ttggctgtcgtgggtacctacctgctggtgacattcgcacccaacagtcacgagaagatg acaggcgagaatgtcaccaggcacctcctggtggagatcattctgttctgcttgctgctc tacttctacaaggagaagaacgccaacaacattgtcgtgattcttctcttggtggcgtta cttgnn