GENSCAN 1.0 Date run: 8-Nov-116 Time: 06:09:17 Sequence gi568815582f:23248600_23480798 : 232199 bp : 46.74% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 5735 5774 40 -2.86 1.01 Init + 6171 6279 109 2 1 70 86 58 0.129 4.18 1.02 Intr + 53705 53838 134 0 2 77 80 38 0.058 2.26 1.03 Term + 67333 68514 1182 0 0 37 36 3035 0.965 284.97 1.04 PlyA + 70018 70023 6 1.05 2.00 Prom + 76986 77025 40 -6.96 2.01 Init + 82252 82390 139 0 1 114 70 93 0.143 8.74 2.02 Intr + 99993 100311 319 1 1 122 58 604 0.477 55.82 2.03 Intr + 104202 104475 274 0 1 97 94 310 0.985 30.04 2.04 Intr + 106700 106890 191 1 2 67 97 216 0.420 18.78 2.05 Intr + 119257 119360 104 1 2 79 109 138 0.093 14.82 2.06 Intr + 122700 122863 164 1 2 99 44 178 0.833 14.19 2.07 Intr + 123177 123284 108 2 0 58 75 172 0.999 13.38 2.08 Intr + 127139 127256 118 1 1 109 72 127 0.887 13.24 2.09 Intr + 128566 128641 76 2 1 92 89 70 0.888 6.07 2.10 Intr + 128730 128787 58 0 1 61 78 70 0.889 2.19 2.11 Intr + 129933 130168 236 2 2 52 99 112 0.533 5.19 2.12 Intr + 131495 131570 76 2 1 88 92 153 0.999 15.22 2.13 Term + 131822 132202 381 1 0 134 40 650 0.972 59.64 2.14 PlyA + 132410 132415 6 1.05 3.21 PlyA - 134134 134129 6 1.05 3.20 Term - 140487 140321 167 1 2 127 46 225 0.996 20.28 3.19 Intr - 143924 143781 144 0 0 115 72 213 0.991 22.65 3.18 Intr - 144748 144634 115 1 1 105 98 57 0.999 8.42 3.17 Intr - 149530 149447 84 1 0 92 74 111 0.966 10.12 3.16 Intr - 149875 149708 168 1 0 39 72 99 0.891 3.54 3.15 Intr - 150518 150361 158 0 2 18 75 75 0.534 -1.07 3.14 Intr - 151121 151019 103 2 1 84 -6 121 0.682 2.05 3.13 Intr - 155235 155095 141 0 0 109 86 196 0.995 22.05 3.12 Intr - 157663 157477 187 0 1 93 86 102 0.870 10.19 3.11 Intr - 161761 161696 66 0 0 79 91 81 0.975 5.52 3.10 Intr - 164965 164849 117 0 0 132 65 84 0.998 10.08 3.09 Intr - 168672 168368 305 0 2 91 80 176 0.804 12.49 3.08 Intr - 170228 170101 128 0 2 58 63 167 0.995 11.60 3.07 Intr - 176348 176150 199 2 1 90 17 270 0.867 19.02 3.06 Intr - 185068 184946 123 1 0 115 92 54 0.995 9.28 3.05 Intr - 186119 186037 83 0 2 58 116 131 0.999 12.26 3.04 Intr - 194046 193878 169 0 1 108 95 98 0.993 12.02 3.03 Intr - 196565 196449 117 2 0 108 96 141 0.999 17.56 3.02 Intr - 197362 197214 149 2 2 84 68 80 0.563 5.55 3.01 Init - 204395 204227 169 2 1 100 84 327 0.964 33.20 3.00 Prom - 214122 214083 40 -4.06 4.06 PlyA - 214962 214957 6 1.05 4.05 Term - 219101 218991 111 2 0 119 32 141 0.991 10.16 4.04 Intr - 220397 220287 111 2 0 56 53 72 0.673 1.08 4.03 Intr - 221566 221397 170 2 2 68 113 70 0.865 7.17 4.02 Intr - 231288 231166 123 1 0 70 68 102 0.854 6.96 4.01 Init - 232112 232046 67 2 1 65 87 27 0.630 1.58 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 218083 218256 174 0 0 75 54 199 0.842 12.96 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582f:23248600_23480798|GENSCAN_predicted_peptide_1|474_aa MQGRRWLTQFLQVIIPGGHKPETNPRDFLLPNRFPQVGLAALAGCPSVTNTRPPPPAWRA PPPPPPPTARILRVPAPPPEQSPSPPFPSSPLTITILTIIAIIIITTIIPIITITIATIT ITILTITTIRITITTIITTIITITTITILTTITIITITTIIIITTIIPIITITILTITTI RITITTIITTIITIFTITTITILTTITIITIMITITTIAVTISSIITITILTIITIIIIT TITITISFIITIIIITITTITLTITSIITITILTITTIRITITTIITTIITILTITTITI FTTITIITIMITITLTISSVITITILTIITITIIINTITITTITLTISSVITVTILTIIT ITIINTITITTITLTISSIITIIILSTITIIITIISIMTTITLTITSIITITITILTIIT ITTRATLWSPTWFPRPKKQPTDGVTDPKDMMTRHLLERWLRIWALKPERPKFDS >gi568815582f:23248600_23480798|GENSCAN_predicted_CDS_1|1425_bp atgcagggtagacgttggctgacacagttcctgcaggtaattattcctgggggccacaag cccgaaaccaaccccagagattttctacttcccaacagatttccacaggtcggtctcgcc gcgctcgccgggtgtcccagtgtcaccaacactcggccgccgccgccagcttggcgcgca ccgccgcctccgccaccgccgacagcgcgcatcctccgtgtccccgctccgccgcccgag cagtcaccatcaccaccattcccctcatcacctctcaccatcaccatcctcaccatcatc gccatcatcatcatcaccaccatcatccccatcatcaccatcaccatcgccaccatcacc atcaccatcctcaccatcaccaccatcaggatcaccatcaccaccatcatcacaaccatt atcaccatcaccaccatcactatcctcaccaccatcaccatcatcaccatcaccaccatc atcatcatcaccaccatcatccccatcatcaccatcaccatcctcaccatcaccaccatc aggatcaccatcaccaccatcatcacaaccattatcaccatcttcaccatcaccaccatc accatcctcaccaccatcaccatcatcaccatcatgatcaccatcaccaccatcgccgtc accatctcctccatcatcaccatcaccatcctcaccatcatcaccatcatcatcatcacc accatcaccatcaccatctccttcatcatcaccatcatcatcatcaccatcaccaccatc accctgaccatcacctccatcatcaccatcaccatcctcaccatcaccaccatcaggatc accatcaccaccatcatcacaaccattatcaccatcctcaccatcaccaccatcaccatc ttcaccaccatcaccatcatcaccatcatgatcaccatcaccctcaccatctcctctgtc atcaccatcaccatcctcaccatcatcaccatcaccatcatcatcaacaccatcaccatc accaccatcaccctcaccatctcctctgtcatcactgtcaccatcctcaccatcatcacc atcaccatcatcaacaccatcactatcaccaccatcaccctcaccatctcctccatcatc accatcatcatcctcagcaccatcaccattatcatcaccatcatcagcatcatgaccacc atcaccctcaccatcacctccattatcaccatcaccatcaccattctcaccatcatcacc atcaccaccagagcaacattatggtcacccacatggtttcccagacccaagaagcagccc acagatggggtaacagacccaaaggacatgatgacacgtcatctcctagaaagatggcta agaatttgggctctaaagcctgaaagacctaaatttgattcctag >gi568815582f:23248600_23480798|GENSCAN_predicted_peptide_2|747_aa MPGRDLLRAVEASLELAAGDGDNPGYGFTSGDQTRFSKDPGTCSLKGATMHVKKYLLKGL HRLQKGPGYTYKELLVWYCDNTNTHGPKRIICEGPKKKAMWFLLTLLFAALVCWQWGIFI RTYLSWEVSVSLSVGFKTMDFPAVTICNASPFKYSKIKHLLKDLDELMEAVLERILAPEL SHANATRNLNFSIWNHTPLVLIDERNPHHPMVLDLFGDNHNGLTSSSASEKICNAHGCKM AMRLCSLNRTQCTFRNFTSATQALTEWYILQATNIFAQVPQQELVEMSYPGEQMILACLF GAEPCNYRNFTSIFYPHYGNCYIFNWGMTEKALPSANPGTEFGLKLILDIGQEDYVPFLA STAGVRLMLHEQRSYPFIRDEGIYAMSGTETSIGVLVDKLQRMGEPYSPCTVNGSEVPVQ NFYSDYNTTYSIQACLRSCFQDHMIRNCNCGHYLYPLPRGEKYCNNRDFPDWAHCYSDLQ MSVAQRETCIGMCKESCNDTQYKMTISMADWPSEASENSSHSFHYDLPPAPHPNCDSPGG LRKGQGCGLPPPGNRAMTGRDAADGNFCNHLLGFQDWIFHVLSQERDQSTNITLSRKGIV KLNIYFQEFNYRTIEESAANNIVWLLSNLGGQFGFWMGGSVLCLIEFGEIIIDFVWITII KLVALAKSLRQRRAQASYAGPPPTVAELVEAHTNFGFQPDTAPRSPNTGPYPSEQALPIP GTPPPNYDSLRLQPLDVIESDSEGDAI >gi568815582f:23248600_23480798|GENSCAN_predicted_CDS_2|2244_bp atgcccggccgagatctcttaagggcagtcgaggcttccctggagctagctgcaggagat ggagataacccgggttatggcttcacatctggagaccagacaaggttttccaaggaccct ggcacatgctccctgaaaggtgccactatgcacgtgaagaagtacctgctgaagggcctg catcggctgcagaagggccccggctacacgtacaaggagctgctggtgtggtactgcgac aacaccaacacccacggccccaagcgcatcatctgtgaggggcccaagaagaaagccatg tggttcctgctcaccctgctcttcgccgccctcgtctgctggcagtggggcatcttcatc aggacctacttgagctgggaggtcagcgtctccctctccgtaggcttcaagaccatggac ttccctgccgtcaccatctgcaatgctagccccttcaagtattccaaaatcaagcatttg ctgaaggacctggatgagctgatggaagctgtcctggagagaatcctggctcctgagcta agccatgccaatgccaccaggaacctgaacttctccatctggaaccacacacccctggtc cttattgatgaacggaacccccaccaccccatggtccttgatctctttggagacaaccac aatggcttaacaagcagctcagcatcagaaaagatctgtaatgcccacgggtgcaaaatg gccatgagactatgtagcctcaacaggacccagtgtaccttccggaacttcaccagtgct acccaggcattgacagagtggtacatcctgcaggccaccaacatctttgcacaggtgcca cagcaggagctagtagagatgagctaccccggcgagcagatgatcctggcctgcctattc ggagctgagccctgcaactaccggaacttcacgtccatcttctaccctcactatggcaac tgttacatcttcaactggggcatgacagagaaggcacttccttcggccaaccctggaact gaattcggcctgaagttgatcctggacataggccaggaagactacgtccccttccttgcg tccacggccggggtcaggctgatgcttcacgagcagaggtcataccccttcatcagagat gagggcatctacgccatgtcggggacagagacgtccatcggggtactcgtggacaagctt cagcgcatgggggagccctacagcccgtgcaccgtgaatggttctgaggtccccgtccaa aacttctacagtgactacaacacgacctactccatccaggcctgtcttcgctcctgcttc caagaccacatgatccgtaactgcaactgtggccactacctgtacccactgccccgtggg gagaaatactgcaacaaccgggacttcccagactgggcccattgctactcagatctacag atgagcgtggcgcagagagagacctgcattggcatgtgcaaggagtcctgcaatgacacc cagtacaagatgaccatctccatggctgactggccttctgaggcctccgagaacagcagc cacagcttccactacgaccttcctcctgctcctcatccaaattgtgattcccccgggggc ctcaggaagggacagggctgtggtctacctcccccagggaacagagccatgactgggagg gatgctgcagatggcaacttttgcaaccaccttcttgggttccaggactggattttccac gtcttgtctcaggagcgggaccaaagcaccaatatcaccctgagcaggaagggaattgtc aagctcaacatctacttccaagaatttaactatcgcaccattgaagaatcagcagccaat aacatcgtctggctgctctcgaatctgggtggccagtttggcttctggatggggggctct gtgctgtgcctcatcgagtttggggagatcatcatcgactttgtgtggatcaccatcatc aagctggtggccttggccaagagcctacggcagcggcgagcccaagccagctacgctggc ccaccgcccaccgtggccgagctggtggaggcccacaccaactttggcttccagcctgac acggccccccgcagccccaacactgggccctaccccagtgagcaggccctgcccatccca ggcaccccgccccccaactatgactccctgcgtctgcagccgctggacgtcatcgagtct gacagtgagggtgatgccatctaa >gi568815582f:23248600_23480798|GENSCAN_predicted_peptide_3|963_aa MDFSKFLADDFDVKEWINAAFRAGSKEAASGKADGHAATLVMKLQLFIQEVNHAVEETSH QALQNMPKVLRDVEALKQEASFLKEQMILVKEDIKKFEQDTSQSMQVLVEIDQVKSRMQL AAESLQEADKWSTLSADIEETFKTQDIAVISAKLTGMQNSLMMLVDTPDYSEKCVHLEAL KNRLEALASPQIVAAFTSQAVDQSKVFVKVFTEIDRMPQLLAYYYKCHKVQLLAAWQELC QSDLSLDRQLTGLYDALLGAWHTQIQWATQVFQKPHEVVMVLLIQTLGALMPSLPSCLSN GVERAGPEQELTRLLEFYDATAHFAKGLEMALLPHLHEHNLVKVTELVDAVYDPYKPYQL KYGDMEESNLLIQMSAVPLALEKHSNVGTIFLTSLLSDFTTDEKAWRSSPISALSVLKPN AALSVFADQEHGEVIDCVQELSHSVNKLFGLASAAVDRCVRFTNGLGTCGLLSALKSLFA KYVSDFTSTLQSIRKKCKLDHIPPNSLFQEDWTAFQNSIRIIATCGELLRHCGDFEQQLA NRILSTAGKYLSDSCSPRSLAGFQESILTDKKNSAKNPWQEYNYLQKDNPAEYASLMEIL YTLKEKGSSNHNLLAAPRAALTRLNQQAHQLAFDSVFLRIKQQLLLISKMDPLSASVYRI HPMASRFSNRPELFLLLASDTESSRRSGNRIQHQVAADSSSYMTAGVTPGVQPERALFSR AGFRVAQVGSKKRDFCVKRNLDQKYPTFSCYNLSEINERDQFPPHSSANVAVPVSWTCRN AEREIVAFCMAGFQSWNTAGIGETLTDELPAFSLTPLEYISNIGQYIMSLPLNLEPFVTQ EDSALELALHAGKLPFPPEQGDELPELDNMADNWLGSIARATMQTYCDAILQIPELSPHS AKQLATDIDYLINVMDALGLQPSRTLQHIVTLLKTRPEDYRQVSKGLPRRLATTVATMRS VNY >gi568815582f:23248600_23480798|GENSCAN_predicted_CDS_3|2892_bp atggacttctccaagttcctggcagacgacttcgacgtgaaggagtggatcaatgcggcc ttcagggccggctccaaggaggcggcgtccgggaaggcggatggccacgcagccaccctg gtgatgaagctgcagctgttcatccaagaggtgaaccacgccgtggaggaaacaagtcac caagctctccagaacatgcccaaagtgctccgtgatgttgaagccctaaaacaggaggca tctttcctgaaagaacagatgattcttgtcaaggaggacattaaaaaatttgaacaggac acatctcaatccatgcaggtgttggtagaaattgaccaagtgaagtccagaatgcaactt gctgccgaatctcttcaggaagcagataagtggagcacgttgagcgccgatattgaggag acatttaagactcaggacatagctgtgatttctgccaagctaacaggtatgcagaacagc ttaatgatgcttgttgatacaccagactactcagaaaagtgtgtgcacttggaggcactg aagaacaggctggaggccctagccagtccacagattgtagcggcattcacctctcaggct gtagatcagtccaaagtgtttgtgaaggtgtttactgaaattgaccggatgccccagctc ctggcctactactacaagtgtcacaaggtgcagcttttagcagcctggcaagagctgtgt caaagtgacctatccctggaccggcagcttaccggactctatgatgccttgcttggtgct tggcacacacaaatccagtgggctacacaggttttccagaagccccacgaggtggtaatg gtgctgctgattcagaccctgggggccctcatgccctcgctgccctcctgcctcagcaac ggcgtggagagggcagggcccgagcaggagctcaccaggctgctggagttctacgacgcc accgcccacttcgccaagggcttggagatggcactgctcccccacctacatgaacacaat ctggtaaaagtcacggagctggtggatgctgtgtatgatccatacaaaccctaccagctg aagtatggcgacatggaagagagcaacctcctcatccagatgagtgctgtgcctctggct ctagagaagcattcaaacgttgggactatatttctgacatcgttattaagtgattttact acagatgagaaggcttggaggtcttcccctatctcagcattgtctgttctaaagcctaat gcagctttgtctgtatttgctgaccaggagcatggggaagtgattgactgtgtgcaggag ctgagccactccgtgaacaagctgtttggtctggcgtctgcagccgttgacagatgcgtc agattcaccaatggcctggggacctgcggcctgttgtcagccctgaaatccctctttgcc aagtatgtgtctgatttcaccagcactctccagtccatacgaaagaagtgcaaactggac cacattcctcccaactccctcttccaggaagattggacggcttttcagaactccattagg ataatagccacctgtggagagcttttgcggcattgtggggacttcgagcagcagctagcc aacaggattttgtccacagctgggaagtatctatctgattcctgcagcccccggagcctg gctggttttcaggagagcatcttgacagacaagaagaactctgccaagaacccatggcaa gaatataattacctccagaaagataaccctgctgaatatgccagtttaatggaaatactt tatacccttaaggaaaaagggtcaagcaaccacaacctgctggctgcacctcgagcagcg ctgactcggcttaaccagcaggcccaccagctggctttcgattccgtgttcctgcgcatc aaacaacagctgttgcttatttcgaagatggaccctctgtctgcctctgtgtaccgcata catcctatggccagccgcttctctaatcgccctgagctcttcttgctccttgcctctgac accgagagctctagaagatcaggcaaccggatccaacatcaggtggctgcagacagcagt tcctacatgaccgccggagtcactcccggagttcagccagaaagggccttgttttccagg gctggctttagagtggctcaggttggaagtaagaagcgagatttttgtgtgaagagaaac ttggaccaaaaatatccaactttctcctgttacaatctcagcgagataaatgaaagggat cagttcccaccccacagttcagccaatgtggcagttcctgtttcatggacctgcaggaat gcggagagagaaatcgttgccttctgtatggcgggcttccagagctggaatacggctggc atcggagaaaccctcacagatgaactgcccgcctttagtctcacccctctcgagtacatc agcaacatcgggcagtacatcatgtccctccccctgaatcttgagccatttgtgactcag gaggactctgccttagagttggcattgcacgctggaaagctgccatttcctcctgagcag ggggatgaattgcccgagctggacaacatggctgacaactggctgggctcgatcgccaga gccacaatgcagacctactgtgatgcgatcctacagatccctgagctgagcccacactct gccaagcagctggccactgacatcgactatctgatcaacgtgatggatgccctgggcctg cagccgtcccgcaccctccagcacatcgtgacgctactgaagaccaggcctgaggactat agacaggtcagcaaaggcctgccccgtcgcctggccaccaccgtggccaccatgcggagt gtgaattactga >gi568815582f:23248600_23480798|GENSCAN_predicted_peptide_4|193_aa MEGRVTFGNRVTSSLGDIPVSRVFQNPAGCMKTCPLIDLEVDNGPAQMGTVVPSLLHQDL AALGSLPPLIVYDRNGFRILLHFSQTGAPGHPEVQVLLLTMMSTAPQPVWDIMFQVAVPK SMRVKLQPASSSKLPAFSPLMPPAVISQMLLLDNPHKEPIRLRYKLTFNQGGQPFSEVGE VKDFPDLAVLGAA >gi568815582f:23248600_23480798|GENSCAN_predicted_CDS_4|582_bp atggagggccgggtcacctttggaaacagagtgaccagctcattgggagacatccctgtc tccagagtctttcagaatccagcaggctgcatgaagacctgccccctgattgacttggag gtggacaatggacctgcgcagatggggactgtggtgccatctttgcttcatcaggacctg gcagccttgggcagcctgccgcctctcattgtgtatgaccggaatggattcagaattctg ctccacttctcccagacgggagcccctgggcacccagaggtacaggtgctgctcttgacc atgatgagcacggctccccagcctgtctgggatatcatgtttcaagtggctgtgccaaag tcaatgagagtgaagctgcagccggcatccagctccaagcttcctgcattcagtcctttg atgcctccagctgtgatatctcagatgctgctgcttgacaatccacacaaagaacctatc cgcttacggtacaagctgacattcaaccaaggtggacagcctttcagcgaagtaggagaa gtgaaagacttcccagacctggctgtcttgggcgcagcctaa