GENSCAN 1.0	Date run:  7-Nov-116	Time: 02:22:21

Sequence gi568815593r:87294449_87512794 : 218346 bp : 35.87% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +  11070  11109   40                              -2.05
 1.01 Init +  20952  20997   46  2  1   99  102     0 0.015   3.43
 1.02 Intr +  24764  24865  102  0  0   58   37    98 0.007   0.93
 1.03 Intr +  25038  25167  130  1  1  108  -30    84 0.001  -2.57
 1.04 Intr +  36900  37052  153  0  0  124   97    90 0.907  11.77
 1.05 Intr +  43526  43643  118  2  1   64  103   126 0.968  11.25
 1.06 Intr +  54738  54916  179  2  2   40   99   104 0.765   4.50
 1.07 Intr +  58709  58787   79  2  1   46   82    79 0.150   1.73
 1.08 Intr +  68103  68223  121  2  1   52  110   136 0.935  11.35
 1.09 Intr +  68900  69056  157  0  1   35  110    97 0.883   4.75
 1.10 Intr +  75365  75452   88  2  1   79   94    39 0.987   2.65
 1.11 Intr +  77670  77747   78  2  0  109   72   100 0.998   9.33
 1.12 Intr +  79715  79872  158  1  2  100   80    83 0.934   6.59
 1.13 Intr +  80392  80468   77  1  2   95   84    70 0.994   5.54
 1.14 Intr +  81945  82117  173  1  2   70   52   141 0.966   7.44
 1.15 Intr +  90801  90941  141  2  0    9   93   133 0.544   5.53
 1.16 Intr +  92378  92455   78  1  0   58   63    73 0.600   0.63
 1.17 Intr +  94945  95079  135  0  0   41   96   192 0.989  15.14
 1.18 Intr +  96352  96431   80  0  2   90   69    65 0.651   2.23
 1.19 Term +  97532  97586   55  2  1  117   33    35 0.547  -3.05
 1.20 PlyA +  98360  98365    6                                1.05

 2.14 PlyA -  98607  98602    6                                1.05
 2.13 Term - 103812 103617  196  2  1   51   42   111 0.442  -1.30
 2.12 Intr - 105057 104946  112  1  1   88  113    95 0.904  10.52
 2.11 Intr - 107324 107254   71  1  2   61  106    10 0.922  -1.99
 2.10 Intr - 110559 110396  164  0  2   69   62    85 0.961   1.85
 2.09 Intr - 111370 111308   63  1  0   55  105    63 0.819   2.70
 2.08 Intr - 112709 112425  285  2  0  -15   68   305 0.938  14.81
 2.07 Intr - 113738 113528  211  1  1  124   92   184 0.999  20.39
 2.06 Intr - 114915 114842   74  0  2   97   82     0 0.904  -2.41
 2.05 Intr - 116898 116776  123  0  0   44  103    96 0.984   6.56
 2.04 Intr - 118088 117987  102  2  0   -9   89   124 0.784   2.25
 2.03 Intr - 118374 118230  145  2  1  107  100   237 0.974  26.16
 2.02 Intr - 151238 150922  317  2  2   46   44   343 0.005  19.64
 2.01 Init - 166012 165806  207  1  0   68   80   118 0.460   7.87
 2.00 Prom - 171317 171278   40                              -3.65

 3.00 Prom + 179379 179418   40                              -8.25
 3.01 Sngl + 179621 180187  567  1  0   51   43   206 0.477   8.30
 3.02 PlyA + 180415 180420    6                                1.05

 4.00 Prom + 183777 183816   40                              -3.65
 4.01 Init + 187209 187403  195  2  0   50   15   172 0.092   5.08
 4.02 Intr + 189921 189943   23  2  2   72   82    16 0.085  -5.18
 4.03 Term + 193832 194048  217  2  1  105   46   150 0.133   8.13
 4.04 PlyA + 195692 195697    6                                1.05

 5.03 PlyA - 195786 195781    6                                1.05
 5.02 Term - 205446 205077  370  2  1   27   48   318 0.577  14.83
 5.01 Init - 205497 205490    8  0  2   75   72     0 0.488  -2.35
 5.00 Prom - 210350 210311   40                              -3.55

 6.00 Prom + 213277 213316   40                              -6.25
 6.01 Init + 216543 216689  147  2  0   72   12   147 0.750   5.64
 6.02 Term + 217106 217840  735  1  0   38   55   299 0.741  14.07
 6.03 PlyA + 218259 218264    6                                1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 17350 17235 116 1 2 47 109 142 0.971 11.93 S.002 Term - 147369 147257 113 1 2 57 39 98 0.951 -0.46 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:87294449_87512794|GENSCAN_predicted_peptide_1|715_aa MGACPTLMTLSNPHKGTQCKLSVDLPLWGLEDDGPLPTAPLGSALVETLYPTPCGIHQGL WLAPSEAMARVVHWPLLATATAGAGVAGMRGTKWYHGKLDRTIAEERLRQAGKSGSYLIR ESDRRPGSFVLSFLSQMNVVNHFSFLKGDMFIVHNELEDGWMWVTNLRTDEQGLIVEDLV EEVGKTNSLILTVGQVCSFLVRPSDNTPGDYSLYFRTNENIQRFKICPTPNNQFMMGGRY YNSIGDIIDHYRKEQIVEGYYLKEPVPMQDQEQVLNDTVDGKEIYNTIRRKTKDAFYKNI VKKGYLLKKGKGKRWKNLYFILEGSDAQLIYFESEKRATKPKGLIDLSVCSVYVVHDSLF GRPNCFQIVVQHFSEEHYIFYFAGETPEQAEDWMKGLQAFCNLRKSSPGTSNKRLRQVSS LVLHIEEAHKLPVKHFTNPYCNIYLNSVQVAKTHAREGQNPVWSEEFVFDDLPPDINRFE ITLSNKTKKSKDPDILFMRCQLSRLQKGHATDEWFLLSSHIPLKGIEPGSLRVRARYSME KIMPEEEYSEFKELEVLLDLVSLAVPNSVTDSPSPIAARTLILVAKSVQNLANLVEFGAK EPYMEGVNPFIKSNKHRMIMFLDELGNVPELPDTTEHSRTDLSRDLAALHEICVAHSDEL RTLSNERGAQQHVLKKLLAITELLQQKQNQYTKTNDVRGSLNLSIRARPREKLKL >gi568815593r:87294449_87512794|GENSCAN_predicted_CDS_1|2148_bp atgggggcctgccccaccctgatgaccctatctaatcctcacaaaggcacacagtgcaag ctgtcagtggatctacccttgtggggtctggaggatgatggccctcttcccacagctcca ttaggcagtgccctagtggagactctctacccaacaccatgtggaatccaccaaggcttg tggcttgcaccctctgaagccatggcccgagttgtacattggccccttttagccacagcc acagctggagctggagtggctgggatgcggggcaccaagtggtatcacggaaaacttgac agaacgatagcagaagaacgcctcaggcaggcagggaagtctggcagttatcttataaga gagagtgatcggaggccagggtcctttgtactttcatttcttagccagatgaatgttgtc aaccattttagtttcttaaaaggagatatgttcattgttcataatgaattagaagatgga tggatgtgggttacaaatttaagaacagatgaacaaggccttattgttgaagacctagta gaagaggtgggaaaaactaacagcttaattcttacagttggtcaagtctgcagttttctt gtgaggccctcagataatactcctggcgattattcactttatttccggaccaatgaaaat attcagcgatttaaaatatgtccaacgccaaacaatcagtttatgatgggaggccggtat tataacagcattggggacatcatagatcactatcgaaaagaacagattgttgaaggatat tatcttaaggaacctgtaccaatgcaggatcaagaacaagtactcaatgacacagtggat 
ggcaaggaaatctataataccatccgtcgtaaaacaaaggatgccttttataaaaacatt gttaagaaaggttatcttctgaaaaagggcaaaggaaaacgttggaaaaatttatatttt atcttagagggtagtgatgcccaacttatttattttgaaagcgaaaaacgagctaccaaa ccaaaaggattaatagatctcagtgtatgttctgtctatgtcgttcatgatagtctcttt ggcaggccaaactgttttcagatagtagttcagcactttagtgaagaacattacatcttt tactttgcaggagaaactccagaacaagcagaggattggatgaaaggtctgcaggcattt tgcaatttacggaaaagtagtccagggacatccaataaacgccttcgtcaggtcagcagc cttgttttacatattgaagaagcccataaactcccagtaaaacattttactaatccatat tgtaacatctacctgaatagtgtccaagtagcaaaaactcatgcaagggaagggcaaaac ccagtatggtcagaagagtttgtctttgatgatcttcctcctgacatcaatagatttgaa ataactcttagtaataaaacaaagaaaagcaaagatcctgatatcttatttatgcgctgc cagttgagccgattacagaaagggcatgccacagatgaatggtttctgctcagctcccat ataccattaaaaggtattgaaccagggtccctgcgtgttcgagcacgatactctatggaa aaaatcatgccagaagaagagtacagtgaatttaaagagctggaagtgctgttggacttg gtgtcattagctgtgcccaattctgttacagattctccatctcctattgctgcaagaaca ctgatattagtggctaaatctgtgcagaacttagcaaatcttgtggaatttggagctaag gagccctacatggaaggtgtcaatccattcatcaaaagcaacaaacatcgtatgatcatg tttttagatgaacttgggaatgtacctgaacttccggacactacagagcattctagaacg gacctgtcccgtgatttagcagcattgcatgagatttgcgtggctcattcagatgaactt cgaacgctcagtaatgagcgtggtgcacagcagcacgtattgaaaaagcttctggctata acagaactgcttcaacaaaaacaaaaccagtatacaaaaaccaatgatgtcaggggcagt ttaaaccttagcatcagagcaaggcccagggagaagctgaagctttag >gi568815593r:87294449_87512794|GENSCAN_predicted_peptide_2|689_aa MNRQLSKEDIQMANKHMKNVNITNDHGNAHQNHNVIPPYSYNNGHYKKKKIDVGVDALNR EHFYTAGGKERSSSPATEQSWTENDFDEFREGFRRSNYSELQEEIQTKSKEVENFEKNLD ECKTRITNTEKCLKELMELKAKARELREECRSLRSRCDQLEDRVSVMEDEMNEMKPSGPG HDSIMYHNSSQKRHWTFSSEEQLARLRADANRKFRCKAVANGKRAPCQVPWVQRRTTPSP SSNGRQVTSLCNVVKKRVLPNDPVFLEPHEEMTLCKYYEKRLLEFCSVFKPAMPRSVVGT ACMYFKRFYLNNSVMEYHPRIIMLTCAFLACKVDEFNVSSPQFVGNLRESPLGQEKALEQ ILEYELLLIQQLNFHLIVHNPYRPFEGFLIDLKSRNRREKNIPIRDFPGKKLKFKKPFRA EEPEQSLAMQGEEITDSCCQSGRQSLDCGDNVTDFEYSKEKLVKGLNLESGMIRFAALKD HVGNLMDQKFVGSILGQGIQERNEDQRTLTRYPILENPEILRKTADDFLNRIALTDAYLL 
YTPSQIALTAILSSASRAGITMESYLSESLMLKENRTCLSQLLDIMKSMRNLVKKYEPPR SEEVAVLKQKLERCHSAELALNVITCSWGISTDLGSVQNMRLFYYSYRYNDKACFSIELE IGGFHSLLGFSGNRNFSEWLQRSNVDFNE >gi568815593r:87294449_87512794|GENSCAN_predicted_CDS_2|2070_bp atgaatagacaactctcaaaagaagatatacaaatggccaacaaacatatgaaaaatgtc aatatcactaatgatcatggaaatgcacatcaaaaccacaatgtgataccaccttactcc tacaacaatggccattacaaaaaaaaaaaaatagatgttggtgtggatgcactgaatagg gaacacttctatactgctggtgggaaagaacgcagttcctcaccagcaacggaacaaagc tggacggagaatgactttgacgagttcagagaaggcttcagacgatcaaactactccgag ctacaggaggaaattcaaaccaaaagcaaagaagttgaaaactttgaaaaaaatttagac gaatgtaaaactagaataaccaatacagagaagtgcttaaaggagctgatggagctgaaa gccaaggctcgagaactacgtgaagaatgcagaagcctcaggagccgatgcgatcaactg gaagacagggtatcagtgatggaagatgaaatgaatgaaatgaaaccctctggacctggt cacgattccataatgtaccacaacagtagtcagaagcggcactggaccttctccagcgag gagcagctggcaagactgcgggctgacgccaaccgcaaattcagatgcaaagccgtggcc aacgggaagcgagcaccttgccaggttccgtgggtacaaagacgaacaacgccatccccg tcgtcgaatggcagacaagtaaccagtctttgtaacgtagtgaagaagagagttcttccg aatgatccagtctttcttgagcctcatgaagaaatgacactctgcaaatactatgagaaa aggttattggaattctgttcggtgtttaagccagcaatgccaagatctgttgtgggtacg gcttgtatgtatttcaaacgtttttatcttaataactcagtaatggaatatcaccccagg ataataatgctcacttgtgcatttttggcctgcaaagtagatgaattcaatgtatctagt cctcagtttgttggaaacctccgggagagtcctcttggacaggagaaggcacttgaacag atactggaatatgaactacttcttatacagcaacttaatttccaccttattgtccacaat ccttacagaccatttgagggcttcctcatcgacttaaagagtaggaataggagagagaag aacattccaatcagagacttccctggtaaaaagctcaaattcaagaaaccattccgtgct gaagagcctgaacagagcctagcaatgcagggggaagagatcacagacagctgctgccag agtggaaggcagagtctagactgtggagacaatgttacagattttgaatacagcaaagag aagctagtgaaaggtttaaatttagagagtggcatgattagatttgcagctttaaaagat catgttggcaaccttatggatcagaaatttgttggcagcatattggggcagggaattcag gagagaaatgaggaccagaggacactgacccgctatcccatattggagaatccagagatt ttgaggaaaacagctgatgactttcttaatagaattgcattgacggatgcttacctttta tacacaccttcccaaattgccctgactgccattttatctagtgcctccagggctggaatt actatggaaagttatttatcagagagtctgatgctgaaagagaacagaacttgcctgtca 
cagttactagatataatgaaaagcatgagaaacttagtaaagaagtatgaaccacccaga tctgaagaagttgctgttctgaaacagaagttggagcgatgtcattctgctgagcttgca cttaacgtaatcacttgtagttggggaatatctacggacttaggaagtgtgcagaatatg aggttgttttattatagttataggtataacgacaaagcatgttttagcatagaacttgaa attggtggatttcattcactcctcggtttcagtggtaacaggaatttttcagagtggtta cagagaagtaatgttgactttaatgaataa >gi568815593r:87294449_87512794|GENSCAN_predicted_peptide_3|188_aa MGICEKTKHIIGVPESDRENGTKMENTLQDIIQGNFPNLARQANIRIQEIHRTSLRYSLR SKTPTHTIVRFSKVETKEEMLRTASEKGQVTYKGKPISLTVDLTAETLQDRREWGPIFNI LKEKNFQPRISYPAKLSFISEGEIKSFPDKQMLRNFVTTRPDLQEFLKAALNMERKNQHQ PLQKHTKL >gi568815593r:87294449_87512794|GENSCAN_predicted_CDS_3|567_bp atgggaatatgtgaaaagaccaaacatataattggtgtacctgaaagtgacagggagaat ggaaccaagatggaaaacacacttcaggatattatccaggggaacttccccaacctagca agacaggccaacattcgaattcaagaaatacacagaacatcactaagatactccttgaga agtaaaaccccaacacacacaattgtcagattctccaaggttgaaacaaaggaagaaatg ttaaggacagccagtgagaaaggtcaggttacctacaaagggaagcccattagcctaaca gtggatctcactgcagaaaccctacaagacagaagagagtgggggccaatattcaacatt cttaaagaaaagaattttcaacccagaatttcatatccagccaaactaagcttcataagt gaaggagaaataaaatcctttcctgataagcaaatgctgaggaattttgtcaccaccagg cctgacttacaagagttcctgaaggcagcactaaatatggaaaggaaaaaccagcaccag ccactgcaaaaacataccaaattgtaa >gi568815593r:87294449_87512794|GENSCAN_predicted_peptide_4|144_aa MELTPGEDAVNIVEMTTKDLEYCINLVDKAAAGFERTDSNVESSVVVKCYQTASHATEKS FMKERPPPPNKPRTSLKERQQPQSGAYRSNFHLPGTEHLGEGVAVGTASANLNIPACWLR REQLIPQHITRALLRDRLPPYVGP >gi568815593r:87294449_87512794|GENSCAN_predicted_CDS_4|435_bp atggaattgactcctggtgaagatgctgtgaacattgttgaaatgacaacaaaggattta gaatattgtataaacttagttgataaagcagcagcagggtttgagaggactgactcaaat gttgaaagttctgttgtggtaaaatgctatcaaacagcatcacatgctacagagaaatct ttcatgaaagaaagacccccacctcccaacaagccccggacatctctgaaagaaaggcag cagccccaatcaggggcttatagatcaaatttccatctccctgggacagagcacctgggg gaaggggtggctgtgggcacagcttcagcaaacttaaacattcctgcctgctggctccga agagagcagctgatcccccagcacatcactcgagctctgctaagggatagactgcctcct tatgtgggtccctga 
>gi568815593r:87294449_87512794|GENSCAN_predicted_peptide_5|125_aa MNESKLLAGPTTRPGVTEQVWDPASCCRHQQKQTLGVNCAGAQVGVLATLKPHRACYSAL LAPLSVDACVLTTQLGSCLLTWGNCPPPVRAQCQCDSLSEYLHLVGPELLSEYKKNEVRQ TLEGQ >gi568815593r:87294449_87512794|GENSCAN_predicted_CDS_5|378_bp atgaacgagagcaagctccttgcggggcccacgaccagaccaggtgtgactgaacaagtg tgggatccggccagctgctgcaggcaccagcagaagcaaactctgggtgtgaactgtgcc ggtgcccaggtgggggtgcttgcaactctgaagccccacagggcgtgttacagtgctctc ttagctccactgtctgtggatgcttgtgtgttaacaactcagttgggctcttgcctcctc acgtggggcaactgccctccaccagtgagggcacagtgccagtgtgacagcctttctgag tacctgcacttggtgggccccgagctgttgtcagaatacaagaagaatgaggtcaggcag acacttgaaggacagtga >gi568815593r:87294449_87512794|GENSCAN_predicted_peptide_6|293_aa MGQVWGLVHFTLELFQTDDEEEQEYSKVTEEVTEHVYLPAKAKVAKEGEFLQFKTWWADE ASIQAAQNAQAQPQINITADQLLGVGGWADLDAHLVMQDDAIEQLSGVCIRAWEKITSGG EQYLSFSAIKQGPREPYVDFIARLQESLKKMIADLAAQDIMLQLLAFENANPDCQAALRP IRGKAYLVDYIKACDGIGGNLHKATLLAQAMAGLRVDKGNTPFPGACFNCGKYGHTKKEC RKNQGVRLPDRGKKKKTAEPEMCPKCKKKKKKIGLISVTLSLIKVGTRFRETP >gi568815593r:87294449_87512794|GENSCAN_predicted_CDS_6|882_bp atgggacaagtgtggggtctggttcatttcaccttggaactttttcagactgatgatgag gaggaacaagagtatagcaaagtaacagaagaggttacagagcatgtttatttgccagct aaagctaaagtggcaaaggagggagagttcttacaatttaaaacttggtgggcagatgaa gcttccattcaggctgctcaaaatgcccaggcccaacctcaaattaatataactgcagac cagcttttgggggttggtggctgggctgatttagatgcacatctggtcatgcaggatgat gccatagaacagcttagtggagtgtgcattagagcttgggaaaaaatcacttcaggtggg gaacaatacctttcctttagtgctataaaacagggaccaagagaaccatatgttgatttt atagctcggttacaggagtctcttaaaaagatgattgcagatttggctgctcaggatata atgttgcagttattagcttttgaaaatgctaatcctgattgccaggctgctctgcgacct atcagagggaaagcatatttagttgattatatcaaggcctgtgatggtatcggaggtaat ctgcataaagctactctgctagcacaggcaatggcaggactgagagtggataaaggaaat actccatttcctggagcttgttttaactgtgggaagtatggtcatactaaaaaagaatgt agaaaaaatcaaggagtcaggctgccagataggggaaaaaagaaaaaaactgctgagcct gaaatgtgtccaaaatgtaaaaaaaaaaaaaaaaaaattgggctaatcagtgtcactcta agtttgataaaggtgggaacccggtttcgggaaacaccatga