GENSCAN 1.0    Date run: 3-Nov-116    Time: 11:22:21

Sequence gi568815587r:44164857_44410062 : 245206 bp : 46.30% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   6755   6886  132  1  0   82  103   195 0.976  21.24
 1.02 Intr +  32973  33162  190  2  1   95  116   163 0.806  18.96
 1.03 Intr +  41937  42103  167  1  2   67   94   177 0.973  15.88
 1.04 Intr +  67497  67640  144  2  0  103  116   102 0.997  14.98
 1.05 Intr +  69259  69387  129  0  0   45   97   198 0.999  17.29
 1.06 Intr +  71437  71519   83  0  2   97  107    24 0.997   3.64
 1.07 Term +  79293  79431  139  0  1  107   47   183 0.999  13.54
 1.08 PlyA +  86622  86627    6                               1.05

 2.17 PlyA -  87633  87628    6                               1.05
 2.16 Term - 100327  99998  330  1  0  111   48   482 0.745  41.16
 2.15 Intr - 102766 102638  129  1  0   97  111   193 0.999  23.39
 2.14 Intr - 110802 110492  311  1  2  120   82   450 0.488  43.63
 2.13 Intr - 113967 113942   26  2  2  138   93    15 0.328   4.67
 2.12 Intr - 129032 128965   68  2  2  113   38    13 0.010  -3.50
 2.11 Intr - 129440 129315  126  2  0   73  100     2 0.412   0.78
 2.10 Intr - 131168 131103   66  2  0   93   85   108 0.895  10.10
 2.09 Intr - 135864 135741  124  2  1   68   17    56 0.373  -2.91
 2.08 Intr - 139378 139231  148  2  1   79   94   168 0.822  15.79
 2.07 Intr - 141582 141453  130  0  1   76   77     2 0.590  -1.83
 2.06 Intr - 144545 144234  312  2  0   65   94   196 0.811  14.28
 2.05 Intr - 145273 144700  574  0  1   55    1   629 0.021  43.64
 2.04 Intr - 152290 152035  256  2  1   32   75   159 0.081   5.50
 2.03 Intr - 164032 163931  102  2  0   54   95    26 0.012   0.05
 2.02 Intr - 164327 164303   25  2  1   99   65    11 0.005  -2.50
 2.01 Init - 175811 175434  378  2  0   72   77   187 0.675  10.91
 2.00 Prom - 176581 176542   40                              -3.26

 3.09 PlyA - 176706 176701    6                               1.05
 3.08 Term - 178837 178745   93  1  0   40   49   103 0.617  -0.57
 3.07 Intr - 196071 195957  115  2  1   88   38    59 0.274   1.35
 3.06 Intr - 207107 206935  173  2  2    0  108    98 0.251   1.74
 3.05 Intr - 207546 207519   28  2  1  121   78    12 0.270   1.62
 3.04 Intr - 211646 211587   60  1  0  107   76    47 0.291   3.25
 3.03 Intr - 228032 227381  652  0  1   59   35   238 0.075   6.57
 3.02 Intr - 236916 236778  139  0  1   14   90    78 0.101   0.64
 3.01 Intr - 244577 244373  205  1  1   40   66   126 0.303   4.80

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587r:44164857_44410062|GENSCAN_predicted_peptide_1|327_aa
ARWFWEAYFQSIKAIALATLQIINDRIYPYAAISYEEWNDPPAVKWGSVSNPLFLPLIPP
QSQGFTAIVLTYDRVESLFRVITEVSKVPSLSKLLVVWNNQNKNPPEDSLWPKIRVPLKV
VRTAENKLSNRFFPYDEIETEAVLAIDDDIIMLTSDELQFGYEVWREFPDRLVGYPGRLH
LWDHEMNKWKYESEWTNEVSMVLTGAAFYHKYFNYLYTYKMPGDIKNWVDAHMNCEDIAM
NFLVANVTGKAVIKVTPRKKFKCPECTAIDGLSLDQTHMVERSECINKFASVFGTMPLKV
VEHRADPVLYKDDFPEKLKSFPNIGSL

>gi568815587r:44164857_44410062|GENSCAN_predicted_CDS_1|984_bp
gcccggtggttctgggaagcgtacttccagtcaattaaagccattgccctggccaccctg
cagattatcaatgaccggatctatccatatgctgccatctcctatgaagaatggaatgac
cctcctgctgtgaagtggggcagcgtgagcaatccactcttcctcccgctgatcccacca
cagtctcaagggttcaccgccatagtcctcacctacgaccgagtagagagcctcttccgg
gtcatcactgaagtgtccaaggtgcccagtctatccaaactacttgtcgtctggaataat
cagaataaaaaccctccagaagattctctctggcccaaaatccgggttccattaaaagtt
gtgaggactgctgaaaacaagttaagtaaccgtttcttcccttatgatgaaatcgagaca
gaagctgttctggccattgatgatgatatcattatgctgacctctgacgagctgcaattt
ggttatgaggtctggcgggaatttcctgaccggttggtgggttacccgggtcgtctgcat
ctctgggaccatgagatgaataagtggaagtatgagtctgagtggacgaatgaagtgtcc
atggtgctcactggggcagctttttatcacaagtattttaattacctgtatacctacaaa
atgcctggggatatcaagaactgggtagatgctcatatgaactgtgaagatattgccatg
aacttcctggtggccaacgtcacgggaaaagcagttatcaaggtaaccccacgaaagaaa
ttcaagtgtcctgagtgcacagccatagatgggctttcactagaccaaacacacatggtg
gagaggtcagagtgcatcaacaagtttgcttcagtcttcgggaccatgcctctcaaggtg
gtggaacaccgagctgaccctgtcctgtacaaagatgactttcctgagaagctgaagagc
ttccccaacattggcagcttatga

>gi568815587r:44164857_44410062|GENSCAN_predicted_peptide_2|1034_aa
MGVVGGAGTAAMPQLLTPLRPYLPQDAHSRFQAQPSWQQYQGHCPTGKEDPREQKQSLSL
GLEGIDNSPPLPSHTHTLPHPCAEGHILPLNLTAGCCLQAQRFGHLQIAASEPMVAPSLF
SNQEKKIVARKKGEDYVATQVLSLVPISYFDITIQDQFTCGGKCTEAPELRWPPNTVESS
PPGLSSQVFRGSPELLSPPCALGLALAREGGAPGSIAHADFAFRREPQPPRGKAGGRLPK
SRDAEPGSEYAGQSQGARWRPRALARVPARRLRKPGMNAETCVSYCESPAAAMDAYYSPV
SQSREGSSPFRAFPGGDKFGTTFLSAAAKAQGFGDAKSRARYGAGQQDLATPLESGAGAR
GSFNKFQPQPSTPQPQPPPQPQPQQQQPQPQPPAQPHLYLQRGACKTPPDGSLKLQEGSS
GHSAALQVPCYGECTCGSPGAGDHGARSVSAVKLDLEFVFPGPFGISPNRSLCAARWAGF
EADASYRPRACGGKESPRSALALGCGAAGLRGCGAAGLRGCGAAGLRDSRTAEASLLAGW
EAPRAAPPQSPIASRQGRCPGQLSLGQAEGVRGGEAASSLCWPGLDRVAERVAQGSPAIS
TGPHRGDSHRIGKPTLRGPGYVMIPSGGIRVEADRPVVPLDSATPFLPVLNGTLALSHLG
GEYEEDVMMERGSVSAPYILHHQASDQQVTEIQLKDTFVFRDLELLPGPHLQLRKHLVGI
ALGHTWTETFCGNPLGDISILNLKMRKSHPENQASPAQDPHVICGSSSCGSSGTVLAAAK
ESSLGEPELPPDSDTVGMDSSYLSVKEAGVKGPQDRASSDLPSPLEKADSESNKGKKRRN
RTTFTSYQLEELEKVFQKTHYPDVYAREQLAMRTDLTEARVQVWFQNRRAKWRKRERFGQ
MQQVRTHFSTAYELPLLTRAENYAQIQNPSWLGNNGAASPVPACVVPCDPVPACMSPHAH
PPGSGASSVTDFLSVSGAGSHVGQTHMGSLFGAASLSPGLNGYELNGEPDRKTSSIAALR
MKAKEHSAAISWAT

>gi568815587r:44164857_44410062|GENSCAN_predicted_CDS_2|3105_bp
atgggggtggtcgggggggcggggacagcagcgatgccccagctgctcacccctctgagg
ccatacctgcctcaagatgcccattccagattccaggctcagcctagctggcagcagtac
cagggtcactgccccactgggaaagaggacccaagggaacaaaaacagagcctgagcctt
ggcctagagggtattgacaacagccccccactcccatcacacacccacacactcccccat
ccatgtgcagagggccacatcttgccactgaatttaactgctggctgctgcctccaggct
cagcgctttggccacctgcagatagctgcttctgagccaatggtggcaccatcactcttc
tccaaccaggagaagaagattgtggccaggaagaaaggagaagattatgtggccacccag
gtattaagcctagtgcccattagttattttgatattaccattcaggatcagtttacctgt
ggaggtaaatgcacagaagctccagaattaaggtggccaccgaacacggtggagagctcg
cctccagggctttcgagccaggttttccgcgggtccccggagctcctctccccgccctgc
gctcttggccttgcccttgccagagagggcggggcaccaggttccatcgcgcacgcggac
ttcgccttcaggcgagaaccccagcccccgcgcggaaaagcgggtggccggctccccaag
agcagagatgcagagcctggaagcgaatatgcagggcaaagccaaggcgcgcggtggcgt
cctcgcgccctcgctcgcgtccccgcccgccgcctgcgcaagccaggcatgaatgctgag
acttgcgtctcttactgcgagtcgccggccgctgccatggacgcctactacagcccggtg
tcgcagagtcgggagggctcgtcgccttttagggcatttcccggaggcgacaagttcggc
acaactttcctgtcggccgccgccaaagcacagggattcggggacgccaagagccgggcc
cgttacggcgctgggcagcaggacctggcgacacccctggagagtggagctggggcgcgg
ggctcctttaacaagttccagccccagccgtcgaccccgcagccccagccgccgccgcag
ccgcagccgcagcagcagcagccgcagccccagccgcccgcgcaaccgcatctttacttg
cagcgaggcgcctgcaagacgcccccggacggcagcctcaaactccaggaaggcagcagc
ggccacagcgcggccttgcaggttccctgctacggtgagtgcacgtgcgggtcacctggt
gctggggaccacggggcacggagcgtttcggctgttaaactcgacttggagtttgttttt
cccggtcccttcggcatttccccaaaccgctcgctttgcgctgcccgctgggcgggattt
gaagcggatgcttcttaccggcccagggcgtgcggcggcaaggagtctccgcgcagcgct
ctggcgctgggctgcggggctgcggggctgcggggctgcggggctgcggggctgcgaggc
tgcggggctgcggggctgcgggacagcaggactgcagaagcctcgctcctcgccggctgg
gaggctcctcgggcggcgccgccgcagagtccaattgcctctcggcaaggaaggtgcccc
gggcagctgagcctgggccaagccgagggagtgcggggcggggaagccgcctccagtctc
tgctggccgggtctggaccgggttgcggaaagagttgcgcagggttctccggccatcagc
acaggtcctcaccgaggtgacagtcatcggattgggaaaccaacactgcgaggacccggc
tatgtgatgataccgtccgggggaattcgagtggaagccgaccgtcctgtggtgccacta
gacagtgccactccatttctccctgtgttgaatggcactttggccctgtcccacctggga
ggggagtatgaggaggatgtgatgatggagagaggtagtgtgtctgccccgtacatcctg
caccaccaggccagtgaccaacaggtcacagaaattcagctcaaggacacatttgttttc
cgggacctggagctgcttcctgggccccatctgcaattaagaaagcacctggtgggcatt
gcccttggccacacctggactgagactttttgtggaaacccactgggagatattagtatc
ctcaatttgaagatgaggaagagccatcctgagaaccaggccagcccagcccaggacccc
catgtgatatgtgggagcagctcctgtgggtcctcagggacagtcctggctgctgctaaa
gagagctccctgggtgagccagagttaccccctgactctgacactgtggggatggacagc
agctacctgagtgtcaaggaggctggggtgaaggggccccaggaccgggccagctcagac
ctccccagcccattggagaaggccgactcagagagcaacaagggcaagaagcggcggaac
cggaccaccttcaccagctaccagctggaggagctggagaaggtcttccagaagacccac
tacccagacgtgtatgcgcgggaacagctggccatgaggacagacctcactgaggcccgc
gtgcaggtctggttccagaaccgaagggccaagtggaggaagcgggagcgttttgggcag
atgcagcaggttcgaacccacttctccactgcatatgagctgcccctcctcacccgagct
gagaactacgcccagattcagaacccgtcctggctcggcaacaacggggctgcctcacca
gtgccagcctgcgtggtcccctgcgacccggtgcctgcctgcatgtcccctcatgcccac
ccccctggctctggggccagcagcgtcaccgacttcctgagtgtgtctggggctggcagt
cacgtgggccagacgcacatgggcagcctgtttggagcagccagcctcagcccaggcctc
aatggctacgagctcaacggcgagccggaccgcaagacctcgagcatcgcggccctccgc
atgaaggccaaggagcacagtgcggccatttcctgggccacatga

>gi568815587r:44164857_44410062|GENSCAN_predicted_peptide_3|488_aa
XTIPNSLVASVGPCAWESIRWPADVSNTPSAYGCHKGVWEQGHVHMKTHGYPEYLGLLLS
SQAKAGDSKHSRHKSRLAEVTRAGQFQALFGCMAWSSGISNVPGKSHTAPRRHTQVLEVL
ARAIWQEKEIKRIQLEKEEVKLSLFADDIIVYLENPIISAQNLLKLISNFSKVSGYKINV
QNSQAFLYTNNRQTKSQIMSELPFAIASKRIKYLRIQLTRDVKDLFKENYKPLLNEIRED
TNKWKNIPCSWIGKINIVKMAILPKVIYRFNVIPIKLPMTFFTELEKTTLKFIWNQKRAC
IAKTILSQKNKAGSITLPDFKLYYKATVTKTAWASPTITRQTRTFLPPGKPSRNERVEYI
SEKSKLTSEELGRHGGGQETGSTFGIGVIGLNQLDPHMLLPLTWCCQIKYKKFELQLNNK
FLPERTMDPWNTLQVLLGQYLSSPGFGLPICAKRRLDKIIIRNQKQSAVPTGDPHPGPLL
PWLLQESD

>gi568815587r:44164857_44410062|GENSCAN_predicted_CDS_3|1467_bp
nngacaatacccaactccctggtggcctctgtgggaccttgtgcatgggaaagcatccgg
tggccagcggatgtctccaacacgccaagtgcctatgggtgccacaaaggggtttgggag
cagggtcatgtccacatgaagacccatgggtaccccgagtatttggggctgctactgtcc
tctcaggccaaagctggggacagtaagcactcccggcacaagtccagactggcagaagta
acccgtgcagggcagttccaggccctttttggctgtatggcttggtcaagtgggatcagc
aatgttcctgggaaatcccacactgcaccaagaaggcacacgcaagtgttggaagttctg
gccagggcaatctggcaggagaaagaaataaagcgtattcaattagaaaaagaggaagtc
aaattgtccctgtttgcagatgacataattgtatatttagaaaaccccatcatctcagcc
caaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtg
caaaattcacaagcattcctatacaccaataacagacaaacaaagagtcaaatcatgagt
gaactcccattcgcaattgcttcaaagagaataaaatacctaagaatccaacttacaagg
gatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataagagaggac
acaaacaaatggaagaacattccatgctcatggataggaaaaatcaatattgtgaaaatg
gccatactgcccaaggtaatttatagattcaatgtcatccccatcaagctaccaatgact
ttcttcacagaattggaaaaaactaccttaaagttcatatggaaccaaaaaagagcctgc
attgccaagacaatcctaagccaaaagaacaaagctggaagcatcacactacctgacttc
aaactatactacaaggctacagtaaccaaaacagcatgggcctctccaaccatcacccga
caaacaaggaccttcttacccccaggaaaaccctccaggaatgaaagagttgagtacatc
tcagagaagagcaagctaacctctgaggagcttgggaggcatggagggggccaggagact
ggtagcacctttgggatcggggtcatcggcctgaatcagttggacccccacatgctcctt
ccccttacctggtgttgccagataaagtacaagaagtttgaattacagctgaataacaag
tttcttccagaaaggaccatggacccctggaacactctccaggtcttgcttgggcagtac
ctttcctctccaggctttggtttacccatctgtgccaaaaggcgattggataagatcatc
atcaggaatcagaagcagtcagctgtgcccactggggaccctcacccaggacccctgctg
ccctggctgctgcaggaatcagactga