GENSCAN 1.0 Date run: 6-Nov-116 Time: 03:25:04 Sequence gi568815591f:66671881_66909281 : 237401 bp : 45.76% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 10189 10378 190 0 1 39 80 185 0.750 9.97 1.02 Intr + 10580 10694 115 0 1 90 18 95 0.048 2.21 1.03 Intr + 20844 21061 218 0 2 90 90 89 0.086 7.45 1.04 Term + 21609 21787 179 1 2 82 49 55 0.383 -1.15 1.05 PlyA + 24095 24100 6 1.05 2.00 Prom + 48651 48690 40 -3.06 2.01 Init + 55337 55425 89 1 2 22 96 88 0.857 3.11 2.02 Term + 56520 57090 571 0 1 2 54 541 0.579 36.01 2.03 PlyA + 63552 63557 6 1.05 3.04 PlyA - 65008 65003 6 1.05 3.03 Term - 68956 68498 459 1 0 74 43 216 0.847 10.79 3.02 Intr - 69413 69263 151 1 1 97 81 73 0.918 7.66 3.01 Init - 77176 77142 35 1 2 58 101 16 0.718 -0.76 3.00 Prom - 80760 80721 40 -5.86 4.00 Prom + 85645 85684 40 -5.66 4.01 Init + 92980 92990 11 0 2 79 86 11 0.423 -0.78 4.02 Intr + 97085 97145 61 2 1 70 115 39 0.369 3.54 4.03 Intr + 98217 98250 34 2 1 103 86 10 0.522 0.10 4.04 Intr + 100003 100198 196 2 1 64 40 170 0.844 8.27 4.05 Intr + 103347 103513 167 0 2 68 57 232 0.995 17.70 4.06 Intr + 111792 111961 170 1 2 50 73 108 0.405 5.17 4.07 Intr + 118018 118295 278 0 2 43 26 228 0.001 8.41 4.08 Intr + 121938 122001 64 0 1 88 78 60 0.227 3.72 4.09 Intr + 123631 123712 82 0 1 107 87 100 0.992 11.01 4.10 Intr + 125494 125626 133 2 1 70 68 113 0.975 7.20 4.11 Intr + 127443 127534 92 0 2 98 82 90 0.994 9.04 4.12 Intr + 133260 133516 257 1 2 89 105 333 0.982 32.26 4.13 Term + 137006 137404 399 1 0 105 43 289 0.999 21.22 4.14 PlyA + 137576 137581 6 1.05 5.12 PlyA - 137957 137952 6 1.05 5.11 Term - 139041 138917 125 1 2 98 49 20 0.229 -2.35 5.10 Intr - 144056 144000 57 0 0 102 116 37 0.866 6.76 5.09 Intr - 144322 144273 50 0 2 103 75 50 0.410 3.52 5.08 Intr - 152301 152118 184 1 1 36 94 337 0.150 27.95 5.07 Intr - 160600 160520 81 2 0 135 82 102 0.107 13.91 5.06 Intr - 164920 164855 66 2 0 81 101 105 0.375 9.98 5.05 Intr - 165914 165867 48 0 0 99 72 50 0.891 3.15 5.04 Intr - 167749 167660 90 2 0 48 110 38 0.762 1.97 5.03 Intr - 168054 168017 38 2 2 126 94 38 0.937 6.11 5.02 Intr - 172944 172710 235 1 1 -25 79 252 0.356 9.75 5.01 Init - 173066 172946 121 2 1 47 74 235 0.453 18.35 5.00 Prom - 204751 204712 40 -3.36 6.06 PlyA - 204823 204818 6 1.05 6.05 Term - 207603 207547 57 0 0 89 54 72 0.654 1.59 6.04 Intr - 208562 208445 118 1 1 30 123 51 0.235 3.27 6.03 Intr - 234687 234595 93 2 0 71 76 52 0.093 1.38 6.02 Intr - 236954 236713 242 2 2 114 69 98 0.889 6.85 6.01 Init - 237163 237026 138 1 0 87 108 56 0.814 7.66 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 20625 20676 52 2 1 93 103 46 0.888 7.93 S.002 Term - 160600 160482 119 2 2 135 55 129 0.892 13.00 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:66671881_66909281|GENSCAN_predicted_peptide_1|233_aa MFARRALRACAGGAGGAGQAGGVCEAGADWGTQTCRATWASGDLDHGADAGELRLGRQRT RGRVRGLSVQRARCRHLQRPAVPRRPDLDHFLAASLSGGSPRQALMSYIPLSLGWELPPK EPPHSQGGHGDSKKGSDSTESTQLLRSQLAPGTYILAGPYPPGLRAEARIMGVSETQPSV WSPDYSSSPPSAASPPGSQALTHPALSHGVKNPGGAKVFHTYQCSETLRTWTC >gi568815591f:66671881_66909281|GENSCAN_predicted_CDS_1|702_bp atgtttgcgcgccgtgcgttacgggcgtgcgctggcggtgccgggggggcggggcaagca gggggtgtgtgtgaggctggagcggactggggaacccagacctgccgagcgacgtgggcg agcggtgacctggaccacggcgcagacgcgggcgagctgcggctcggacgccagcggaca cgcgggcgagtgcgcggcctctccgtccaacgtgctcgttgccggcatctccagcgcccg gccgtgccccgacggcctgacctggatcacttcctggccgcctccctgtccggcggctcc cctcgccaagcactgatgtcctacatccccctgagcctaggatgggaactccctcctaag gaacctccacattcccaaggaggacatggagactccaaaaaaggaagtgattccacagag tccacacagctgctgagaagtcagctggctcctggcacctacatcctggctggaccctac cccccaggcttgagggctgaggcaaggatcatgggagtttcagagacacagccaagtgtc tggtcccctgactactcctcctcccctccctcggctgccagccctcctggctcccaggcc ctcacccaccctgccctctcccatggagttaagaatcctggaggagcaaaagttttccac acttaccagtgctcagaaacattgagaacttggacctgctag >gi568815591f:66671881_66909281|GENSCAN_predicted_peptide_2|219_aa MEGGVDLKNPVTQLPDLPVELALVLVRGTGFEVHGGTWVKNYSQEVASACSCSHHNHDLR FHDPQWHSHFTIAITSVFTIFTISSTITSILISTSTSIPNFIFTSILIILASTFSSVLTS TFTSILIILTSTFSSVLTSTFTSILTSTLTAILTSTLTFTSILTSTFTSILKTTFTYTYT SILTSTFNSTLISKLTSTLTCIFTSIQISIFTSTFFFFF >gi568815591f:66671881_66909281|GENSCAN_predicted_CDS_2|660_bp atggaaggtggggtggacctcaaaaatccagttactcaactacctgatttgccagtggag ctggccctggttcttgtccgtgggacaggcttcgaggttcatggagggacctgggtcaag aactattctcaggaagttgcctctgcttgttcctgttcccaccataaccatgaccttcga tttcatgacccacaatggcattcccatttcaccattgccatcaccagtgtcttcaccatt tttaccatttcttccaccatcacctccatccttatctccacctccacctctatccccaac ttcattttcacctccatcctcatcatcctcgcctcgactttcagctctgtcctcacctcc accttcacctccatcctcatcatcctcacctcaactttcagctctgtcctcacctccacc ttcacctccatcctcacctcgaccctcaccgccatcctcacctccaccctcaccttcacc tccatcctcacctccaccttcacctccatcctcaagaccaccttcacctacacctacacg tctatcctcacctccaccttcaactccactctcatctccaaactcacctccacgctcacc tgcatcttcacctccatccaaatctccatcttcacctccactttttttttttttttttga >gi568815591f:66671881_66909281|GENSCAN_predicted_peptide_3|214_aa MPPPGLQWRKSRVTWDQVPAPLPRPGAVVWVRASHTSAFRGGCGRGPVRPQTLLRSQRPG RKLRAEQLSAPVRPLRTSPASPPASKPPLARFARVGLGSAASSRLTPTLLLAPPPAPPAN ARYARRVNVTSGAFDSGREEVRVDPKGRAEWGTALTPKSKWKRAASGVKVGRWGAPEKPA FPSLSLSPCFRLPTRNSWRATLAPRRLLGLRPRW >gi568815591f:66671881_66909281|GENSCAN_predicted_CDS_3|645_bp atgccaccaccagggcttcagtggaggaagtcacgggtgacgtgggatcaagtccccgcg cccctgccgcggccgggggctgtggtctgggtccgagcgtcccataccagcgctttccgg ggaggttgtgggaggggccccgtgcgaccacagacgctgcttcggagccagcggcctgga cggaagctgagagccgagcagctctcagccccagtccgtccgctccgtacctcgcccgcg tctccgccggcgtccaaaccaccgctcgcccgcttcgcccgggtaggtctgggctccgcg gccagctcccgcctcacacccaccctcctgctcgccccgcccccggcaccgccagcgaac gcacgttacgcgcggcgcgtaaacgtcacttccggggcctttgactctggacgggaggaa gtgcgagtggatccaaagggtcgagcggagtggggtaccgccctgacgcccaagagcaaa tggaagagggcggcctccggggtaaaggtggggcgctggggagccccggagaaaccggcg tttcctagtctgtcgctcagtccctgcttccggcttccgacgcgcaacagctggagggcg acgctggctccgcggcgactcctggggctgcggccgcggtggtag >gi568815591f:66671881_66909281|GENSCAN_predicted_peptide_4|647_aa MALYHSCRQSTRLLGTVAQEKEEMSACTFGYELHEVVSRKKMSLKSERRGIHVDQSDLLC KKGCGYYGNPAWQGFCSKCWREEYHKARQKQIQEDWELAERLQREEEEAFASSQSSQGAQ SLTFSKFEEKKTNEKTRKVTTVKKFFSASSRVGSKKAEIQEAKAPSPSINRQTSIETDRV SKEFIEFLKTFHKTGQEIYKQTKLFLEGMHYKRLQNKRTVASTWKRMWIQTKWRTDTPST DKILMEEVKLEEQLKEAVEEDKQALADTEDLQQISQKLVEEANMYSIQGFCKDSLEVADV LEKATQVKPDGQTAVGFHRMYRPLACEDLSIEEQSECAQDFYHNVAERMQTRGKVPPERV EKIMDQIEKYIMTRLYKYVFCPETTDDEKKDLAIQKRIRALRWVTPQMLCVPVNEDIPEV SDMVVKAITDIIEMDSKRVPRDKLACITKCSKHIFNAIKITKNEPASADDFLPTLIYIVL KGNPPRLQSNIQYITRFCNPSRLMTGEDGYYFTNLCCAVAFIEKLDAQSLNLSQEDFDRY MSGQTSPRKQEAESWSPDACLGVKQMYKNLDLLSQLNERQERIMNEAKKLEKDLIDWTDG IAREVQDIVEKYPLEIKPPNQPLAAIDSENVENDKLPPPLQPQVYAG >gi568815591f:66671881_66909281|GENSCAN_predicted_CDS_4|1944_bp atggctttgtaccatagctgcagacaaagcacacgattgctgggcactgtggctcaagag aaggaggagatgtctgcctgtacttttggttatgagcttcatgaggtggttagcaggaag aagatgagccttaagtctgaacgccgaggaattcatgtggatcaatcggatctcctgtgc aagaaaggatgtggttactacggcaaccctgcctggcagggtttctgctccaagtgctgg agggaagagtaccacaaagccaggcagaagcagattcaggaggactgggagctggcggag cgactccagcgggaggaagaagaggcctttgccagcagtcagagcagccaaggggcccaa tccctcacattctccaagtttgaagaaaagaaaaccaacgagaagacccgcaaggttacc acagtgaagaaattcttcagtgcatcttccagggtcggatcaaagaaggcagaaattcag gaagcaaaagctcccagtccttccataaaccggcaaaccagcattgaaacggatagagtg tctaaggagttcatagaatttctcaagaccttccacaagacaggccaagaaatctataaa cagaccaagctgtttttggaaggaatgcattacaaaaggctacaaaacaaaagaacagtt gccagcacttggaagaggatgtggatccaaaccaaatggaggacagatactccctctacg gacaagatactcatggaagaagtcaagttagaagagcagctgaaggaggctgtagaagaa gataagcaagcattggcagatactgaggacttgcagcagatcagccaaaaattggtggag gaggcaaatatgtatagcattcagggcttctgcaaggactcgttagaggttgcagatgtt ttggagaaggcaacacaagttaagccagatggtcagactgctgtgggcttccacaggatg tatcggcccttggcgtgtgaggatctaagcattgaagaacagtcagagtgtgctcaggat ttctaccacaatgtggccgaaaggatgcaaactcgtgggaaagtgcctccagaaagagtc gagaagataatggatcagattgaaaagtacatcatgactcgtctctataaatatgtattc tgtccagaaactactgatgatgagaagaaagatcttgccattcaaaagagaatcagagcc ctgcgctgggttacgcctcagatgctgtgtgtccctgttaatgaagacatcccagaagtg tctgatatggtggtgaaggcgatcacagatatcattgaaatggattccaagcgtgtgcct cgagacaagctggcctgcatcaccaagtgcagcaagcacatcttcaatgccatcaagatc accaagaatgagccggcgtcagcggatgacttcctccccaccctcatctacattgttttg aagggcaaccccccacgccttcagtctaatatccagtatatcacgcgcttctgcaatcca agccgactgatgactggagaggatggctactatttcaccaatctgtgctgtgctgtggct ttcattgagaagctagacgcccagtctttgaatctaagtcaggaggattttgatcgctac atgtctggccagacctctcccaggaagcaagaagctgagagttggtctcctgatgcttgc ttaggcgtcaagcaaatgtataagaacttggatctcttgtctcagttgaatgaacgacaa gaaaggatcatgaatgaagccaagaaactggaaaaagacctcatagattggacagatgga attgcaagagaagttcaagacatcgttgagaaatacccactggaaattaagcctccgaat caaccgttagcagctattgactctgaaaacgttgaaaatgataaacttcctccaccactg caacctcaagtttatgcaggatga >gi568815591f:66671881_66909281|GENSCAN_predicted_peptide_5|364_aa MWNQSELEERLPGRPEARMRRRVATAGRSVFAGSDPAANTISYKEELSHCLCYYYRRDFP ACSVGRSKGLTLSEQALHTKRLSPGGHKQKVGQKLLNGHCKPASSGCSRCRTLASPEAGS DKGSMCEDCGPDPSPTSEEMTDSMAGHLPSEDSDCGMEMLTDKGLSEDPRPEERPVEDSQ GDVIRPLWKQVELLFNTRYGTSGELGWLKPIKIEPEDLDIIAVTVPTKAIGILEPVKMPY SKFLMHPEELFVVGLSEGISLRRPNCFGIAKLWKILEASNSIQFVIKRPELLTEGVKEPI IDSQERDSRDPLVGESLKRQGFQVRDDTDVSSLIDLICKFKNKKFYTHPGFIPKLLLCNF GLHP >gi568815591f:66671881_66909281|GENSCAN_predicted_CDS_5|1095_bp atgtggaaccaatcggaactcgaggagcggctgccgggtcgtccagaagcgcgcatgcgc agacgcgtggccacagccggccggtcagtgttcgcaggctccgacccggcggccaacacc ataagctacaaggaggagctttcccactgcctgtgctactactaccggcgcgacttccca gcctgcagcgtagggcgcagcaagggcctgacgctgagcgagcaggcgctgcacaccaag cggttgtcgcctggcgggcacaaacagaaggtcgggcagaagctccttaatggccactgc aagccggctagctccggctgcagccgttgccgcacactcgcctcacctgaggctgggtca gacaagggcagcatgtgtgaagactgcgggccagacccgtcaccaacctctgaggaaatg acagactcaatggctgggcacctgccatcggaggattctgattgtgggatggagatgctg acagacaaaggcctgagtgaggacccacggcctgaggagaggcccgtggaggacagtcaa ggtgacgtgatccggcccctgtggaagcaggtggagctgctcttcaacacaagatacgga acctccggggagctgggttggctgaagccaatcaaaattgagccagaggatctggacatc attgcggtcactgtcccaaccaaggccattggcatcttggaaccagtcaagatgccgtac tccaagtttctgatgcacccggaggagctgtttgtggtggggttgtctgaaggcatctcc ctccgcaggcccaactgcttcgggatcgccaagctctggaagattctggaggccagcaac agcatccagtttgtcatcaagaggcctgagctgctcactgagggagtcaaagagcccatc attgacagtcaagagagggattccagggaccctctggtgggcgagagcctgaagagacag ggctttcaagtgcgtgatgacacagatgtgtcaagcttaatagatcttatatgtaaattt aagaataagaagttttatacccaccctggttttattccaaagttactgctctgcaatttt gggctccatccttag >gi568815591f:66671881_66909281|GENSCAN_predicted_peptide_6|215_aa MRPELPVMNWCFLTHLAIKWVVHSSIPSSDGGGIYVIGLQLVLKAQPEMMASWGVPYDQL TEEEKTRGWFTGGSARYAGTTQKWTAAALQPLSGTSLMDSSEEKSFQWTELQAVHLVVHF AWKEMARTSDSKFFRFGTQNGFLAPQLADSLLWNLVILNDRNPLHTAVAPGSLCILIKRQ GSCSKTGGVYDATKQRSAATGKVDSKGTIPCVTDK >gi568815591f:66671881_66909281|GENSCAN_predicted_CDS_6|648_bp atgagacctgaactacctgtcatgaactggtgttttctgacccacctagccataaagtgg gtcgtgcacagcagcattccatcatcagatggaggtggtatatatgtgatcgggctccag ctggtcctgaaggcacaacctgaaatgatggcctcatggggagttccctatgatcagttg acagaggaagagaagactaggggctggttcacaggtggttctgcacgatatgcaggcacc acccaaaaatggacagctgcagcactacagcccctttctgggacatcccttatggacagc agtgaagaaaaatctttccagtggacagaacttcaagcagtgcacctggttgtgcacttt gcatggaaggaaatggccagaacatcggactccaagttcttcaggtttgggactcagaat ggcttccttgctcctcagcttgcagacagcctattgtggaaccttgtgatcctaaatgac cgaaatccactccatacagcagtggcccctggcagcctttgcattctcatcaagaggcag ggtagctgttcaaagacaggtggtgtctatgatgcaactaagcagcgctcggcagccacg gggaaagtggacagcaaagggaccattccctgtgtcactgataagtga