GENSCAN 1.0 Date run: 5-Nov-116 Time: 19:59:23 Sequence gi568815583r:52009851_52212726 : 202876 bp : 44.19% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 1650 1645 6 1.05 1.02 Term - 9168 8959 210 0 0 39 44 103 0.667 -1.71 1.01 Init - 9643 9545 99 1 0 83 38 182 0.931 10.96 1.00 Prom - 34866 34827 40 0.04 2.00 Prom + 35676 35715 40 -5.46 2.01 Init + 36611 37165 555 1 0 68 95 316 0.957 25.27 2.02 Intr + 40143 40287 145 2 1 88 92 57 0.930 5.96 2.03 Intr + 48783 48947 165 1 0 87 67 80 0.975 5.73 2.04 Intr + 51449 51650 202 0 1 101 94 112 0.997 11.44 2.05 Term + 54052 55150 1099 1 1 64 41 508 0.999 35.26 2.06 PlyA + 55355 55360 6 1.05 3.00 Prom + 70161 70200 40 -2.06 3.01 Init + 77914 77921 8 0 2 103 91 0 0.860 2.30 3.02 Intr + 84407 84638 232 2 1 99 74 88 0.903 6.28 3.03 Term + 85394 85561 168 1 0 37 42 202 0.999 8.38 3.04 PlyA + 85649 85654 6 1.05 4.03 PlyA - 86590 86585 6 1.05 4.02 Term - 92873 92784 90 2 0 92 39 57 0.045 -1.08 4.01 Init - 102876 102388 489 0 0 85 80 833 0.610 77.40 4.00 Prom - 106643 106604 40 -4.36 5.13 PlyA - 108541 108536 6 1.05 5.12 Term - 110056 109976 81 1 0 93 48 -1 0.191 -5.91 5.11 Intr - 110520 110452 69 0 0 63 86 75 0.399 4.18 5.10 Intr - 114789 114623 167 1 2 61 52 107 0.870 4.08 5.09 Intr - 116194 116098 97 1 1 68 83 93 0.984 6.38 5.08 Intr - 118394 118346 49 1 1 63 92 55 0.545 2.08 5.07 Intr - 123619 123528 92 1 2 88 77 183 0.438 15.99 5.06 Intr - 125255 125142 114 2 0 114 93 -16 0.797 2.14 5.05 Intr - 125906 125763 144 2 0 86 55 204 0.974 17.38 5.04 Intr - 131422 131290 133 0 1 86 99 181 0.983 19.65 5.03 Intr - 137685 137609 77 0 2 65 106 8 0.047 -1.39 5.02 Intr - 140075 140034 42 2 0 141 100 42 0.970 9.14 5.01 Init - 144329 144090 240 2 0 58 91 211 0.795 16.38 5.00 Prom - 147714 147675 40 -0.66 6.00 Prom + 148461 148500 40 -5.46 6.01 Init + 148510 148519 10 0 1 99 79 9 0.905 0.67 6.02 Intr + 149443 149573 131 2 2 58 77 108 0.935 7.11 6.03 Term + 149636 149770 135 1 0 76 47 72 0.693 -0.08 6.04 PlyA + 150772 150777 6 1.05 7.09 PlyA - 152472 152467 6 1.05 7.08 Term - 154136 153663 474 2 0 97 36 121 0.001 2.69 7.07 Intr - 163964 163921 44 0 2 90 24 54 0.002 -2.84 7.06 Intr - 170029 169918 112 1 1 23 75 263 0.013 18.45 7.05 Intr - 185607 185527 81 0 0 88 94 44 0.950 4.83 7.04 Intr - 186633 186459 175 2 1 114 60 123 0.985 12.04 7.03 Intr - 195297 195015 283 1 1 95 98 499 0.755 48.08 7.02 Intr - 199333 199297 37 1 1 62 95 -3 0.206 -4.36 7.01 Intr - 202034 201880 155 0 2 94 97 224 0.504 23.69 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 135050 134980 71 2 2 98 45 101 0.949 7.26 S.002 Init + 153205 153265 61 0 1 94 101 41 0.944 7.42 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:52009851_52212726|GENSCAN_predicted_peptide_1|102_aa MEEGAGSAVAAAAAAAGPAAAPSRAAPPAHPAHGGRQLGPFALLPALALREQSDQTQCQR LLSTYYVQGTGQSERKRPQSPGRVDLKMDWLEGDSPSTRQTR >gi568815583r:52009851_52212726|GENSCAN_predicted_CDS_1|309_bp atggaggagggagccggctccgccgtcgccgccgccgccgccgccgccgggcccgccgcg gcgccctcccgagccgccccgccggcgcaccccgcacacggaggaagacagctggggccg tttgctctactacctgccctggcattaagggaacaaagtgaccaaacccagtgccagcgt ttactaagcacctactacgtgcaaggcactgggcaatccgagaggaagaggcctcagtct ccaggaagggtggaccttaagatggattggctggaaggcgacagcccgagcacaagacag acgcgttag >gi568815583r:52009851_52212726|GENSCAN_predicted_peptide_2|721_aa MAEKFESLMNIHGFDLGSRYMDLKPLGCGGNGLVFSAVDNDCDKRVAIKKIVLTDPQSVK HALREIKIIRRLDHDNIVKVFEILGPSGSQLTDDVGSLTELNSVYIVQEYMETDLANVLE QGPLLEEHARLFMYQLLRGLKYIHSANVLHRDLKPANLFINTEDLVLKIGDFGLARIMDP HYSHKGHLSEGLVTKWYRSPRLLLSPNNYTKAIDMWAAGCIFAEMLTGKTLFAGAHELEQ MQLILESIPVVHEEDRQELLSVIPVYIRNDMTEPHKPLTQLLPGISREALDFLEQILTFS PMDRLTAEEALSHPYMSIYSFPMDEPISSHPFHIEDEVDDILLMDETHSHIYNWERYHDC QFSEHDWPVHNNFDIDEVQLDPRALSDVTDEEEVQVDPRKYLDGDREKYLEDPAFDTNYS TEPCWQYSDHHENKYCDLECSHTCNYKTRSSSYLDNLVWRESEVNHYYEPKLIIDLSNWK EQSKEKSDKKGKSKCERNGLVKAQIALEEASQQLAGKEREKNQGFDFDSFIAGTIQLSSQ HEPTDVVDKLNDLNSSVSQLELKSLISKSVSQEKQEKGMANLAQLEALYQSSWDSQFVSG GEDCFFINQFCEVRKDEQVEKENTYTSYLDKFFSRKEDTEMLETEPVEDGKLGERGHEEG FLNNSGEFLFNKQLESIGIPQFHSPVGSPLKSIQATLTPSAMKSSPQIPHQTYSSILKHL N >gi568815583r:52009851_52212726|GENSCAN_predicted_CDS_2|2166_bp atggcagagaaatttgaaagtctcatgaacattcatggttttgatctgggttctaggtat atggacttaaaaccattgggttgtggaggcaatggcttggttttttctgctgtagacaat gactgtgacaaaagagtagccatcaagaaaattgtccttactgatccccagagtgtcaaa catgctctacgtgaaatcaaaattattagaagacttgaccatgataacattgtgaaagtg tttgagattcttggtcccagtggaagccaattaacagacgatgtgggctctcttacggaa ctgaacagtgtttacattgttcaggagtacatggagacagacttggctaatgtgctggag cagggccctttactggaagagcatgccaggcttttcatgtatcagctgctacgggggctc aagtatattcactctgcaaatgtactgcacagagatctcaaaccagctaatcttttcatt aatacggaagacttggtgctgaagataggtgactttggtcttgcacggatcatggatcct cattattcccataagggtcatctttctgaaggattggttactaaatggtacagatctcca cgtcttttactttctcctaataattatactaaagccattgacatgtgggctgcaggctgc atctttgctgaaatgctgactggtaaaaccctttttgcaggtgcacatgaacttgaacag atgcagctgattttagaatctattcctgttgtacatgaggaagatcgtcaggagcttctc agcgtaattccagtttacattagaaatgacatgactgagccacacaaacctttaactcag ctgcttccaggaattagtcgagaagcactggatttcctggaacaaattttgacatttagc cccatggatcggttaacagcagaagaagcactctcccatccttacatgagcatatattct tttccaatggatgagccaatttcaagccatccttttcatattgaagatgaagttgatgat attttgcttatggatgaaactcacagtcacatttataactgggaaaggtatcatgattgt cagttttcagagcatgattggcctgtacataacaactttgatattgatgaagttcagctt gatccaagagctctgtccgatgtcactgatgaagaagaagtacaagttgatccccgaaaa tatttggatggagatcgggaaaagtatctggaggatcctgcttttgataccaattactct actgagccttgttggcaatactcagatcatcatgaaaacaaatattgtgatctggagtgt agccatacttgtaactacaaaacgaggtcatcatcatatttagataacttagtttggaga gagagtgaagttaaccattactatgaacccaagcttattatagatctttccaattggaaa gaacaaagcaaagaaaaatctgataagaaaggcaaatcaaaatgtgaaaggaatggattg gttaaagcccagatagcgctagaggaagcatcacagcaactggctggaaaagaaagggaa aagaatcagggatttgattttgattcctttattgcaggaactattcagcttagttcccag catgagcctactgatgttgttgataaattaaatgacttgaatagctcagtgtcccaacta gaattgaaaagtttgatatcaaagtcagtaagccaagaaaaacaggaaaaaggaatggca aatctggctcaattagaagccttgtaccagtcttcttgggacagccagtttgtgagtggt ggggaggactgttttttcataaatcagttttgtgaggtaaggaaggatgaacaagttgag aaggaaaacacttacactagttacttggacaagttctttagcaggaaagaagatactgaa atgctagaaactgagccagtagaggatgggaagcttggggagagaggacatgaggaagga tttctgaacaacagtggggagttcctctttaacaagcagctcgagtccataggcatccca cagtttcacagtccagttgggtcaccacttaagtcaatacaggccacattaacaccttct gctatgaaatcttcccctcaaattcctcatcaaacatacagcagcattctgaaacatctg aactaa >gi568815583r:52009851_52212726|GENSCAN_predicted_peptide_3|135_aa MPSSSQLLALSSFPWKNIPAVGAWNTPASATPLWPDVYGCVNVQQEASGFHPGAKVYGAS RLERTVWLTEVACLAAGSTYVGTFNNVATLHCLTAAIMGRMHAPGKGPSRLALPYRHSIP TWLKLTSDDVKEQIY >gi568815583r:52009851_52212726|GENSCAN_predicted_CDS_3|408_bp atgcccagctcatctcagctgcttgccctctcatcctttccttggaaaaacatcccagct gttggggcctggaatactccagcatcagccacacccctctggcctgatgtgtatggctgt gtcaatgtgcagcaggaggcaagtggcttccaccctggtgccaaggtctatggagccagc agactggagaggacagtctggctgacagaggtggcctgccttgcagctggaagcacttac gtgggaacatttaataacgtggccacacttcactgcctgactgccgccatcatgggtcgc atgcatgctcctgggaagggcccgtcccggttggctttgccctatcgccacagcatcccc acttggctgaagttgacatctgatgacgtgaaggagcagatttactaa >gi568815583r:52009851_52212726|GENSCAN_predicted_peptide_4|192_aa MVDQLRERTTMADPLRERTELLLADYLGYCAREPGTPEPAPSTPEAAVLRSAAARLRQIH RSFFSAYLGYPGNRFELVALMADSVLSDSPGPTWGRVVTLVTFAGTLLERGPLVTARWKK WGFQPRLKEQEGDVARDCQRLVALLSSRLMGQHRAWLQAQGGWGELIACMNADGSDARDG AFDDAGRRGEEI >gi568815583r:52009851_52212726|GENSCAN_predicted_CDS_4|579_bp atggttgaccagttgcgggagcgcaccaccatggccgacccgctgcgggagcgcaccgag ctgttgctggccgactacctggggtactgcgcccgggaacccggcacccccgagccggcg ccatccacgcccgaggccgccgtgctgcgctccgcggccgccaggttacggcagattcac cggtcctttttctccgcctacctcggctaccccgggaaccgcttcgagctggtggcgctg atggcggattccgtgctctccgacagccccggccccacctggggcagagtggtgacgctc gtgaccttcgcagggacgctgctggagagagggccgctggtgaccgcccggtggaagaag tggggcttccagccgcggctaaaggagcaggagggcgacgtcgcccgggactgccagcgc ctggtggccttgctgagctcgcggctcatggggcagcaccgcgcctggctgcaggctcag ggcggctggggagaattaatagcatgcatgaatgcagatggaagtgatgcaagagatgga gcatttgatgatgcaggcaggagaggagaggagatctag >gi568815583r:52009851_52212726|GENSCAN_predicted_peptide_5|434_aa MGPPPSAEHMSVLCQPKAQNLVLSKGTDYGLVIPVHQVAERVEALGQFVMKTRRTLKGHG NKVLCMDWCKDKRRIVSSSQDGKVIVWDSFTTNKEHAVTMPCTWVMACAYAPSGCAIACG GLDNKCSVYPLTFDKNENMAAKKKSVAMHTNYLSACSFTNSDMQILTASGDGTCALWDVE SGQLLQSFHGHGADVLCLDLAPSETGNTFVSGGARNRGFESSELADRATETQRSTRLGVR VPGSGLCYLTGCDKKAMVWDMRSGQCVQAFETHESDINSVRYYPSGDAFASGSDDATCRL YDLRADREVAIYSKESIIFGASSVDFSLSGRLLFAGYNDYTINVWDVLKGSRVSILFGHE NRVSTLRVSPDGTAFCSGSWDHTLRWVLSGFADEILDLDYVTACGDSGVISGLKQHRVVR AFVSRALIVARPAG >gi568815583r:52009851_52212726|GENSCAN_predicted_CDS_5|1305_bp atggggccgcctccctctgcggaacatatgtctgtgctgtgtcagcccaaagcccagaac ctggtccttagcaaggggactgactatggccttgtcattccagtgcaccaggtggcggag cgggtggaggccctggggcagtttgtcatgaagaccagaaggaccctcaaaggccacggg aacaaagtcctgtgcatggactggtgcaaagataagaggaggatcgtgagctcgtcacag gatgggaaggtgatcgtgtgggattccttcaccacaaacaaggagcacgcggtcaccatg ccctgcacgtgggtgatggcatgtgcttatgccccatcgggatgtgccattgcttgtggt ggtttggataataagtgttctgtgtaccccttgacgtttgacaaaaatgaaaacatggct gccaaaaagaagtctgttgctatgcacaccaactacctgtcggcctgcagcttcaccaac tctgacatgcagatcctgacagcgagcggcgatggcacatgtgccctgtgggacgtggag agcgggcagctgctgcagagcttccacggacatggggctgacgtcctctgcttggacctg gccccctcagaaactggaaacaccttcgtgtctgggggtgcaagaaacagaggctttgaa agctcagaattggcagacagggccacagagactcagagaagcactagactaggagtcagg gtgcctgggtcagggctttgctacctgactggatgtgacaagaaagccatggtgtgggac atgcgctccggccagtgcgtgcaggcctttgaaacacatgaatctgacatcaacagtgtc cggtactaccccagtggagatgcctttgcttcagggtcagatgacgctacgtgtcgcctc tatgacctgcgggcagatagggaggttgccatctattccaaagaaagcatcatatttgga gcatccagcgtggacttctccctcagtggtcgcctgctgtttgctggatacaatgattac actatcaacgtctgggatgttctcaaagggtcccgggtctccatcctgtttggacatgaa aaccgcgttagcactctacgagtttcccccgatgggactgctttctgctctggatcatgg gatcataccctcagatgggtgctgtcgggatttgctgatgagatacttgatttggactat gtgacagcatgtggagactcaggggttataagtggcttaaaacagcacagggttgtaagg gcctttgtgtcccgggctctcatagttgccaggccagctggctga >gi568815583r:52009851_52212726|GENSCAN_predicted_peptide_6|91_aa MAYTLEDSVDGDNTGGGKPGKRLLFFGSPLSSGHTICRTTLLCGPSKGEDACLCGLPAQL GPLKLEMLNILDASPDQGIRIKVMAGTQEEV >gi568815583r:52009851_52212726|GENSCAN_predicted_CDS_6|276_bp atggcgtacacgttggaggacagcgtcgatggagacaacactggaggagggaaaccaggg aagagactcctcttctttggcagccccctctccagcggacacaccatctgccgcaccacg cttctttgtggtccctccaagggagaggacgcttgcctctgtggcctccctgctcagttg ggtcctctgaagcttgagatgctgaatattctggatgccagtccagaccaaggaatcaga atcaaagtcatggctggaactcaagaggaagtgtag >gi568815583r:52009851_52212726|GENSCAN_predicted_peptide_7|453_aa XLKPRGVVVNMIPGLPAHILFMCVRYADSLNDANMLKSLMNSTINGIKQVVKLCLHCLFL FFLPFPGMLEYESLQGISGLKPTGFRKRSSSIDDTDGYTMTSVLQQLSYFYTTMCQNGLD PELVRQAVKQLFFLIGAVTLNSLFLRKDMCSCRKGMQIRCNISYLEEWLKDKNLQNSLAK ETLEPLSQAAWLLQVKKTTDSDAKEIYERCTSLSAVQIIKILNSYTPIDDFEKRVTPSFV RKVQMATEGLHENETLASLKSEAESLKGKLEEERAKLHDVEPLSLIDEKAAEIQGGGCCG LLGVHSRPLLPGSLLHLEVSPVKAAKQQRWHPAPSSGDSIPEGHQPDASWNAPGGGICRP LLGGLTQSRGMGSGTRLKKQSGCPLVEQIRCAGGNLPHPDHPDSPEHPGPTESSSLAMIC HSSCAALWGIPPSLDHPDSQELAGQDGLLELQR >gi568815583r:52009851_52212726|GENSCAN_predicted_CDS_7|1362_bp nacttgaagccccgtggcgtggtggtgaacatgatccccgggctgccggctcatatcctg ttcatgtgtgtgcgctacgcagactctctgaatgatgccaacatgctgaagtccctcatg aacagcaccattaatggcatcaagcaggtggttaagctctgccttcactgcttgttcctc ttcttcctgccctttccgggaatgctggagtatgagagcctgcagggcatttccggcctg aagcccacaggcttccggaagcgctcctctagcatagacgacacggacggctacaccatg acctccgtcctgcaacagctgagctacttttacaccaccatgtgccagaacggcctggac cccgagcttgtgaggcaggcggtgaagcagctcttcttcttgatcggggcggtcacgctg aacagcctcttcctgcgcaaggacatgtgctcctgcagaaaagggatgcagatcaggtgc aatatcagctacttagaagaatggcttaaagataagaacttgcagaacagcttagcaaag gaaactttggagcccctctctcaggcagcctggttgcttcaggtcaagaagaccacagac agtgatgccaaggagatctacgaacgctgcacctcactgtctgctgtgcagatcataaag atccttaattcatacacacctatagatgactttgagaagagagtgactccatcctttgtt cgcaaagtacagatggcaaccgaggggctgcacgagaacgagacgctggcgtcgctgaag agcgaggccgagagcctcaagggcaagctggaggaggagcgagccaagctgcacgatgtg gagcccctgtcccttatagatgaaaaagcagctgagatccagggagggggctgctgcggt ttgctgggggtccactccagacccttattgcctgggtccctcctgcacctggaggtatca ccagtgaaggctgcaaaacagcaaagatggcatccagctccttcctctggagactccatc ccagaggggcaccaacctgatgccagctggaacgctcctggaggaggcatttgtagaccc ctgttgggaggtctcacccagtcaagaggaatgggatcagggacccgcttaaagaagcag tctggctgccccctggtggagcagatacgctgtgctgggggaaacctccctcatccagac cacccagactctccagagcatccaggtcccactgaaagtagcagtctggccatgatctgc cacagcagctgtgctgcgctgtggggaattcctcccagtttggaccaccctgactcccag gagctggcaggccaggatggcctacttgagctgcagagatag