GENSCAN 1.0    Date run: 5-Nov-116    Time: 19:56:12

Sequence gi568815584f:51850679_52066684 : 216006 bp : 40.55% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 Intr -   5105   5016   90  0  0  109   94    65 0.294   8.25
 1.01 Init -  30824  30746   79  2  1   60   98    69 0.862   6.37
 1.00 Prom -  33659  33620   40                              -7.25

 2.05 PlyA -  33771  33766    6                               1.05
 2.04 Term -  34315  34184  132  1  0  109   46    90 0.495   4.01
 2.03 Intr -  53697  53642   56  1  2  129   86     2 0.052   1.88
 2.02 Intr -  66393  66299   95  2  2   92   68    76 0.349   4.69
 2.01 Init -  66714  66656   59  0  2   85   89    20 0.593   2.69
 2.00 Prom -  83143  83104   40                              -3.35

 3.00 Prom +  85791  85830   40                              -4.35
 3.01 Init +  88834  88924   91  0  1   69  111     0 0.265   1.10
 3.02 Intr +  92387  92464   78  0  0   73   94    64 0.318   4.10
 3.03 Intr +  99972 100087  116  1  2  106  100   101 0.376  12.35
 3.04 Term + 100659 100664    6  2  0  113   37     0 0.157  -5.71
 3.05 PlyA + 101024 101029    6                               1.05

 4.06 PlyA - 101086 101081    6                               1.05
 4.05 Term - 101723 101532  192  2  0   99   39   172 0.396   9.84
 4.04 Intr - 113935 113868   68  2  2   80   42    96 0.108   1.91
 4.03 Intr - 115912 115676  237  2  0   47   10   213 0.029   6.06
 4.02 Intr - 119318 119294   25  2  1  112   57    10 0.028  -3.02
 4.01 Init - 121813 121655  159  1  0   72   77   132 0.073  10.38
 4.00 Prom - 124391 124352   40                              -3.55

 5.03 PlyA - 127325 127320    6                               1.05
 5.02 Term - 128659 128373  287  2  2   22   40   170 0.187   0.18
 5.01 Init - 132741 132591  151  0  1   77   75   122 0.535  10.05
 5.00 Prom - 135532 135493   40                              -8.35

 6.00 Prom + 136243 136282   40                              -5.85
 6.01 Init + 138308 138347   40  1  1   70   72    72 0.735   4.20
 6.02 Intr + 138768 139022  255  1  0   73  100    96 0.720   5.79
 6.03 Intr + 140639 140763  125  0  2   79   90   136 0.995  12.28
 6.04 Intr + 143045 143144  100  1  1   87   88   107 0.992   9.36
 6.05 Intr + 147816 147902   87  1  0   70  101   116 0.997  10.12
 6.06 Intr + 149030 149118   89  0  2   64   82   115 0.950   7.27
 6.07 Intr + 151120 151188   69  0  0   49   94    79 0.892   3.06
 6.08 Intr + 153516 153564   49  2  1  108  107    43 0.999   5.53
 6.09 Term + 153684 153838  155  1  2   48   47   155 0.880   4.50
 6.10 PlyA + 153997 154002    6                               1.05

 7.22 PlyA - 154026 154021    6                               1.05
 7.21 Term - 154818 154808   11  1  2  112   42     0 0.898  -4.82
 7.20 Intr - 155171 155059  113  1  2  115   98    71 0.952   9.90
 7.19 Intr - 155982 155859  124  1  1   77   83    76 0.988   4.72
 7.18 Intr - 157289 157132  158  1  2   53   90   108 0.999   6.23
 7.17 Intr - 160369 160198  172  2  1  105   44   284 0.999  23.78
 7.16 Intr - 161005 160876  130  1  1   18   76   216 0.947  12.75
 7.15 Intr - 163778 163591  188  0  2  150   17   177 0.768  15.29
 7.14 Intr - 164597 164376  222  0  0   55   60   274 0.899  18.48
 7.13 Intr - 165528 165418  111  1  0   46   38   106 0.598   0.83
 7.12 Intr - 168616 168383  234  2  0   11   99   214 0.929  11.54
 7.11 Intr - 169500 169381  120  1  0   97  119    67 0.327  10.25
 7.10 Intr - 176666 176523  144  0  0  116  105   151 0.985  18.93
 7.09 Intr - 178172 178044  129  0  0   79   65   150 0.974  11.55
 7.08 Intr - 179012 178869  144  0  0   75   86    49 0.767   2.73
 7.07 Intr - 182970 182842  129  1  0    9   81    91 0.476   0.25
 7.06 Intr - 188299 188069  231  2  0   77  116   180 0.963  16.52
 7.05 Intr - 190173 189973  201  1  0  110   47   154 0.991  11.84
 7.04 Intr - 191672 191427  246  0  0   83  109   368 0.991  34.91
 7.03 Intr - 192253 192104  150  2  0   59   86   119 0.807   8.01
 7.02 Intr - 203643 202901  743  2  2   72   96   741 0.857  63.46
 7.01 Intr - 209678 209446  233  2  2  136   89   143 0.917  14.85

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr -  10189   9970  220  1  1   66   62   156 0.921   7.75
S.002 Init -  85115  85037   79  2  1   85   48    79 0.977   4.88

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815584f:51850679_52066684|GENSCAN_predicted_peptide_1|57_aa
MLAEYCRSSGERPSKRLYPDVATLHGERSHQLCQQSQPLSACVGCREEGRVKWEQKX

>gi568815584f:51850679_52066684|GENSCAN_predicted_CDS_1|171_bp
atgctggcagagtactgcagaagtagtggggaaaggcccagcaagcgtctttacccagat
gtagcaactttacatggggaaaggagccatcagctgtgccagcaaagccagcccttgagt
gcatgtgtgggctgcagggaagaaggaagagtaaaatgggaacaaaaagnn

>gi568815584f:51850679_52066684|GENSCAN_predicted_peptide_2|113_aa
MSLLSDKWDAIFLCFTLGAHHSHAFTRLCPPVPKPALNMPGSVDKIRSPSMGSEAEAKIH
FPGSLAATVQTETQLSMILTALGPERMRPLLSTRVGKFQRKNLLAPADGRPTL

>gi568815584f:51850679_52066684|GENSCAN_predicted_CDS_2|342_bp
atgtcgttgctgtctgacaaatgggatgcgattttcttgtgctttacacttggggctcac
cacagtcatgctttcaccaggctctgcccaccagtgcccaagccagccctgaacatgcct
ggcagtgtagataaaatcagatctccatcaatgggttctgaggcagaggctaaaattcat
ttcccaggctcccttgcagctacggtacagactgaaacacagctttccatgattcttact
gccttgggaccagaaaggatgagacctctcctctcaactcgagttggaaaattccagagg
aagaacctactggcccctgctgatggcaggcccaccctctga

>gi568815584f:51850679_52066684|GENSCAN_predicted_peptide_3|96_aa
MLRLTLHPHSIDGSHADLREEIRCFVSLPLEKVKSTAGHASGVVLWQPQSQCGPYKVFLK
DLSSTPMASNNTASIAQARKLVEQLKMEANIDRIKT

>gi568815584f:51850679_52066684|GENSCAN_predicted_CDS_3|291_bp
atgttgcgtcttactctccacccacacagcatagacggctcccatgctgatttgagggaa
gaaatacgttgttttgtctctctcccacttgaaaaagtgaagtctactgctgggcatgcc
tctggggtggtcctctggcagccccaaagccagtgtgggccctacaaagtgtttctgaaa
gatctatccagcactccgatggccagcaacaacaccgccagcatagcacaagccaggaag
ctggtagagcagcttaagatggaagccaatatcgacaggataaagacctaa

>gi568815584f:51850679_52066684|GENSCAN_predicted_peptide_4|226_aa
MNWKRGFGWQDGISQVGPLKGLESQVVEIGHDVVDKEHHEPENLGFLNMVKDEAVPKMYR
PIGHQICSCLGHLERETKTTNTGVWYSQSRPKALEPNVLASNPCSMKQDTQPLKASASSS
AKKKVDDTDLTGLLPASLGLEQIEATFWTEAAPSASIPDEGDMQESEKEEARKEEKRKMA
KEHGKVALVISVMKKEEELRLLKEEQELIKSCGPLLGIKFYKGDLG

>gi568815584f:51850679_52066684|GENSCAN_predicted_CDS_4|681_bp
atgaactggaaaagagggtttggatggcaagatgggattagccaagtggggcctctcaag
ggccttgagagccaggtagttgagattggacatgatgtggttgacaaagagcaccatgaa
ccagagaaccttgggtttctgaacatggtgaaagatgaggctgtacccaagatgtacaga
cctataggccatcaaatctgcagctgccttggacacctggaaagggagacaaaaacaacc
aacactggagtctggtacagtcagtcaaggcccaaggctttagagcccaatgtcctggca
tcaaatccttgctccatgaagcaagatactcagcctctaaaagcttctgcttcctcatct
gcaaagaagaaagttgacgacactgaccttacaggattgttgccagcaagtctgggacta
gaacagatagaagcaacattctggacagaggctgctccatctgccagcatccctgatgaa
ggtgacatgcaagagagtgaaaaagaagaagcaagaaaagaggagaagaggaagatggca
aaggagcacggtaaggtcgctttagtgatctcagtcatgaagaaagaagaagaattaagg
ctgttgaaagaagagcaggagctgattaaatcctgtggtccattactaggcatcaaattt
tacaaaggagatttaggataa

>gi568815584f:51850679_52066684|GENSCAN_predicted_peptide_5|145_aa
MEEVTSKPAIKYELLPDFPMGKGQSMNKSMSTKAPCQQGFASVKLKEDQGEHLQTKDAMM
WRLLRTDQQRALLRKKNDVQDCSGQGWQIPHRRHHLAVIIVKNGKGKWISTEHEPEQPWS
DSGANCGKPSGEPSQSGIQARISSL

>gi568815584f:51850679_52066684|GENSCAN_predicted_CDS_5|438_bp
atggaggaggttacatctaagccagctatcaaatatgagctattaccggatttcccaatg
gggaaagggcagagcatgaacaaaagcatgagcacaaaggccccttgtcagcaagggttt
gcttctgtgaaactgaaggaagatcagggagagcatctgcagacgaaagacgcgatgatg
tggcgccttctgagaactgaccagcagagggcgctcttgagaaagaagaacgatgtgcaa
gattgttccggtcagggatggcaaattccacacaggaggcaccacttagctgtaattatt
gtaaagaatggtaaaggcaagtggatttccacagaacatgaaccagaacagccatggagt
gatagcggtgccaattgcgggaaacctagcggagaaccttcccaaagtggtattcaggct
aggatttcaagcctctag

>gi568815584f:51850679_52066684|GENSCAN_predicted_peptide_6|322_aa
MAGIGDLLDAVTEAGAAIGGHLPAATAFKPRLAKAAVISERLSACPPSRRVAGACASRST
SLLLSRPRPGGPEREAGTMFRRKLTALDYHNPAGFNCKDETEFRNFIVWLEDQKIRHYKI
EDRGNLRNIHSSDWPKFFEKYLRDVNCPFKIQDRQEAIDWLLGLAVRLEYGDNAEKYKDL
VPDNSKTADNATKNAEPLINLDVNNPDFKAGVMALANLLQIQRHDDYLVMLKAIRILVQE
RLTQDAVAKANQTKEGLPVALDKHILGFDTGDAVLNEAAQILRLLHIEELRELQTKINEA
IVAVQAIIADPKTDHRLGKVGR

>gi568815584f:51850679_52066684|GENSCAN_predicted_CDS_6|969_bp
atggcaggaattggtgacctactggatgctgtgaccgaggccggagccgcgattggtggg
catttgccggcggccaccgcttttaagccacgattggcgaaggccgccgtcatttcggag
cgactcagcgcctgcccgccctctcgccgcgtcgccggtgcctgcgcctcccgctccacc
tcgcttcttctctcccggccgaggcccgggggaccagagcgagaagcggggaccatgttc
cgacgcaagttgacggctctcgactaccacaaccccgccggcttcaactgcaaagatgaa
acagaatttagaaacttcatcgtttggcttgaagaccagaaaatcaggcactacaagatt
gaagacagagggaatttaagaaacatccacagcagcgactggcccaagttctttgaaaag
tatctcagagatgttaactgtcctttcaagattcaagatcgacaagaagctattgactgg
cttcttggtttagctgttagacttgaatatggagataatgctgaaaaatacaaggattta
gtacctgataattcaaaaactgctgacaatgcaactaaaaatgcagaaccattgatcaat
ttggatgtaaataatcctgattttaaggctggtgtgatggctttggctaacctgcttcag
attcagcgtcatgatgattacctggtaatgcttaaggcaattcggattttggttcaggag
cgcctgacacaggatgcagttgctaaggcaaatcaaacaaaagagggcttacctgttgct
ttagacaaacatattcttggttttgacacaggagatgcagttcttaatgaagctgctcaa
attctgcgattgctgcacatagaggagctcagagagctacagacaaaaatcaacgaagcc
atagtagctgttcaggcaattattgctgatccaaagacagaccacagactgggaaaagtt
ggaagatga

>gi568815584f:51850679_52066684|GENSCAN_predicted_peptide_7|1310_aa
LNTFQAVLASDGSDSYALFLYPANGLQFLGTRPKESYNVQLQLPARVGFCRGEADDLKSE
GPYFSLTSTEQSVKNLYQLSNLGIPGVWAFHIGSTSPLDNVRPAAVGDLSAAHSSVPLGR
SFSHATALESDYNEDNLDYYDVNEEEAEYLPGEPEEALNGHSSIDVSFQSKVDTKPLEGR
ISPPDSDLSSPLHPTPTYWPFYPETESSTLDPHTKEGTSLGEVGGPDLKGQVEPWDERET
RSPAPPEVDRDSLAPSWETPPPYPENGSIQPYPDGGPVPSEMDVPPAHPEEEIVLRSYPA
SGHTTPLSRGTYEVGLEDNIGSNTEVFTYNAANKETCEHNHRQCSRHAFCTDYATGFCCH
CQSKFYGNGKHCLPEGAPHRVNGKVSGHLHVGHTPVHFTDVDLHAYIVGNDGRAYTAISH
IPQPAAQALLPLTPIGGLFGWLFALEKPGSENGFSLAGAAFTHDMEVTFYPGEETVRITQ
TAEGLDPENYLSIKTNIQGQVPYVSANFTAHISPYKELYHYSDSTVTSTSSRDYSLTFGA
INQTWSYRIHQNITYQVCRHAPRHPSFPTTQQLNVDRVFALYNDEERVLRFAVTNQIGPV
KELLSFSRGNDSCWNSCETAQLSTPDVGVVRLPRARMQNWGVFPKDSDPTPGNPCYDGSH
MCDTTARCHPGTGVDYTCECASGYQGDGRNCVDENECATGFHRCGPNSVCINLPGSYRCE
CRSGYEFADDRHTCILITPPANPCEDGSHTCAPAGQARCVHHGGSTFSCACLPGYAGDGH
QCTDVDECSENRCHPAATCYNTPGSFSCRCQPGYYGDGFQCIPDSTSSLTPCEQQQRHAQ
AQYAYPGARFHIPQCDEQGNFLPLQCHGSTGFCWCVDPDGHEVPGTQTPPGSTPPHCGPS
PAESSQNNLYFGQSLNGTVEAAWEETALQPKPAGLQPWKPTQRPPTICERWRENLLEHYG
GTPRDDQYVPQCDDLGHFIPLQCHGKSDFCWCVDKDGREVQGTRSQPGTTPACIPTVAPP
MVRPTPRPDVTPPSVGTFLLYTQGQQIGYLPLNGTRLQKDAAKTLLSLHVKWISPGSIIV
GIDYDCRERMVYWTDVAGRTISRAGLELGAEPETIVNSGLISPEGLAIDHIRRTMYWTDS
VLDKIESALLDGSERKVLFYTDLVNPRAIAVDPIRGNLYWTDWNREAPKIETSSLDGENR
RILINTDIGLPNGLTFDPFSKLLCWADAGTKKLECTLPDGTGRRVIQNNLKYPFSIVSYA
DHFYHTDWRRDGVVSVNKHSGQFTDEYLPEQRSHLYGITAVYPYCPTGRK

>gi568815584f:51850679_52066684|GENSCAN_predicted_CDS_7|3933_bp
ctgaacactttccaggcagttttggcatctgatgggtctgatagctacgccctctttctt
tatcctgccaacggcctgcagttccttggaacccgccccaaagagtcttacaatgtccag
cttcagcttccagctcgggtgggcttctgccgaggggaggctgatgatctgaagtcagaa
ggaccatatttcagcttgactagcactgaacagtctgtgaaaaatctctatcaactaagc
aacctggggatccctggagtgtgggctttccatatcggcagcacttccccgttggacaat
gtcaggccagctgcagttggagacctttccgctgcccactcttctgttcccctgggacgt
tccttcagccatgctacagccctggaaagtgactataatgaggacaatttggattactat
gatgtgaatgaggaggaagctgaataccttccgggtgaaccagaggaggcattgaatggc
cacagcagcattgatgtttccttccaatccaaagtggatacaaagcctttagagggtagg
atctcccctccagattctgatctgtcctcccccttgcatccaacacctacttattggcca
ttctatcctgaaacagaatcttccaccttggatcctcacaccaaagaaggaacatctctg
ggagaggtagggggcccagatttaaaaggccaagttgagccctgggatgagagagagacc
agaagcccagctccaccagaggtagacagagattcactggctccttcctgggaaacccca
ccaccgtaccccgaaaacggaagcatccagccctacccagatggagggccagtgccttcg
gaaatggatgttcccccagctcatcctgaagaagaaattgttcttcgaagttaccctgct
tcaggtcacactacacccttaagtcgagggacgtatgaggtgggactggaagacaacata
ggttccaacaccgaggtcttcacgtataatgctgccaacaaggaaacctgtgaacacaac
cacagacaatgctcccggcatgccttctgcacggactatgccactggcttctgctgccac
tgccaatccaagttttatggaaatgggaagcactgtctgcctgaaggggcacctcaccga
gtgaatgggaaagtgagtggccacctccacgtgggccatacacccgtgcacttcactgat
gtggacctgcatgcgtatatcgtgggcaatgatggcagagcctacacggccatcagccac
atcccacagccagcagcccaggccctcctccccctcacaccaattggaggcctgtttggc
tggctctttgctttagaaaaacctggctctgagaacggcttcagcctcgcaggtgctgcc
tttacccatgacatggaagttacattctacccgggagaggagacggttcgtatcactcaa
actgctgagggacttgacccagagaactacctgagcattaagaccaacattcaaggccag
gtgccttacgtctcagcaaatttcacagcccacatctctccctacaaggagctgtaccac
tactccgactccactgtgacctctacaagttccagagactactctctgacttttggtgca
atcaaccaaacatggtcctaccgcatccaccagaacatcacttaccaggtgtgcaggcac
gcccccagacacccgtccttccccaccacccagcagctgaacgtggaccgggtctttgcc
ttgtataatgacgaagaaagagtgcttagatttgctgtgaccaatcaaattggcccggtc
aaagaactgctcagtttttccagagggaacgatagttgttggaattcatgtgaaacagcg
cagctgtccacacctgatgtgggtgtggtgcgacttcccagggcgaggatgcagaactgg
ggtgtctttcccaaggattcagaccccactccggggaatccttgctatgatgggagccac
atgtgtgacacaacagcacggtgccatccagggacaggtgtagattacacctgtgagtgc
gcatctgggtaccagggagatggacggaactgtgtggatgaaaatgaatgtgcaactggc
tttcatcgctgtggccccaactctgtatgtatcaacttgcctggaagctacaggtgtgag
tgccggagtggttatgagtttgcagatgaccggcatacttgcatcttgatcaccccacct
gccaacccctgtgaggatggcagtcatacctgtgctcctgctgggcaggcccggtgtgtt
caccatggaggcagcacgttcagctgtgcctgcctgcctggttatgccggcgatgggcac
cagtgcactgatgtagatgaatgctcagaaaacagatgtcaccctgcagctacctgctac
aatactcctggttccttctcctgccgttgtcaacccggatattatggggatggatttcag
tgcatacctgactccacctcaagcctgacaccctgtgaacaacagcagcgccatgcccag
gcccagtatgcctaccctggggcccggttccacatcccccaatgcgacgagcagggcaac
ttcctgcccctacagtgtcatggcagcactggtttctgctggtgcgtggaccctgatggt
catgaagttcctggtacccagactccacctggctccaccccgcctcactgtggaccatca
ccagcagagtcttctcagaacaacctgtattttggacaaagcctcaatggcactgtggaa
gcagcctgggaggaaactgctttgcaaccaaagccagcaggacttcaaccatggaagccc
acccagaggcccccgaccatctgtgagcgctggagggaaaacctgctggagcactacggt
ggcaccccccgggatgaccagtacgtgccccagtgcgatgacctgggccacttcatcccc
ctgcagtgccacggaaagagcgacttctgctggtgtgtggacaaagatggcagagaggtg
cagggcacccgctcccagccaggcaccacccctgcgtgtatacccaccgtcgctccaccc
atggtccggcccacgccccggccagatgtgacccctccatctgtgggcaccttcctgctc
tatactcagggccagcagattggctacttacccctcaatggcaccaggcttcagaaggat
gcagctaagaccctgctgtctctgcatgtaaagtggatttctcctggctccataatcgtg
ggaattgattacgactgccgggagaggatggtgtactggacagatgttgctggacggaca
atcagccgtgctggtctggaactgggagcagagcctgagacgatcgtgaattcaggtctg
ataagccctgaaggacttgccatagaccacatccgcagaacaatgtactggacggacagt
gtcctggataagatagagagcgccctgctggatggctctgagcgcaaggtcctcttctac
acagatctggtgaatccccgtgccatcgctgtggatccaatccgaggcaacttgtactgg
acagactggaatagagaagctcctaaaattgaaacgtcatctttagatggagaaaacaga
agaattctgatcaatacagacattggattgcccaatggcttaacctttgaccctttctct
aaactgctctgctgggcagatgcaggaaccaaaaaactggagtgtacactacctgatgga
actggacggcgtgtcattcaaaacaacctcaagtaccccttcagcatcgtaagctatgca
gatcacttctaccacacagactggaggagggatggtgttgtatcagtaaataaacatagt
ggccagtttactgatgagtatctcccagaacaacgatctcacctctacgggataactgca
gtctacccctactgcccaacaggaagaaagtaa