GENSCAN 1.0 Date run: 8-Nov-116 Time: 11:43:26 Sequence gi568815591f:23082086_23297230 : 215145 bp : 41.22% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 898 893 6 -0.45 1.02 Term - 1557 1322 236 1 2 3 43 171 0.154 -0.40 1.01 Init - 2447 2363 85 2 1 72 110 75 0.193 9.20 1.00 Prom - 18826 18787 40 -1.85 2.00 Prom + 20332 20371 40 -4.05 2.01 Init + 23942 24061 120 1 0 76 86 244 0.999 23.24 2.02 Intr + 24199 24327 129 0 0 129 41 48 0.060 4.17 2.03 Intr + 41692 41794 103 0 1 62 108 62 0.148 4.43 2.04 Intr + 42603 42696 94 1 1 96 95 109 0.883 10.50 2.05 Intr + 42963 43087 125 0 2 90 91 86 0.995 8.41 2.06 Term + 47479 47603 125 2 2 70 49 155 0.902 7.37 2.07 PlyA + 47637 47642 6 -0.45 3.00 Prom + 48239 48278 40 -5.05 3.01 Init + 48922 48982 61 0 1 28 105 23 0.310 -0.54 3.02 Intr + 58684 58859 176 2 2 88 62 184 0.972 14.54 3.03 Intr + 61766 61940 175 1 1 114 97 170 0.996 19.09 3.04 Intr + 69982 70124 143 2 2 64 88 126 0.806 9.35 3.05 Intr + 83613 83853 241 2 1 104 82 162 0.809 13.40 3.06 Intr + 85751 85952 202 0 1 104 58 89 0.852 4.92 3.07 Intr + 90863 90960 98 2 2 46 77 48 0.710 -1.77 3.08 Term + 91930 92213 284 2 2 52 48 265 0.916 13.50 3.09 PlyA + 92954 92959 6 1.05 4.00 Prom + 98301 98340 40 -7.45 4.01 Init + 100001 100121 121 1 1 96 62 194 0.329 18.00 4.02 Intr + 102985 103213 229 2 1 88 84 168 0.995 12.41 4.03 Intr + 104967 105061 95 0 2 126 93 50 0.936 7.99 4.04 Intr + 111528 111742 215 1 2 60 78 211 0.978 14.81 4.05 Intr + 114595 114681 87 0 0 120 97 3 0.903 3.65 4.06 Intr + 117373 117457 85 0 1 82 111 125 0.999 12.67 4.07 Term + 118083 118660 578 1 2 101 42 334 0.990 23.74 4.08 PlyA + 118899 118904 6 1.05 5.00 Prom + 119308 119347 40 -9.25 5.01 Init + 121059 121181 123 2 0 88 100 41 0.549 5.52 5.02 Intr + 123980 124395 416 1 2 85 83 171 0.526 8.28 5.03 Intr + 129459 129575 117 0 0 57 94 95 0.623 5.66 5.04 Intr + 148680 148788 109 0 1 45 70 72 0.138 0.47 5.05 Term + 151553 151699 147 1 0 103 43 106 0.468 4.52 5.06 PlyA + 154494 154499 6 1.05 6.00 Prom + 158298 158337 40 -6.45 6.01 Init + 164773 164842 70 0 1 68 92 89 0.783 6.77 6.02 Term + 165484 165917 434 2 2 42 55 193 0.282 5.87 6.03 PlyA + 166281 166286 6 1.05 7.00 Prom + 167672 167711 40 -3.45 7.01 Init + 171194 171374 181 1 1 49 110 130 0.545 10.69 7.02 Intr + 172084 172227 144 2 0 73 95 109 0.995 9.43 7.03 Intr + 177895 178053 159 2 0 76 109 167 0.999 16.64 7.04 Intr + 178371 178688 318 1 0 39 82 255 0.939 15.01 7.05 Intr + 184396 184530 135 2 0 64 82 158 0.947 12.42 7.06 Intr + 185801 185903 103 0 1 20 105 122 0.867 5.41 7.07 Intr + 187882 188090 209 1 2 93 105 217 0.868 21.70 7.08 Term + 191436 191533 98 1 2 97 38 69 0.115 -0.05 7.09 PlyA + 191824 191829 6 1.05 8.05 PlyA - 192761 192756 6 1.05 8.04 Term - 196082 195889 194 0 2 63 37 155 0.047 4.50 8.03 Intr - 205564 205418 147 2 0 93 32 86 0.043 2.79 8.02 Intr - 211788 211628 161 2 2 60 74 78 0.332 2.31 8.01 Init - 214298 214228 71 2 2 48 70 73 0.288 2.07 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 24199 24342 144 0 0 129 42 87 0.936 5.13 S.002 Init + 41716 41794 79 0 1 51 108 51 0.817 4.67 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:23082086_23297230|GENSCAN_predicted_peptide_1|106_aa MGPQTFCHSFCWTASHISGGASVTLDVAVFVDTFTGWIEAFPTWSEKAIEVSKLLLKEII SRFWLPKSLQSNNGPYFTVTITPNTSSALGIQYRPHSHGGHSLPGK >gi568815591f:23082086_23297230|GENSCAN_predicted_CDS_1|321_bp atggggcctcagaccttctgccacagcttctgttggacagctagccatatctctggagga gcctcggtaactcttgatgtggcagtatttgttgacacctttactggttggatcgaggct tttcctacctggtctgaaaaggcaattgaggtttctaaactcctactaaaggaaataatt tctagattttggctgcctaagagcttacagagcaataatggcccatatttcacagtgaca attaccccaaacacatcttcagccctaggaattcagtaccgccctcattcgcatggaggc cacagtcttccgggaaagtag >gi568815591f:23082086_23297230|GENSCAN_predicted_peptide_2|231_aa MAASGVEKSSKKKTEKKLAAREEAKLLAGFMGVMNNMRKQASDSAVGRGARKTILEQKCF SSAEDVAPKSDSWGTPQAGFRRRKTLCDVILMVQERKIPAHRVVLAAASHFFNLMFTTNM LESKSFEVELKDAEPDIIEQLVEFAYTARISVNSNNVQSLLDAANQYQIEPVKKMCVDFL KEQVDASNCLASGRVDYIEFNRPQTVGIADLTGKPLIRPENNRPETQLSRD >gi568815591f:23082086_23297230|GENSCAN_predicted_CDS_2|696_bp atggcagcctctggggtggagaagagcagcaagaagaagaccgagaagaaacttgctgct cgggaagaagctaaattgttggcgggtttcatgggcgtcatgaataacatgcggaaacag gcgagcgattccgcagttggtagaggggcgcgtaaaacaattctcgagcaaaagtgcttc tcgtctgccgaggatgtagctcccaagtcagactcctgggggactcctcaggctggcttt cgccgtcggaaaacgttgtgtgacgtgatcctcatggtccaggaaagaaagatacctgct catcgtgttgttcttgctgcagccagtcatttttttaacttaatgttcacaactaacatg cttgaatcaaagtcctttgaagtagaactcaaagatgctgaacctgatattattgaacaa ctggtggaatttgcttatactgctagaatttccgtgaatagcaacaatgttcagtctttg ctggatgcagcaaaccaatatcagattgaacctgtgaagaaaatgtgtgttgattttttg aaagaacaagttgatgcttcaaattgtcttgctagtggccgagtcgactacattgaattc aaccgtccccaaactgtgggtattgctgatctgactgggaagcctctcattcggcctgag aataatagaccagagacacagttatcaagagactga >gi568815591f:23082086_23297230|GENSCAN_predicted_peptide_3|459_aa MTLIPGSVRCQPILRVKNSAGISVLAECLDCPELKATADDFIHQHFTEVYKTDEFLQLDV KRVTHLLNQDTLTVRAEDQVYDAAVRWLKYDEPNRQPFMVDILAKVRFPLISKNFLSKTV QAEPLIQDNPECLKMVISGMRYHLLSPEDREELVDGTRPRRKKHDYRIALFGGSQPQSCR YFNPKDYSWTDIRCPFEKRRDAACVFWDNVVYILGGSQLFPIKRMDCYNVVKDSWYSKLG PPTPRDSLAACAAEGKIYTSGGSEVGNSALYLFECYDTRTESWHTKPSMLTQRCSHGMVE ANGLIYVCGGSLGNNVSGRVLNSCEVYDPATETWTELCPMIEARKNHGLVFVKDKIFAVG GQNGLGGLDNVEYYDIKLNEWKMVSPMPWKGVTVKCAAVGSIVYVLAGFQGVGRLGHILE YNTETDKWVANSKVRAFPVTSCLICVVDTCGANEETLET >gi568815591f:23082086_23297230|GENSCAN_predicted_CDS_3|1380_bp atgacattgatacctggtagtgttcgctgccaacctatcttaagggtaaagaattctgct ggtataagtgtgctagcggagtgtctagattgtcctgaattgaaagcaactgcagatgac tttattcatcagcactttactgaagtttacaaaactgatgaatttcttcaacttgatgtc aagcgagtaacacatcttctcaaccaggacactctgactgtgagagcagaggatcaggtt tatgatgctgcagtcaggtggttgaaatacgatgaacctaatcgccagccatttatggtt gatatccttgctaaagtcaggtttcctcttatatcaaagaatttcttaagtaaaacggta caagctgaaccacttattcaagacaatcctgaatgccttaagatggtgataagtggaatg aggtaccatctactgtctccagaggaccgagaagaacttgtagatggcacaagacctaga agaaagaaacatgactaccgcatagccctatttggaggctctcaaccacagtcttgtaga tattttaacccaaaggattatagctggacagacatccgctgcccctttgaaaaacgaaga gatgcagcatgcgtgttttgggacaatgtagtatacattttgggaggctctcagcttttc ccaataaagcgaatggactgctataatgtagtgaaggatagctggtattcgaaactgggt cctccgacacctcgagacagccttgctgcatgtgctgcagaaggcaaaatttatacatct ggaggttcagaagtaggaaactcagctctgtatttatttgagtgctatgatacgagaact gaaagctggcacacaaagcccagcatgctgacccagcgctgcagccatgggatggtggaa gccaatggcctaatctatgtttgtggtggaagtttaggaaacaatgtttctgggagagtg cttaattcctgtgaagtttatgatcctgccacagaaacatggactgagctgtgtccaatg attgaagccaggaagaatcatgggctggtatttgtaaaagacaagatatttgctgtgggt ggtcagaatggtttaggtggtctggacaatgtggaatattacgatattaagttgaacgaa tggaagatggtctcaccaatgccatggaagggtgtaacagtgaaatgtgcagcagttggc tctatagtttatgtcttggctggttttcagggtgttggtcgattaggacacattctcgaa tataataccgaaacagacaaatgggttgccaactccaaagttcgtgcttttccagtcaca agttgtttaatttgtgttgtcgatacttgtggagcaaatgaagagacccttgaaacatga >gi568815591f:23082086_23297230|GENSCAN_predicted_peptide_4|469_aa MAICQFFLQGRCRFGDRCWNEHPGARGAGGGRQQPQQQPSGNNRRGWNTTSQRYSNVIQP SSFSKSTPWGGSRDQEKPYFSSFDSGASTNRKEGFGLSENPFASLSPDEQKDEKKLLEGI VKDMEVWESSGQWMFSVYSPVKKKPNISARHKGSPSPHQTQEPSWPHPVDSTPGPQVELP ASPAVCAHTPQPLGGRWDWVPVEQGVALLVEAQAAQERTELNSVQRLINQWRNRVNELKS LNISTKVALLSDVKDGVNQAAPAFGFGSSQAATFMSPGFPVNNSSSDNAQNFSFKTNSGF AAASSGSPAGFGSSPAFGAAASTSSGISTSAPAFGFGKPEVTSAASFSFKSPAASSFGSP GFSGLPASLATGPVRAPVAPAFGGGSSVAGFGSPGSHSHTAFSKPSSDTFGNSSISTSLS ASSSIIATDNVLFTPRDKLTVEELEQFQSKKFTLGKIPLKPPPLELLNV >gi568815591f:23082086_23297230|GENSCAN_predicted_CDS_4|1410_bp atggccatttgtcaattcttccttcaaggccggtgccgctttggagatcggtgctggaac gaacatcccggtgctaggggtgcaggaggaggacggcagcaaccgcagcagcagccttca ggtaataatagacgtggatggaatacaactagccagagatattccaatgtcatccagcca tccagtttctccaaatccacaccatgggggggcagcagagatcaagaaaagccatatttc agttcttttgattctggagcttcaactaacaggaaggaaggctttggattgtctgagaac ccatttgcttcacttagtcctgatgagcagaaagatgaaaagaaacttctggaaggaatt gtaaaagatatggaggtttgggaatcatcagggcagtggatgttttctgtttattcacca gtgaaaaagaaacctaatatttcagctagacataaaggttctccaagtccccaccagact caggagcccagctggcctcacccagtggattccacaccagggccacaggtggagctaccc gccagtcccgcggtgtgtgcccacactcctcagcccttgggtggtcgatgggactgggtg ccggtggagcagggggtggcgctcctcgtggaggctcaggctgctcaggagcgcacggag ctaaattctgtccaacgtttaataaatcaatggaggaacagggtaaatgaactgaaaagt ctaaatatatcaactaaagtagctttgctctctgatgtaaaggatggagtaaatcaagca gcacctgcatttggatttggcagcagtcaagcagcaacatttatgtcgccaggctttcca gtcaataacagcagcagtgataatgctcagaactttagttttaaaacaaactctggattt gctgctgcctcttctggaagccctgctggttttgggagttccccagcatttggagctgca gcctctaccagttcaggtatctctacttctgctccagcttttggatttgggaagcctgaa gtcacatcggctgcatcattttcattcaaaagccctgcagcttccagttttggatcacct ggattttcaggacttccagcttccttggcaacaggtcctgtcagagctccagtggcccca gcctttggaggtggcagttctgtggctggttttggtagtccgggctcacattctcacact gctttttctaagccatccagtgacacttttggaaatagcagcatatccacttctctgtca gcctcaagcagcatcattgcaacagataatgtgttattcacacccagagataaactaaca gtagaagaactggaacaatttcaatccaagaaatttactctgggaaaaattccattaaag cctccacctctggaacttctaaatgtttaa >gi568815591f:23082086_23297230|GENSCAN_predicted_peptide_5|303_aa MHTYWCQKSRKLVTPRPRLRGIAASENQKQVTMRWQQLGIKTRGAPLWLPTGRAQASKPR RTSLPHPTSVTVGGGSPCPDPAAGSCPASTKPRRAAETQCLRVAAGEAGREAWRPPQHRA AETVRVCRAREGGLATQASSAASLGHGAARVRARVTALFPLDETCRVLRTTDKGLNGLGR GHRCEVWWAFIPFPRSTLRLHGQSAVNCPSPVPTLRAPRNPIPSAEATCSAQAPSDSCGP WPSPIDVQCSLSQALVTDTQVASNHPLQRTREKTDFSEDGTRKSWYYVDKNTAGSPPKLL HMK >gi568815591f:23082086_23297230|GENSCAN_predicted_CDS_5|912_bp atgcatacatactggtgtcagaagtcaagaaaactggtaactcccagacctaggctaagg ggcatcgcggcttcagaaaatcagaaacaggtaacaatgaggtggcagcagctgggtata aagacacggggggccccgctgtggctgcccactggccgggcgcaggcctcgaagccgcgg cgaacctctcttccccaccccacctcggtgactgttggcggcggctctccctgcccagac cccgccgccggatcctgcccggcctcgacgaaaccccgccgagccgccgagacgcagtgt ctccgggtggcggcgggagaggcgggccgggaagcatggcggccgccccaacaccgcgcg gcggagaccgttagggtgtgcagggcccgggaaggcggtctcgcgacgcaggcaagctcg gccgcctctttaggccacggagccgcgcgagtccgggcccgggtgaccgctctgttccca ctggacgagacctgccgagtcctcaggacaacggacaaaggccttaacgggcttgggaga ggtcatcgctgtgaagtctggtgggcattcatcccttttcctcgcagcacgctcagactg cacgggcagtctgcagttaactgtccatcaccagtacccactctgcgggctcccagaaac ccaattccttcggcagaagccacatgttccgctcaggccccatctgacagctgtggcccc tggcccagccccatagacgttcagtgttccctttcacaggccctggtgacagacacccag gttgcctccaatcatcctctgcaacgaaccagggaaaagacggatttctcggaagatggt accaggaaatcttggtactatgtggataaaaatacagctggatccccacctaagttactc catatgaagtga >gi568815591f:23082086_23297230|GENSCAN_predicted_peptide_6|167_aa MECLYYFLGFLLLAARLPLDAAKPRVQTTEAVCRFARGRELSRCNPGPGIAGTGRVGSGQ RRSGFWSEPQRPETWAAPRPRRPRGRALTLSLPTATATRENAYVFGKEVPSQGADPVSAR STFPARSAPRVLLAARGARHCDGPASHTRPRATEWARNSPQATQRVS >gi568815591f:23082086_23297230|GENSCAN_predicted_CDS_6|504_bp atggaatgtctctactatttcctgggatttctgctcctggctgcaagattgccacttgat gccgccaaacccagggtccagaccactgaagcagtgtgcagattcgcgagaggaagagag ctgagccgctgtaaccctgggcctgggattgcggggacgggacgggtgggaagcggccaa aggcgcagcggcttctggtcagagccgcagaggcctgagacgtgggccgcgccccggccc aggagaccccgcggccgcgccctaaccctcagtctcccgactgcgactgcgacccgggaa aacgcctacgtttttggaaaggaagtcccatcacagggagctgacccggtgtcagcgcgc agcacgttcccggcccgcagtgcgccgcgcgtcctcctagcagcccgcggagcccggcac tgtgatggtcccgcttcccacacgaggccacgggccacagagtgggcaaggaactcgccc caagccacacagcgggtaagttga >gi568815591f:23082086_23297230|GENSCAN_predicted_peptide_7|448_aa MEFVSNIDTGFHDVLGNERPSAYMREHNQLNGWSSDENDWNEKLYPVWKRGDMRWKNSWK GGRVQAVLTSDSPALVGSNITFAVNLIFPRCQKEDANGNIVYEKNCRNGQYFQKLGRCSV RVSVNTANVTLGPQLMEVTVYRRHGRAYVPIAQVKDVYVVTDQIPVFVTMFQKNDRNSSD ETFLKDLPIMFDVLIHDPSHFLNYSTINYKWSFGDNTGLFVSTNHTVNHTYVLNGTFSLN LTVKAAAPGPCPPPPPPPRPSKPTPSLATTLKSYDSNTPGPAGDNPLELSRIPDENCQIN RYGHFQATITIVEGILEVNIIQMTDVLMPVPWPESSLIDFVVTCQGSIPTEVCTIISDPT CEITQNTVCSPVDVDEMCLLTVRRTFNGSGTYCVNLTLGDDTSLALTSTLISVPDRDPAS PLRMANSALISVGCLAIFVTVISLLVYK >gi568815591f:23082086_23297230|GENSCAN_predicted_CDS_7|1347_bp atggaatttgtttcgaatattgatacaggatttcatgatgtgctgggcaatgaaagacct tctgcttacatgagggagcacaatcaattaaatggctggtcttctgatgaaaatgactgg aatgaaaaactctacccagtgtggaagcggggagacatgaggtggaaaaactcctggaag ggaggccgtgtgcaggcggtcctgaccagtgactcaccagccctcgtgggctcaaatata acatttgcggtgaacctgatattccctagatgccaaaaggaagatgccaatggcaacata gtctatgagaagaactgcagaaatggtcagtatttccagaaattgggacgatgttcagtg agagtttctgtgaacacagccaatgtgacacttgggcctcaactcatggaagtgactgtc tacagaagacatggacgggcatatgttcccatcgcacaagtgaaagatgtgtacgtggta acagatcagattcctgtgtttgtgactatgttccagaagaacgatcgaaattcatccgac gaaaccttcctcaaagatctccccattatgtttgatgtcctgattcatgatcctagccac ttcctcaattattctaccattaactacaagtggagcttcggggataatactggcctgttt gtttccaccaatcatactgtgaatcacacgtatgtgctcaatggaaccttcagccttaac ctcactgtgaaagctgcagcaccaggaccttgtccgccaccgccaccaccacccagacct tcaaaacccaccccttctttagcaactactctaaaatcttatgattcaaacaccccagga cctgctggtgacaaccccctggagctgagtaggattcctgatgaaaactgccagattaac agatatggccactttcaagccaccatcacaattgtagagggaatcttagaggttaacatc atccagatgacagacgtcctgatgccggtgccatggcctgaaagctccctaatagacttt gtcgtgacctgccaagggagcattcccacggaggtctgtaccatcatttctgaccccacc tgcgagatcacccagaacacagtctgcagccctgtggatgtggatgagatgtgtctgctg actgtgagacgaaccttcaatgggtctgggacgtactgtgtgaacctcaccctgggggat gacacaagcctggctctcacgagcaccctgatttctgttcctgacagagacccagcctcg cctttaaggatggcaaacagtgccctgatctccgttggctgcttggccatatttgtcact gtgatctccctcttggtgtacaagtaa >gi568815591f:23082086_23297230|GENSCAN_predicted_peptide_8|190_aa MEDRGAAQEEGLEGALFRARAYAHIVSSAFTAVVCSEAVPLNTHIGPLDYWALSQSQTVF PTASEIGIGVPSCHPDVGALQGKEGFRAKLKSSYERRHLVFTEVALPNRHYSQPWRPHRR VWIFPVGNYSGSFGTILLKTLKDFPPPKAPSECGHHHPWMTTFSSALTLSLWFEVATHSG CNECIERIAL >gi568815591f:23082086_23297230|GENSCAN_predicted_CDS_8|573_bp atggaggacagaggagcagctcaggaagagggactagaaggggctttgtttagggctcgt gcttatgctcacattgtttcctcagctttcacagccgttgtttgttctgaagctgtgcct ctgaacactcacattggccccttggattactgggccctgtcacagagccagactgttttt cccacagcctcagaaattggtattggggtcccctcttgtcaccccgatgttggcgcatta caaggtaaggaaggatttagggcaaaacttaaatcctcatatgaacgaaggcacttggta tttacagaggtcgcacttcccaaccgccactacagccagccatggaggccacaccgaaga gtttggattttccctgtaggtaactattctggcagctttggaaccattcttttaaaaacc ttaaaggattttcccccaccaaaagccccatcagaatgtggtcaccatcacccctggatg accaccttctcatctgcccttacactgtcactctggtttgaagttgctacacattctgga tgtaatgagtgcattgaacgtattgccctctaa